
ML Infra Intern: GPU Kernel Optimization & LLM Profiling

$19 - $65 per hour

A Medium Corporation

PlusAI is seeking a Machine Learning Infrastructure Engineer Intern to advance its AI-based virtual driver software. The role involves identifying bottlenecks in BEV model training and implementing high-performance custom kernels in CUDA or C++. Interns will also explore using LLMs for code generation to optimize processes. The position pays $19 to $65 per hour, depending on experience and education level. Join PlusAI for hands-on work in an innovative, dynamic field with opportunities for personal and professional growth.

Vacancy posted 13 hours ago
Similar jobs that could be interesting for you
Based on the ML Infra Intern: GPU Kernel Optimization & LLM Profiling vacancy in Santa Clara, CA
  • $19 - $65 per hour

     ...a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will...  ...internship also explores the use of LLMs to optimize code generation and performance profiling. The hourly pay ranges from $19 to $65 based... 
    Internship
    Hourly pay

    PlusAI, Inc.

    Santa Clara, CA
    13 hours ago
  • $207k - $300k

    Software Engineer, GDC LLM Serving and GPU Performance Google...  ...sequential decision making), ML infrastructure, or...  .... You could be optimizing KV cache transfer mechanisms...  ...down to performance profiling, ensuring Google’s...  ...GPU libraries and kernels. Collaborate with research... 
    Suggested
    Full time

    Google Inc.

    Sunnyvale, CA
    12 hours ago
  •  ...Stack: Establish best practices and optimize performance from the lowest‑level GPU kernels to large‑scale distributed...  ...design. Deep experience using GPU profiling and performance analysis tools (e...  ...plus. Relevant publications in AI/ML, GPU computing, or system optimization... 
    Suggested

    AMD

    Santa Clara, CA
    4 days ago
  • $176k - $420k

     ...Expect The Performance Optimization team takes research...  ...development, kernel optimization, and hardware...  ..., optimize, and profile highly performant...  ...to distributed LLM inference Work with...  ...Understanding of computer and GPU architecture, SIMD,...  ..., hardware, and ML teams Degree in... 
    Suggested
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    5 days ago
  • $272k - $431.25k

     ...We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA to join our Hardware Infrastructure team. As an Engineer, you...  ...and long‑term roadmaps for such initiatives. Monitor and optimize the performance of our infrastructure ensuring high availability... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...Advanced Micro Devices in Santa Clara seeks a Senior ML Engineer focused on optimizing large language model inference runtimes. The role involves architecting distributed systems and enhancing performance across GPUs. Ideal candidates will have expertise in Python and... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    1 day ago
  •  ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this...  ...efficiency by addressing infrastructure deficiencies for GPU Clusters, fostering innovations in AI/ML research. The ideal candidate... 

    NVIDIA Gruppe

    Santa Clara, CA
    13 hours ago
  • $272k - $431.25k

     ...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing improvements... 

    NVIDIA

    Santa Clara, CA
    13 hours ago
  • $168k - $258.75k

     ...coding agents synthesize, optimize, and deploy GPU kernels automatically. This job...  ..., you will act as the internal champion for AI agents and LLM-based coding workflows...  ...agents with compilers, profilers, execution sandboxes,...  ...platform products in AI, ML infrastructure, or high... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  •  ...Senior Staff AI Infra Engineer who is passionate...  ...focus on AI/ML workloads and GPU-accelerated...  ...and software to optimize performance for next...  ...Optimize and accelerate LLM training and...  ...GPUs, improving kernel, communication, and...  ...Experience with profiling and performance-... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

     ...libraries, code generators, and GPU kernel technologies for NVIDIA's...  ...attention kernel implementations, new LLM inference runtimes components,...  ...Designing, implementing, and optimizing kernels for high impact AI...  ...academic/ industry) experience with ML/DL systems development... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...seniorities. As an Acceleration Kernel Developer at Tenstorrent, you will play a crucial role in optimizing low-level workloads, kernel...  ...and pipelines. Performance Profiling: Identify performance...  ...parallel algorithms on CPU, or GPU acceleration. High degree of... 
    Internship
    Permanent employment

    Tenstorrent

    Santa Clara, CA
    4 days ago
  •  ...Training Performance Optimization ENGINEER THE ROLE: We...  ...EXPERIENCE: Experience with ML/DL frameworks such as...  ...plus. Experience with GPU kernel optimization is a plus...  ...including debugging, profiling, and performance...  ...scale. Experience with ML infra at kernel, framework,... 

    Advanced Micro Devices , Inc.

    San Jose, CA
    13 hours ago
  • $136k - $218.5k

    Senior Power Analysis and Optimization Engineer, AI-LLM Systems...  ...and unit‑level power using internal and industry‑standard RTL...  ...and flows, including ML/RL‑based techniques for anomaly...  ...and help shape the energy profile of NVIDIA’s future products... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  •  ...As a senior member of the LLM inference framework team, you...  ...responsible for building and optimizing production-grade single-node...  ...engines, distributed systems, and GPU runtime and kernel backends. THE...  ...You are a systems-minded ML engineer who thinks in terms... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    2 days ago
  • $152k - $287.5k

     ...accelerate the development of machine learning innovations. In this role, you'll design and implement solutions for GPU clusters, enabling researchers to optimize their work. Strong expertise in software engineering and languages like Python or C++ is required. The ideal... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

     ...Learning Software Engineer, LLM Performance page is...  ...enable the performance optimization, deployment and serving...  ...in developing GPU-accelerated Deep learning...  ...SGLang, Triton and CUDA kernels. Work and collaborate with...  ...performance modeling, profiling, debug, and code optimization... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $19 - $65 per hour

     ...s Scania, MAN, and International brands, Hyundai Motor...  ...performance custom kernels (using CUDA, Triton...  ...generation, kernel optimization, and automated performance profiling with Nsight and...  ...by both human and LLM-assisted workflows to maximize GPU utilization and reduce... 
    Internship
    Hourly pay

    PlusAI

    Santa Clara, CA
    4 days ago
  • $152k - $218.5k

     ...now looking for a Senior Kernel Performance Architect...  ...will be doing: Craft GPU-accelerated system...  ...Analyze, visualize, and optimize software performance using...  ...performance issues. AI/ML training and inference...  ...analysis and profiling to identify performance... 
    Work experience placement

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $40 - $70 per hour

     ...Location Toronto Employment Type Intern Location Type Hybrid Department R&D - SW Kernels & Workloads Compensation CA$40 – CA...  ...e.g., LLMs, CNNs) Experience with ML frameworks such as PyTorch and ML...  ...understanding of computer architecture (CPU, GPU, custom ASICs, etc.) Currently... 
    Internship
    Hourly pay

    MixMode

    Santa Clara, CA
    13 hours ago
  • $184k - $287.5k

     ...state‑of‑the‑art model optimization techniques—speculative...  ...and efficient attention kernels optimized for KV‑caching...  ..., layer‑by‑layer model profiling to identify compute and...  ...with modern LLM/VLM inference stacks, such...  ...Strong understanding of GPU architecture, the compilation... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...GenAI Inference Optimization Engineer to...  ...workloads on AMD GPU platforms. You...  ...multiple layers—from kernels and runtimes to...  ...efficiency for LLM and multimodal...  ...and use profiling, benchmarking,...  ...Contribute to internal tools and, where...  ...Experience with ML frameworks (PyTorch... 

    Advanced Micro Devices , Inc.

    San Jose, CA
    13 hours ago
  • $184k - $287.5k

     ...Architect to assist customers in building AI/ML and HPC software solutions at scale. As a...  ...aspects related to tasks like large scale LLM training and inference. Conducting regular...  ...diagnostics. Hands-on experience with GPU systems in general including but not limited... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...effortlessly run large-scale ML applications, without...  ...10 times faster than GPU-based hyperscale cloud...  ...to join our on‑field Kernel Reliability team. You’ll...  ...inference, training, and internal production services. In...  ..., tracing, sanitizers, profilers, etc.). Familiarity... 
    Internship

    Dormont Manufacturing Company

    Sunnyvale, CA
    13 hours ago
  • $124k - $195.5k

     ...Deep Learning Kernel Software Performance Architect...  ...computing. An era in which our GPU acts as the brains of...  ...issues* Engage AI/ML training and inference...  ...teams to identify and optimize critical deep learning...  ...performance analysis and profiling to identify performance... 
    Work experience placement

    NVIDIA

    Santa Clara, CA
    12 hours ago
  • $272k - $431.25k

     ...Dynamo orchestrates GPU shards, routes...  ...deployment of cutting-edge LLM workloads. We...  ...the team in internal reviews and external...  ...performance storage, or ML systems infrastructure...  ...especially designs optimized for low latency and...  ...~ Strong skills in profiling and optimizing... 
    Local area
    Remote work

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...for inference applications including deep learning framework optimizations and GPU kernel technologies. You will closely collaborate with other...  ...in Computer Science, strong programming skills in C/C++, and significant experience with ML frameworks... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $130k - $170k

     ...is hiring a Platform Engineer in Santa Clara, California, to lead the design and operation of scalable infrastructure supporting AI/LLM-based solutions. The ideal candidate will have over 5 years of experience in Platform Engineering. Your role involves managing Kubernetes... 

    NTT DATA

    Santa Clara, CA
    13 hours ago
  •  ...Technical Staff, Machine Learning Kernels to design, optimize, and benchmark high-...  ...optimize high-performance ML kernels, primarily...  ...and memory efficiency. Profile, benchmark, and analyze performance...  ...accelerators. Advise internal teams on GPU and accelerator... 
    Visa sponsorship
    Relocation package

    Netpreme

    Santa Clara, CA
    2 days ago
  •  ...work spans low-level kernel performance debugging and optimization, system-level...  ...the art and customer ML models. Optimize and...  ...level deep learning / LLM math. Strong analytical...  ...Computer Architecture, CPU/GPU Performance, Kernel...  ...to performance profiling and debug on any... 

    Dormont Manufacturing Company

    Sunnyvale, CA
    13 hours ago
