ML Infra Intern: GPU Kernel Optimization & LLM Profiling

$19 - $65 per hour

A Medium Corporation

PlusAI is seeking a Machine Learning Infrastructure Engineer Intern to advance their AI-based virtual driver software. The role involves identifying bottlenecks in BEV model training and implementing high-performance custom kernels using CUDA or C++. Interns will explore using LLMs for code generation to optimize processes. The position offers competitive hourly pay ranging from $19 to $65, based on experience and education level. Join PlusAI for hands-on work in an innovative, dynamic field with opportunities for personal and professional growth. #J-18808-Ljbffr

Apply

Vacancy posted 13 hours ago

Similar jobs that could be interesting for youBased on the ML Infra Intern: GPU Kernel Optimization & LLM Profiling in Santa Clara, CA vacancy

ML Infra Engineer Intern: Optimize BEV Training on GPUs
$19 - $65 per hour
...a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will... ...internship also explores the use of LLMs to optimize code generation and performance profiling. The hourly pay ranges from $19 to $65 based...
Internship
Hourly pay
PlusAI, Inc.
Santa Clara, CA
13 hours ago
Software Engineer, GDC LLM Serving and GPU Performance
$207k - $300k
Software Engineer, GDC LLM Serving and GPU Performance Google... ...sequential decision making), ML infrastructure, or... .... You could be optimizing KV cache transfer mechanisms... ...down to performance profiling, ensuring Google’s... ...GPU libraries and kernels. Collaborate with research...
Suggested
Full time
Google Inc.
Sunnyvale, CA
12 hours ago
Staff Software Development Engineer- GPU, LLM, AI
...Stack: Establish best practices and optimize performance from the lowest‑level GPU kernels to large‑scale distributed... ...design. Deep experience using GPU profiling and performance analysis tools (e... ...plus. Relevant publications in AI/ML, GPU computing, or system optimization...
Suggested
AMD
Santa Clara, CA
4 days ago
Kernel Optimization Software Engineer, AI Hardware
$176k - $420k
...Expect The Performance Optimization team takes research... ...development, kernel optimization, and hardware... ..., optimize, and profile highly performant... ...to distributed LLM inference Work with... ...Understanding of computer and GPU architecture, SIMD,... ..., hardware, and ML teams Degree in...
Suggested
Hourly pay
Full time
Temporary work
Flexible hours
Tesla
Palo Alto, CA
5 days ago
Principal AI and ML Infra Software Engineer, GPU Clusters
$272k - $431.25k
...We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA to join our Hardware Infrastructure team. As an Engineer, you... ...and long‑term roadmaps for such initiatives. Monitor and optimize the performance of our infrastructure ensuring high availability...
Suggested
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior LLM Inference Kernel Engineer (Distributed GPU)
...Advanced Micro Devices in Santa Clara seeks a Senior ML Engineer focused on optimizing large language model inference runtimes. The role involves architecting distributed systems and enhancing performance across GPUs. Ideal candidates will have expertise in Python and...
Advanced Micro Devices , Inc.
Santa Clara, CA
1 day ago
Principal AI/ML Infra Engineer for GPU Clusters
...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this... ...efficiency by addressing infrastructure deficiencies for GPU Clusters, fostering innovations in AI/ML research. The ideal candidate...
NVIDIA Gruppe
Santa Clara, CA
13 hours ago
Principal AI/ML Infra Engineer GPU Clusters & HPC
$272k - $431.25k
...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing improvements...
NVIDIA
Santa Clara, CA
13 hours ago
Product Manager, AI Platform SW - Agentic AI Kernel Generation
$168k - $258.75k
...coding agents synthesize, optimize, and deploy GPU kernels automatically. This job... ..., you will act as the internal champion for AI agents and LLM-based coding workflows... ...agents with compilers, profilers, execution sandboxes,... ...platform products in AI, ML infrastructure, or high...
NVIDIA
Santa Clara, CA
4 days ago
Principal AI Inference Systems Engineer
...Senior Staff AI Infra Engineer who is passionate... ...focus on AI/ML workloads and GPU-accelerated... ...and software to optimize performance for next... ...Optimize and accelerate LLM training and... ...GPUs, improving kernel, communication, and... ...Experience with profiling and performance-...
Advanced Micro Devices , Inc.
Santa Clara, CA
5 days ago
Senior AI Software Engineer, Kernel Libraries
$184k - $287.5k
...libraries, code generators, and GPU kernel technologies for NVIDIA's... ...attention kernel implementations, new LLM inference runtimes components,... ...Designing, implementing, and optimizing kernels for high impact AI... ...academic/ industry) experience with ML/DL systems development...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Acceleration Kernel Developer Intern
...seniorities. As an Acceleration Kernel Developer at Tenstorrent, you will play a crucial role in optimizing low-level workloads, kernel... ...and pipelines. Performance Profiling: Identify performance... ...parallel algorithms on CPU, or GPU acceleration. High degree of...
Internship
Permanent employment
Tenstorrent
Santa Clara, CA
4 days ago
Principal ML Engineer - Large Scale Training Performance Optimization
...Training Performance Optimization ENGINEER THE ROLE: We... ...EXPERIENCE: Experience with ML/DL frameworks such as... ...plus. Experience with GPU kernel optimization is a plus... ...including debugging, profiling, and performance... ...scale. Experience with ML infra at kernel, framework,...
Advanced Micro Devices , Inc.
San Jose, CA
13 hours ago
Senior Power Analysis and Optimization Engineer, AI-LLM Systems
$136k - $218.5k
Senior Power Analysis and Optimization Engineer, AI-LLM Systems page is loaded## Senior... ...and unit‑level power using internal and industry‑standard RTL... ...and flows, including ML/RL‑based techniques for anomaly... ...and help shape the energy profile of NVIDIA’s future products...
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior Software Development Engineer - LLM Inference Framework
...As a senior member of the LLM inference framework team, you... ...responsible for building and optimizing production-grade single-node... ...engines, distributed systems, and GPU runtime and kernel backends. THE... ...You are a systems-minded ML engineer who thinks in terms...
Advanced Micro Devices , Inc.
Santa Clara, CA
2 days ago
Senior ML Infra Engineer - GPU Clusters, Reliability & Ops
$152k - $287.5k
...accelerate the development of machine learning innovations. In this role, you'll design and implement solutions for GPU clusters, enabling researchers to optimize their work. Strong expertise in software engineering and languages like Python or C++ is required. The ideal...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Deep Learning Software Engineer, LLM Performance
$184k - $287.5k
...Learning Software Engineer, LLM Performance page is... ...enable the performance optimization, deployment and serving... ...in developing GPU-accelerated Deep learning... ...SGLang, Triton and CUDA kernels. Work and collaborate with... ...performance modeling, profiling, debug, and code optimization...
NVIDIA Corporation
Santa Clara, CA
2 days ago
Machine Learning Infrastructure Engineer Intern
$19 - $65 per hour
...s Scania, MAN, and International brands, Hyundai Motor... ...performance custom kernels (using CUDA, Triton... ...generation, kernel optimization, and automated performance profiling with Nsight and... ...by both human and LLM-assisted workflows to maximize GPU utilization and reduce...
Internship
Hourly pay
PlusAI
Santa Clara, CA
4 days ago
Senior Deep Learning Kernel Software Performance Architect
$152k - $218.5k
...now looking for a Senior Kernel Performance Architect... ...will be doing: Craft GPU-accelerated system... ...Analyze, visualize, and optimize software performance using... ...performance issues. AI/ML training and inference... ...analysis and profiling to identify performance...
Work experience placement
NVIDIA
Santa Clara, CA
5 days ago
Software Engineering Intern - Kernels
$40 - $70 per hour
...Location Toronto Employment Type Intern Location Type Hybrid Department R&D - SW Kernels & Workloads Compensation CA$40 – CA... ...e.g., LLMs, CNNs) Experience with ML frameworks such as PyTorch and ML... ...understanding of computer architecture (CPU, GPU, custom ASICs, etc.) Currently...
Internship
Hourly pay
MixMode
Santa Clara, CA
13 hours ago
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
$184k - $287.5k
...state‑of‑the‑art model optimization techniques—speculative... ...and efficient attention kernels optimized for KV‑caching... ..., layer‑by‑layer model profiling to identify compute and... ...with modern LLM/VLM inference stacks, such... ...Strong understanding of GPU architecture, the compilation...
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Principal GenAI Inference Optimization Engineer
...GenAI Inference Optimization Engineer to... ...workloads on AMD GPU platforms. You... ...multiple layers—from kernels and runtimes to... ...efficiency for LLM and multimodal... ...and use profiling, benchmarking,... ...Contribute to internal tools and, where... ...Experience with ML frameworks (PyTorch...
Advanced Micro Devices , Inc.
San Jose, CA
13 hours ago
Senior Solutions Architect, GPU Performance and LLM - Cloud Service Providers
$184k - $287.5k
...Architect to assist customers in building AI/ML and HPC software solutions at scale. As a... ...aspects related to tasks like large scale LLM training and inference. Conducting regular... ...diagnostics. Hands-on experience with GPU systems in general including but not limited...
NVIDIA
Santa Clara, CA
1 day ago
Software Engineer, Kernel Reliability
...effortlessly run large-scale ML applications, without... ...10 times faster than GPU-based hyperscale cloud... ...to join our on‑field Kernel Reliability team. You’ll... ...inference, training, and internal production services. In... ..., tracing, sanitizers, profilers, etc.). Familiarity...
Internship
Dormont Manufacturing Company
Sunnyvale, CA
13 hours ago
Deep Learning Kernel Software Performance Architect - New College Grad 2026
$124k - $195.5k
...Deep Learning Kernel Software Performance Architect... ...computing. An era in which our GPU acts as the brains of... ...issues* Engage AI/ML training and inference... ...teams to identify and optimize critical deep learning... ...performance analysis and profiling to identify performance...
Work experience placement
NVIDIA
Santa Clara, CA
12 hours ago
Principal Software Engineer - Large-Scale LLM Memory and Storage Systems
$272k - $431.25k
...Dynamo orchestrates GPU shards, routes... ...deployment of cutting-edge LLM workloads. We... ...the team in internal reviews and external... ...performance storage, or ML systems infrastructure... ...especially designs optimized for low latency and... ...~ Strong skills in profiling and optimizing...
Local area
Remote work
NVIDIA
Santa Clara, CA
3 days ago
Senior AI Kernel & Inference Engineer
...for inference applications including deep learning framework optimizations and GPU kernel technologies. You will closely collaborate with other... ...in Computer Science, strong programming skills in C/C++, and significant experience with ML frameworks. #J-18808-Ljbffr...
NVIDIA
Santa Clara, CA
1 day ago
AI/LLM Platform Engineer Lead Kubernetes & GPU Infra
$130k - $170k
...is hiring a Platform Engineer in Santa Clara, California, to lead the design and operation of scalable infrastructure supporting AI/LLM-based solutions. The ideal candidate will have over 5 years of experience in Platform Engineering. Your role involves managing Kubernetes...
NTT DATA
Santa Clara, CA
13 hours ago
Member of Technical Staff, ML Kernels
...Technical Staff, Machine Learning Kernels to design, optimize, and benchmark high-... ...optimize high-performance ML kernels, primarily... ...and memory efficiency. Profile, benchmark, and analyze performance... ...accelerators. Advise internal teams on GPU and accelerator...
Visa sponsorship
Relocation package
Netpreme
Santa Clara, CA
2 days ago
ML Systems Performance Engineer
...work spans low-level kernel performance debugging and optimization, system-level... ...the art and customer ML models. Optimize and... ...level deep learning / LLM math. Strong analytical... ...Computer Architecture, CPU/GPU Performance, Kernel... ...to performance profiling and debug on any...
Dormont Manufacturing Company
Sunnyvale, CA
13 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Infra Intern: GPU Kernel Optimization & LLM Profiling. Be the first to apply!