ML Infra Intern: GPU Kernel Optimization & LLM Profiling
$19 - $65 per hourA Medium Corporation
PlusAI is seeking a Machine Learning Infrastructure Engineer Intern to advance their AI-based virtual driver software. The role involves identifying bottlenecks in BEV model training and implementing high-performance custom kernels using CUDA or C++. Interns will explore using LLMs for code generation to optimize processes. The position offers competitive hourly pay ranging from $19 to $65, based on experience and education level. Join PlusAI for hands-on work in an innovative, dynamic field with opportunities for personal and professional growth. #J-18808-Ljbffr
$19 - $65 per hour
...a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will... ...internship also explores the use of LLMs to optimize code generation and performance profiling. The hourly pay ranges from $19 to $65 based...InternshipHourly pay$207k - $300k
Software Engineer, GDC LLM Serving and GPU Performance Google... ...sequential decision making), ML infrastructure, or... .... You could be optimizing KV cache transfer mechanisms... ...down to performance profiling, ensuring Google’s... ...GPU libraries and kernels. Collaborate with research...SuggestedFull time- ...Stack: Establish best practices and optimize performance from the lowest‑level GPU kernels to large‑scale distributed... ...design. Deep experience using GPU profiling and performance analysis tools (e... ...plus. Relevant publications in AI/ML, GPU computing, or system optimization...Suggested
$176k - $420k
...Expect The Performance Optimization team takes research... ...development, kernel optimization, and hardware... ..., optimize, and profile highly performant... ...to distributed LLM inference Work with... ...Understanding of computer and GPU architecture, SIMD,... ..., hardware, and ML teams Degree in...SuggestedHourly payFull timeTemporary workFlexible hours$272k - $431.25k
...We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA to join our Hardware Infrastructure team. As an Engineer, you... ...and long‑term roadmaps for such initiatives. Monitor and optimize the performance of our infrastructure ensuring high availability...Suggested- ...Advanced Micro Devices in Santa Clara seeks a Senior ML Engineer focused on optimizing large language model inference runtimes. The role involves architecting distributed systems and enhancing performance across GPUs. Ideal candidates will have expertise in Python and...
- ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this... ...efficiency by addressing infrastructure deficiencies for GPU Clusters, fostering innovations in AI/ML research. The ideal candidate...
$272k - $431.25k
...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing improvements...$168k - $258.75k
...coding agents synthesize, optimize, and deploy GPU kernels automatically. This job... ..., you will act as the internal champion for AI agents and LLM-based coding workflows... ...agents with compilers, profilers, execution sandboxes,... ...platform products in AI, ML infrastructure, or high...- ...Senior Staff AI Infra Engineer who is passionate... ...focus on AI/ML workloads and GPU-accelerated... ...and software to optimize performance for next... ...Optimize and accelerate LLM training and... ...GPUs, improving kernel, communication, and... ...Experience with profiling and performance-...
$184k - $287.5k
...libraries, code generators, and GPU kernel technologies for NVIDIA's... ...attention kernel implementations, new LLM inference runtimes components,... ...Designing, implementing, and optimizing kernels for high impact AI... ...academic/ industry) experience with ML/DL systems development...- ...seniorities. As an Acceleration Kernel Developer at Tenstorrent, you will play a crucial role in optimizing low-level workloads, kernel... ...and pipelines. Performance Profiling: Identify performance... ...parallel algorithms on CPU, or GPU acceleration. High degree of...InternshipPermanent employment
- ...Training Performance Optimization ENGINEER THE ROLE: We... ...EXPERIENCE: Experience with ML/DL frameworks such as... ...plus. Experience with GPU kernel optimization is a plus... ...including debugging, profiling, and performance... ...scale. Experience with ML infra at kernel, framework,...
$136k - $218.5k
Senior Power Analysis and Optimization Engineer, AI-LLM Systems page is loaded## Senior... ...and unit‑level power using internal and industry‑standard RTL... ...and flows, including ML/RL‑based techniques for anomaly... ...and help shape the energy profile of NVIDIA’s future products...- ...As a senior member of the LLM inference framework team, you... ...responsible for building and optimizing production-grade single-node... ...engines, distributed systems, and GPU runtime and kernel backends. THE... ...You are a systems-minded ML engineer who thinks in terms...
$152k - $287.5k
...accelerate the development of machine learning innovations. In this role, you'll design and implement solutions for GPU clusters, enabling researchers to optimize their work. Strong expertise in software engineering and languages like Python or C++ is required. The ideal...$184k - $287.5k
...Learning Software Engineer, LLM Performance page is... ...enable the performance optimization, deployment and serving... ...in developing GPU-accelerated Deep learning... ...SGLang, Triton and CUDA kernels. Work and collaborate with... ...performance modeling, profiling, debug, and code optimization...$19 - $65 per hour
...s Scania, MAN, and International brands, Hyundai Motor... ...performance custom kernels (using CUDA, Triton... ...generation, kernel optimization, and automated performance profiling with Nsight and... ...by both human and LLM-assisted workflows to maximize GPU utilization and reduce...InternshipHourly pay$152k - $218.5k
...now looking for a Senior Kernel Performance Architect... ...will be doing: Craft GPU-accelerated system... ...Analyze, visualize, and optimize software performance using... ...performance issues. AI/ML training and inference... ...analysis and profiling to identify performance...Work experience placement$40 - $70 per hour
...Location Toronto Employment Type Intern Location Type Hybrid Department R&D - SW Kernels & Workloads Compensation CA$40 – CA... ...e.g., LLMs, CNNs) Experience with ML frameworks such as PyTorch and ML... ...understanding of computer architecture (CPU, GPU, custom ASICs, etc.) Currently...InternshipHourly pay$184k - $287.5k
...state‑of‑the‑art model optimization techniques—speculative... ...and efficient attention kernels optimized for KV‑caching... ..., layer‑by‑layer model profiling to identify compute and... ...with modern LLM/VLM inference stacks, such... ...Strong understanding of GPU architecture, the compilation...- ...GenAI Inference Optimization Engineer to... ...workloads on AMD GPU platforms. You... ...multiple layers—from kernels and runtimes to... ...efficiency for LLM and multimodal... ...and use profiling, benchmarking,... ...Contribute to internal tools and, where... ...Experience with ML frameworks (PyTorch...
$184k - $287.5k
...Architect to assist customers in building AI/ML and HPC software solutions at scale. As a... ...aspects related to tasks like large scale LLM training and inference. Conducting regular... ...diagnostics. Hands-on experience with GPU systems in general including but not limited...- ...effortlessly run large-scale ML applications, without... ...10 times faster than GPU-based hyperscale cloud... ...to join our on‑field Kernel Reliability team. You’ll... ...inference, training, and internal production services. In... ..., tracing, sanitizers, profilers, etc.). Familiarity...Internship
$124k - $195.5k
...Deep Learning Kernel Software Performance Architect... ...computing. An era in which our GPU acts as the brains of... ...issues* Engage AI/ML training and inference... ...teams to identify and optimize critical deep learning... ...performance analysis and profiling to identify performance...Work experience placement$272k - $431.25k
...Dynamo orchestrates GPU shards, routes... ...deployment of cutting-edge LLM workloads. We... ...the team in internal reviews and external... ...performance storage, or ML systems infrastructure... ...especially designs optimized for low latency and... ...~ Strong skills in profiling and optimizing...Local areaRemote work- ...for inference applications including deep learning framework optimizations and GPU kernel technologies. You will closely collaborate with other... ...in Computer Science, strong programming skills in C/C++, and significant experience with ML frameworks. #J-18808-Ljbffr...
$130k - $170k
...is hiring a Platform Engineer in Santa Clara, California, to lead the design and operation of scalable infrastructure supporting AI/LLM-based solutions. The ideal candidate will have over 5 years of experience in Platform Engineering. Your role involves managing Kubernetes...- ...Technical Staff, Machine Learning Kernels to design, optimize, and benchmark high-... ...optimize high-performance ML kernels, primarily... ...and memory efficiency. Profile, benchmark, and analyze performance... ...accelerators. Advise internal teams on GPU and accelerator...Visa sponsorshipRelocation package
- ...work spans low-level kernel performance debugging and optimization, system-level... ...the art and customer ML models. Optimize and... ...level deep learning / LLM math. Strong analytical... ...Computer Architecture, CPU/GPU Performance, Kernel... ...to performance profiling and debug on any...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Infra Intern: GPU Kernel Optimization & LLM Profiling. Be the first to apply!
- machine learning intern Santa Clara, CA
- machine learning researcher Santa Clara, CA
- machine learning Santa Clara, CA
- artificial intelligence - machine learning intern Santa Clara, CA
- machine learning research scientist Santa Clara, CA
- data engineer machine learning Santa Clara, CA
- machine learning scientist Santa Clara, CA
- internship machine learning Santa Clara, CA
- machine learning remote Santa Clara, CA
- amd machine learning

