ML Compiler Intern: Optimize Latency & ML Models

TETRAMEM INC

A technology company in California is looking for a Compiler Engineer Intern to support the Compiler Software team in developing a compiler and runtime for in-memory machine learning accelerators. Responsibilities include exploring compiler development techniques and engaging with QA teams. Requirements include a Bachelor's in Computer Science/Engineering and programming skills in C++, C, or Rust. The role offers a valuable opportunity to gain experience in AI network architectures and compiler optimizations. #J-18808-Ljbffr

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the ML Compiler Intern: Optimize Latency & ML Models in San Jose, CA vacancy

US 2026 Software - Compiler Engineer Intern
...US 2026 Software - Compiler Engineer Intern We are seeking a Compiler Engineer intern to support the Compiler Software... ...and support the development of compiler optimizations to improve latency/bandwidth of our ML models Engage with QA teams Requirements Bachelor's degree...
Internship
TETRAMEM INC
San Jose, CA
1 day ago
Edge ML Software Engineer (Model Optimization-PICO) - San Jose
$212.8k
...Responsibilities: - Convert and compile ML models for execution on edge NPUs,... ...- Apply hardware-aware optimization strategies, such as... ...and operator fusion, to meet latency, memory and power targets.... ...unsupervised contact with internal/external clients and/or colleagues...
Suggested
Temporary work
Local area
ByteDance
San Jose, CA
4 days ago
Machine Learning Engineer - AI Compiler Optimization
$156k - $387.6k
...Machine Learning Engineer - AI Compiler Optimization Location: San Jose... ...specifically for recommendation model scenarios, including but not... ...compilation optimization and low-latency inference optimization for... ...unsupervised contact with internal/external clients and/or...
Suggested
Temporary work
Local area
ByteDance
San Jose, CA
3 days ago
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles
$184k - $287.5k
Responsibilities Develop state‑of‑the‑art model optimization techniques—speculative decoding with... ...of GPU architecture, the compilation stack, and the ability to debug end‑to... ...hardware‑in‑the‑loop testing, and ultra‑low latency requirements. Benefits Base salary range...
Suggested
NVIDIA Gruppe
Santa Clara, CA
20 hours ago
Software Intern - AI Compilers
$50 - $70 per hour
...Software Intern - AI Compilers Santa Clara, California, United States Tenstorrent... ...innovations in software models, compilers, platforms,... ...Benchmark, analyze, and optimize performance of key machine learning... ...Develop novel ML models and primitives that take...
Internship
Tenstorrent University
Santa Clara, CA
4 days ago
ML Infra Intern: GPU Kernel Optimization & LLM Profiling
$19 - $65 per hour
...Machine Learning Infrastructure Engineer Intern to advance their AI-based virtual driver... ...involves identifying bottlenecks in BEV model training and implementing high-performance... ...explore using LLMs for code generation to optimize processes. The position offers competitive...
Internship
Hourly pay
Medium
Santa Clara, CA
1 day ago
ML Infra Engineer Intern: Optimize BEV Training on GPUs
$19 - $65 per hour
...seeking a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will analyze training bottlenecks... .... This internship also explores the use of LLMs to optimize code generation and performance profiling. The...
Internship
Hourly pay
PlusAI, Inc.
Santa Clara, CA
1 day ago
Senior ML Engineer - Model Compression
$128.7k - $261.3k
...repeatable, high-velocity model deployments through... ...numerically robust, low-latency models to the car, blending... ...our deployment, compiler, and kernel teams Advancing... ..., Data Science / ML, or a closely related quantitative... ...focused onmodel optimization and deployment, with...
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
ML Engineer Intern - Build Scalable Ads Models (Summer)
...Ellis Technologies, Inc. is seeking an enthusiastic intern in San Jose to build scalable machine learning... ...and enhance the quality of ads through advanced models. You will work alongside various disciplines to optimize ad strategies and contribute to innovative shopping...
Internship
Hourly pay
Summer work
Ellis Technologies, Inc.
San Jose, CA
1 day ago
TPU ML Compiler Engineer — Optimize Large-Scale Workloads
$147k - $211k
Google Inc. is seeking a skilled ML Compiler Software Engineer for its Sunnyvale office. The position requires a Bachelor's degree, proficiency... .... In this role, you will focus on developing compiler optimizations for Tensor Processing Units (TPUs), enhancing parallelization...
Full time
Work at office
Google Inc.
Sunnyvale, CA
2 days ago
Senior ML Scientist, Conversion Modeling & Ad Optimization
$148.75k - $361k
...techniques. The role involves significant applied research to tackle complex advertising challenges related to conversion modeling and optimization. The position offers a competitive salary between $148,750 - $361,000 annually, along with excellent benefits including health...
Jobr
San Jose, CA
1 day ago
Senior ML Scientist: Conversion Modeling & Ad Optimization
...You will focus on developing innovative methods for conversion modeling and enhance ad performance through generative models. The ideal... ...conduct original research, drive A/B testing initiatives, and optimize outcomes through advanced AI technologies. This role is essential...
Experimentation Jobs
San Jose, CA
1 day ago
Senior ML Performance Engineer: Scale & Optimize AI Models
$174k - $252k
...Senior Machine Learning Engineer in Sunnyvale, CA, to improve AI model performance and efficiency. Candidates should possess a... ...experience in software development, testing, and performance optimization. Responsibilities include engaging with product teams to resolve...
Google Inc.
Sunnyvale, CA
2 days ago
Senior ML Systems Engineer: Compiler & Performance
...effortlessly run large-scale ML applications, without the... ...current customers include top model labs, global enterprises, and... ..., graph lowering, compiler optimizations, runtime integration, and performance... ...and familiarity with model internals (e.g., attention, MoE, diffusion...
Internship
Dormont Manufacturing Co
Sunnyvale, CA
11 hours ago
Principal GenAI Inference Optimization Engineer
...Principal GenAI Inference Optimization Engineer to join our Models and Applications... ...to optimizing latency, throughput, and cost... ...Collaborate with hardware, compiler, and framework teams... .... Contribute to internal tools and, where applicable... .... Experience with ML frameworks (PyTorch,...
Advanced Micro Devices , Inc.
San Jose, CA
1 day ago
Staff ML Compiler Engineer
$185.1k - $335.3k
...the AI Kernels & Compilers team, that mission... ...new approaches to model export, kernel development... ...and CUDA/TensorRT internals, and mentoring... ...them into highly optimized inference... ...and effortless for ML engineers across the... ...fidelity, and on-vehicle latency. Along the way,...
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
Staff Machine Learning Engineer - AI/ML Compiler
$160.5k - $240.7k
...developers to easily integrate, optimize, and deploy ML models on Qualcomm devices.... ...Workbench lets developers compile trained PyTorch or ONNX models... ...grows Partner with internal Business Units to onboard... ...constraints (target SoC, latency budgets, memory limits) into...
Qualcomm
Santa Clara, CA
1 day ago
ML Engineer - Inference & Model Deployment
...are looking for a founding ML engineer who can help us turn powerful AI and ML models into fast, reliable production... ...: deploying models, optimizing inference latency and throughput, scaling serving... ...such as quantization, pruning, compilation, batching, caching, and memory...
Full time
Relocation package
HiringCafe
Cupertino, CA
4 days ago
Energy Systems Research Engineer Intern: Market Modeling & ML
...in Cupertino, California, is seeking an Intern to support electricity market simulation... ...analysis, build and validate SCUC/SCED models, and support data processing for various... ...machine learning methods and contribute to optimizing workflows. #J-18808-Ljbffr Gridmatic
Internship
Gridmatic
Cupertino, CA
20 hours ago
Embedded AI/ML Intern: Train Models on Microcontrollers
$31.25 - $41.5 per hour
...Renesas Electronics is seeking a Summer 2026 Intern in San Jose, CA, for a role focused on developing machine learning models. This hybrid position requires full-time enrollment in a master’s program and an interest in embedded systems. You’ll build and evaluate AI models...
Internship
Hourly pay
Full time
Summer internship
Renesas
San Jose, CA
1 day ago
Embedded AI/ML Intern Edge Models on Renesas RA6/RA8
$31.25 - $41.5 per hour
...Renesas Electronics is offering a Summer 2026 Internship based in San Jose, CA, focusing on the development of machine learning models. Interns will participate in building, training, and evaluating AI models, gaining hands-on experience with industry tools and Renesas...
Internship
Hourly pay
Summer internship
Renesas
San Jose, CA
1 day ago
Compiler Engineer (Mid and/or Backend)
...Persimmons Ai Compiler Developer Persimmons is building... ...-based large language models by applying... ...You will help design and optimize the Persimmons Compiler... ...utilization, throughput, and low latency. Contribute to graph... ...flows from high-level ML representations to low-...
Persimmons, Inc.
San Jose, CA
4 days ago
Fellow, AI Software (Workload Optimization)
...Fellow, AI Software (Workload Optimization) We are looking for a... ...software stack—from ROCm and compilers to high-level AI frameworks—is... ...the ability to map emerging model architectures to low-level software... ...or applied experience in AI/ML, including areas such as deep...
Advanced Micro Devices , Inc.
San Jose, CA
2 days ago
AI Models, Product Manager
...effortlessly run large-scale ML applications, without... ...customers include top model labs, global... ...-removing blockers and optimizing for their specific use... ...tradeoffs between quality, latency, throughput, and cost... ...Understanding of model compilers and optimization. # Contributor...
Work experience placement
Work at office
Remote work
Shift work
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
3 days ago
Physics & AI Modeling Intern — Multiphysics & ML
A leading materials engineering company seeks a 2026 Summer Intern in Physics & AI Modeling Engineering. This role is focused on advanced modeling projects utilizing physics and AI workflows. Candidates should have a Master's or PhD in a relevant field and familiarity with...
Internship
Full time
Summer internship
Applied Materials, Inc.
Santa Clara, CA
4 days ago
Senior Machine Learning Applications and Compiler Engineer, LPX
$152k - $241.5k
...Machine Learning Applications and Compiler Engineer! NVIDIA is seeking... ...to develop algorithms and optimizations for our LPX inference and compiler... ...seamless deployment of models across platforms. Benchmark,... ...spatial accelerators at top tier ML, compiler, and computer...
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
$229.9k - $262.4k
...Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)Applylocations... ..., our applications of AI & ML are bringing humanity and... ...of customers. Our AI models and platforms empower teams across... ...performance — scalability, cost, latency, throughput — of large scale...
Full time
Part time
Local area
Capital One
San Jose, CA
1 day ago
CoreML On-Device ML Graph Compiler Engineer
...leading technology company in Cupertino seeks an On-Device ML Infrastructure Engineer focused on developing advanced ML models for Apple's hardware. The role involves building graph compilers and runtimes, optimizing model execution, and contributing to the AI ecosystem....
Apple
Cupertino, CA
1 day ago
Principal Compiler Engineer - ML Systems
$210k - $280k
...the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by... ...most valuable assets. About The Role The Principal Compiler Engineer - ML Systems position will be responsible for working with...
Full time
Temporary work
Local area
Flexible hours
SambaNova Systems
San Jose, CA
2 days ago
Machine Learning Engineering Manager, Proactive - On-Device Modeling
$198.3k - $342.8k
...Manager, Proactive - On-Device Modeling Santa Clara, California,... ...their devices. As an Applied ML team, we're pushing the boundaries... ..., all while running a low latency production service and keeping... ...attention mechanisms, and inference optimization Strong software engineering...
Work experience placement
Relocation
Apple
Santa Clara, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Compiler Intern: Optimize Latency & ML Models. Be the first to apply!