ML Compiler Intern: Optimize Latency & ML Models
TETRAMEM INC
A technology company in California is looking for a Compiler Engineer Intern to support the Compiler Software team in developing a compiler and runtime for in-memory machine learning accelerators. Responsibilities include exploring compiler development techniques and engaging with QA teams. Requirements include a Bachelor's in Computer Science/Engineering and programming skills in C++, C, or Rust. The role offers a valuable opportunity to gain experience in AI network architectures and compiler optimizations.
- ...US 2026 Software - Compiler Engineer Intern We are seeking a Compiler Engineer intern to support the Compiler Software... ...and support the development of compiler optimizations to improve latency/bandwidth of our ML models Engage with QA teams Requirements Bachelor's degree...
Internship
$212.8k
...Responsibilities: - Convert and compile ML models for execution on edge NPUs,... ...- Apply hardware-aware optimization strategies, such as... ...and operator fusion, to meet latency, memory and power targets.... ...unsupervised contact with internal/external clients and/or colleagues...
Suggested, Temporary work, Local area
$156k - $387.6k
...Machine Learning Engineer - AI Compiler Optimization Location: San Jose... ...specifically for recommendation model scenarios, including but not... ...compilation optimization and low-latency inference optimization for... ...unsupervised contact with internal/external clients and/or...
Suggested, Temporary work, Local area
$184k - $287.5k
Responsibilities Develop state‑of‑the‑art model optimization techniques: speculative decoding with... ...of GPU architecture, the compilation stack, and the ability to debug end‑to... ...hardware‑in‑the‑loop testing, and ultra‑low latency requirements. Benefits Base salary range...
Suggested
$50 - $70 per hour
...Software Intern - AI Compilers Santa Clara, California, United States Tenstorrent... ...innovations in software models, compilers, platforms,... ...Benchmark, analyze, and optimize performance of key machine learning... ...Develop novel ML models and primitives that take...
Internship
$19 - $65 per hour
...Machine Learning Infrastructure Engineer Intern to advance their AI-based virtual driver... ...involves identifying bottlenecks in BEV model training and implementing high-performance... ...explore using LLMs for code generation to optimize processes. The position offers competitive...
Internship, Hourly pay
$19 - $65 per hour
...seeking a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will analyze training bottlenecks... .... This internship also explores the use of LLMs to optimize code generation and performance profiling. The...
Internship, Hourly pay
$128.7k - $261.3k
...repeatable, high-velocity model deployments through... ...numerically robust, low-latency models to the car, blending... ...our deployment, compiler, and kernel teams Advancing... ..., Data Science / ML, or a closely related quantitative... ...focused on model optimization and deployment, with...
Local area, Remote work, Work from home, Relocation package, Flexible hours
- ...Ellis Technologies, Inc. is seeking an enthusiastic intern in San Jose to build scalable machine learning... ...and enhance the quality of ads through advanced models. You will work alongside various disciplines to optimize ad strategies and contribute to innovative shopping...
Internship, Hourly pay, Summer work
$147k - $211k
Google Inc. is seeking a skilled ML Compiler Software Engineer for its Sunnyvale office. The position requires a Bachelor's degree, proficiency... .... In this role, you will focus on developing compiler optimizations for Tensor Processing Units (TPUs), enhancing parallelization...
Full time, Work at office
$148.75k - $361k
...techniques. The role involves significant applied research to tackle complex advertising challenges related to conversion modeling and optimization. The position offers a competitive salary between $148,750 - $361,000 annually, along with excellent benefits including health...
- ...You will focus on developing innovative methods for conversion modeling and enhance ad performance through generative models. The ideal... ...conduct original research, drive A/B testing initiatives, and optimize outcomes through advanced AI technologies. This role is essential...
$174k - $252k
...Senior Machine Learning Engineer in Sunnyvale, CA, to improve AI model performance and efficiency. Candidates should possess a... ...experience in software development, testing, and performance optimization. Responsibilities include engaging with product teams to resolve...
- ...effortlessly run large-scale ML applications, without the... ...current customers include top model labs, global enterprises, and... ..., graph lowering, compiler optimizations, runtime integration, and performance... ...and familiarity with model internals (e.g., attention, MoE, diffusion...
Internship
- ...Principal GenAI Inference Optimization Engineer to join our Models and Applications... ...to optimizing latency, throughput, and cost... ...Collaborate with hardware, compiler, and framework teams... .... Contribute to internal tools and, where applicable... .... Experience with ML frameworks (PyTorch,...
$185.1k - $335.3k
...the AI Kernels & Compilers team, that mission... ...new approaches to model export, kernel development... ...and CUDA/TensorRT internals, and mentoring... ...them into highly optimized inference... ...and effortless for ML engineers across the... ...fidelity, and on-vehicle latency. Along the way,...
Local area, Remote work, Work from home, Relocation package, Flexible hours
$160.5k - $240.7k
...developers to easily integrate, optimize, and deploy ML models on Qualcomm devices.... ...Workbench lets developers compile trained PyTorch or ONNX models... ...grows Partner with internal Business Units to onboard... ...constraints (target SoC, latency budgets, memory limits) into...
- ...are looking for a founding ML engineer who can help us turn powerful AI and ML models into fast, reliable production... ...: deploying models, optimizing inference latency and throughput, scaling serving... ...such as quantization, pruning, compilation, batching, caching, and memory...
Full time, Relocation package
- ...in Cupertino, California, is seeking an Intern to support electricity market simulation... ...analysis, build and validate SCUC/SCED models, and support data processing for various... ...machine learning methods and contribute to optimizing workflows.
Gridmatic
Internship
$31.25 - $41.5 per hour
...Renesas Electronics is seeking a Summer 2026 Intern in San Jose, CA, for a role focused on developing machine learning models. This hybrid position requires full-time enrollment in a master’s program and an interest in embedded systems. You’ll build and evaluate AI models...
Internship, Hourly pay, Full time, Summer internship
$31.25 - $41.5 per hour
...Renesas Electronics is offering a Summer 2026 Internship based in San Jose, CA, focusing on the development of machine learning models. Interns will participate in building, training, and evaluating AI models, gaining hands-on experience with industry tools and Renesas...
Internship, Hourly pay, Summer internship
- ...Persimmons Ai Compiler Developer Persimmons is building... ...-based large language models by applying... ...You will help design and optimize the Persimmons Compiler... ...utilization, throughput, and low latency. Contribute to graph... ...flows from high-level ML representations to low-...
- ...Fellow, AI Software (Workload Optimization) We are looking for a... ...software stack—from ROCm and compilers to high-level AI frameworks—is... ...the ability to map emerging model architectures to low-level software... ...or applied experience in AI/ML, including areas such as deep...
- ...effortlessly run large-scale ML applications, without... ...customers include top model labs, global... ...-removing blockers and optimizing for their specific use... ...tradeoffs between quality, latency, throughput, and cost... ...Understanding of model compilers and optimization. # Contributor...
Work experience placement, Work at office, Remote work, Shift work
- A leading materials engineering company seeks a 2026 Summer Intern in Physics & AI Modeling Engineering. This role is focused on advanced modeling projects utilizing physics and AI workflows. Candidates should have a Master's or PhD in a relevant field and familiarity with...
Internship, Full time, Summer internship
$152k - $241.5k
...Machine Learning Applications and Compiler Engineer! NVIDIA is seeking... ...to develop algorithms and optimizations for our LPX inference and compiler... ...seamless deployment of models across platforms. Benchmark,... ...spatial accelerators at top tier ML, compiler, and computer...
$229.9k - $262.4k
...Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)... ..., our applications of AI & ML are bringing humanity and... ...of customers. Our AI models and platforms empower teams across... ...performance (scalability, cost, latency, throughput) of large scale...
Full time, Part time, Local area
- ...leading technology company in Cupertino seeks an On-Device ML Infrastructure Engineer focused on developing advanced ML models for Apple's hardware. The role involves building graph compilers and runtimes, optimizing model execution, and contributing to the AI ecosystem....
$210k - $280k
...the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by... ...most valuable assets. About The Role The Principal Compiler Engineer - ML Systems position will be responsible for working with...
Full time, Temporary work, Local area, Flexible hours
$198.3k - $342.8k
...Manager, Proactive - On-Device Modeling Santa Clara, California,... ...their devices. As an Applied ML team, we're pushing the boundaries... ..., all while running a low latency production service and keeping... ...attention mechanisms, and inference optimization Strong software engineering...
Work experience placement, Relocation

