Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Compiler Intern: Optimize Latency & ML Models

TETRAMEM INC

A technology company in California is looking for a Compiler Engineer Intern to support the Compiler Software team in developing a compiler and runtime for in-memory machine learning accelerators. Responsibilities include exploring compiler development techniques and engaging with QA teams. Requirements include a Bachelor's in Computer Science/Engineering and programming skills in C++, C, or Rust. The role offers a valuable opportunity to gain experience in AI network architectures and compiler optimizations. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the ML Compiler Intern: Optimize Latency & ML Models in San Jose, CA vacancy
  •  ...US 2026 Software - Compiler Engineer Intern We are seeking a Compiler Engineer intern to support the Compiler Software...  ...and support the development of compiler optimizations to improve latency/bandwidth of our ML models Engage with QA teams Requirements Bachelor's degree... 
    Internship

    TETRAMEM INC

    San Jose, CA
    1 day ago
  • $212.8k

     ...Responsibilities: - Convert and compile ML models for execution on edge NPUs,...  ...- Apply hardware-aware optimization strategies, such as...  ...and operator fusion, to meet latency, memory and power targets....  ...unsupervised contact with internal/external clients and/or colleagues... 
    Suggested
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    4 days ago
  • $156k - $387.6k

     ...Machine Learning Engineer - AI Compiler Optimization Location: San Jose...  ...specifically for recommendation model scenarios, including but not...  ...compilation optimization and low-latency inference optimization for...  ...unsupervised contact with internal/external clients and/or... 
    Suggested
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    3 days ago
  • $184k - $287.5k

    Responsibilities Develop state‑of‑the‑art model optimization techniques—speculative decoding with...  ...of GPU architecture, the compilation stack, and the ability to debug end‑to...  ...hardware‑in‑the‑loop testing, and ultra‑low latency requirements. Benefits Base salary range... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    20 hours ago
  • $50 - $70 per hour

     ...Software Intern - AI Compilers Santa Clara, California, United States Tenstorrent...  ...innovations in software models, compilers, platforms,...  ...Benchmark, analyze, and optimize performance of key machine learning...  ...Develop novel ML models and primitives that take... 
    Internship

    Tenstorrent University

    Santa Clara, CA
    4 days ago
  • $19 - $65 per hour

     ...Machine Learning Infrastructure Engineer Intern to advance their AI-based virtual driver...  ...involves identifying bottlenecks in BEV model training and implementing high-performance...  ...explore using LLMs for code generation to optimize processes. The position offers competitive... 
    Internship
    Hourly pay

    Medium

    Santa Clara, CA
    1 day ago
  • $19 - $65 per hour

     ...seeking a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will analyze training bottlenecks...  .... This internship also explores the use of LLMs to optimize code generation and performance profiling. The... 
    Internship
    Hourly pay

    PlusAI, Inc.

    Santa Clara, CA
    1 day ago
  • $128.7k - $261.3k

     ...repeatable, high-velocity model deployments through...  ...numerically robust, low-latency models to the car, blending...  ...our deployment, compiler, and kernel teams Advancing...  ..., Data Science / ML, or a closely related quantitative...  ...focused onmodel optimization and deployment, with... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    4 days ago
  •  ...Ellis Technologies, Inc. is seeking an enthusiastic intern in San Jose to build scalable machine learning...  ...and enhance the quality of ads through advanced models. You will work alongside various disciplines to optimize ad strategies and contribute to innovative shopping... 
    Internship
    Hourly pay
    Summer work

    Ellis Technologies, Inc.

    San Jose, CA
    1 day ago
  • $147k - $211k

    Google Inc. is seeking a skilled ML Compiler Software Engineer for its Sunnyvale office. The position requires a Bachelor's degree, proficiency...  .... In this role, you will focus on developing compiler optimizations for Tensor Processing Units (TPUs), enhancing parallelization... 
    Full time
    Work at office

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $148.75k - $361k

     ...techniques. The role involves significant applied research to tackle complex advertising challenges related to conversion modeling and optimization. The position offers a competitive salary between $148,750 - $361,000 annually, along with excellent benefits including health... 

    Jobr

    San Jose, CA
    1 day ago
  •  ...You will focus on developing innovative methods for conversion modeling and enhance ad performance through generative models. The ideal...  ...conduct original research, drive A/B testing initiatives, and optimize outcomes through advanced AI technologies. This role is essential... 

    Experimentation Jobs

    San Jose, CA
    1 day ago
  • $174k - $252k

     ...Senior Machine Learning Engineer in Sunnyvale, CA, to improve AI model performance and efficiency. Candidates should possess a...  ...experience in software development, testing, and performance optimization. Responsibilities include engaging with product teams to resolve... 

    Google Inc.

    Sunnyvale, CA
    2 days ago
  •  ...effortlessly run large-scale ML applications, without the...  ...current customers include top model labs, global enterprises, and...  ..., graph lowering, compiler optimizations, runtime integration, and performance...  ...and familiarity with model internals (e.g., attention, MoE, diffusion... 
    Internship

    Dormont Manufacturing Co

    Sunnyvale, CA
    11 hours ago
  •  ...Principal GenAI Inference Optimization Engineer to join our Models and Applications...  ...to optimizing latency, throughput, and cost...  ...Collaborate with hardware, compiler, and framework teams...  .... Contribute to internal tools and, where applicable...  .... Experience with ML frameworks (PyTorch,... 

    Advanced Micro Devices , Inc.

    San Jose, CA
    1 day ago
  • $185.1k - $335.3k

     ...the AI Kernels & Compilers team, that mission...  ...new approaches to model export, kernel development...  ...and CUDA/TensorRT internals, and mentoring...  ...them into highly optimized inference...  ...and effortless for ML engineers across the...  ...fidelity, and on-vehicle latency. Along the way,... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    4 days ago
  • $160.5k - $240.7k

     ...developers to easily integrate, optimize, and deploy ML models on Qualcomm devices....  ...Workbench lets developers compile trained PyTorch or ONNX models...  ...grows Partner with internal Business Units to onboard...  ...constraints (target SoC, latency budgets, memory limits) into... 

    Qualcomm

    Santa Clara, CA
    1 day ago
  •  ...are looking for a founding ML engineer who can help us turn powerful AI and ML models into fast, reliable production...  ...: deploying models, optimizing inference latency and throughput, scaling serving...  ...such as quantization, pruning, compilation, batching, caching, and memory... 
    Full time
    Relocation package

    HiringCafe

    Cupertino, CA
    4 days ago
  •  ...in Cupertino, California, is seeking an Intern to support electricity market simulation...  ...analysis, build and validate SCUC/SCED models, and support data processing for various...  ...machine learning methods and contribute to optimizing workflows. #J-18808-Ljbffr Gridmatic
    Internship

    Gridmatic

    Cupertino, CA
    20 hours ago
  • $31.25 - $41.5 per hour

     ...Renesas Electronics is seeking a Summer 2026 Intern in San Jose, CA, for a role focused on developing machine learning models. This hybrid position requires full-time enrollment in a master’s program and an interest in embedded systems. You’ll build and evaluate AI models... 
    Internship
    Hourly pay
    Full time
    Summer internship

    Renesas

    San Jose, CA
    1 day ago
  • $31.25 - $41.5 per hour

     ...Renesas Electronics is offering a Summer 2026 Internship based in San Jose, CA, focusing on the development of machine learning models. Interns will participate in building, training, and evaluating AI models, gaining hands-on experience with industry tools and Renesas... 
    Internship
    Hourly pay
    Summer internship

    Renesas

    San Jose, CA
    1 day ago
  •  ...Persimmons Ai Compiler Developer Persimmons is building...  ...-based large language models by applying...  ...You will help design and optimize the Persimmons Compiler...  ...utilization, throughput, and low latency. Contribute to graph...  ...flows from high-level ML representations to low-... 

    Persimmons, Inc.

    San Jose, CA
    4 days ago
  •  ...Fellow, AI Software (Workload Optimization) We are looking for a...  ...software stack—from ROCm and compilers to high-level AI frameworks—is...  ...the ability to map emerging model architectures to low-level software...  ...or applied experience in AI/ML, including areas such as deep... 

    Advanced Micro Devices , Inc.

    San Jose, CA
    2 days ago
  •  ...effortlessly run large-scale ML applications, without...  ...customers include top model labs, global...  ...-removing blockers and optimizing for their specific use...  ...tradeoffs between quality, latency, throughput, and cost...  ...Understanding of model compilers and optimization. # Contributor... 
    Work experience placement
    Work at office
    Remote work
    Shift work

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    3 days ago
  • A leading materials engineering company seeks a 2026 Summer Intern in Physics & AI Modeling Engineering. This role is focused on advanced modeling projects utilizing physics and AI workflows. Candidates should have a Master's or PhD in a relevant field and familiarity with... 
    Internship
    Full time
    Summer internship

    Applied Materials, Inc.

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...Machine Learning Applications and Compiler Engineer! NVIDIA is seeking...  ...to develop algorithms and optimizations for our LPX inference and compiler...  ...seamless deployment of models across platforms. Benchmark,...  ...spatial accelerators at top tier ML, compiler, and computer... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $229.9k - $262.4k

     ...Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)Applylocations...  ..., our applications of AI & ML are bringing humanity and...  ...of customers. Our AI models and platforms empower teams across...  ...performance — scalability, cost, latency, throughput — of large scale... 
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    1 day ago
  •  ...leading technology company in Cupertino seeks an On-Device ML Infrastructure Engineer focused on developing advanced ML models for Apple's hardware. The role involves building graph compilers and runtimes, optimizing model execution, and contributing to the AI ecosystem.... 

    Apple

    Cupertino, CA
    1 day ago
  • $210k - $280k

     ...the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by...  ...most valuable assets. About The Role The Principal Compiler Engineer - ML Systems position will be responsible for working with... 
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova Systems

    San Jose, CA
    2 days ago
  • $198.3k - $342.8k

     ...Manager, Proactive - On-Device Modeling Santa Clara, California,...  ...their devices. As an Applied ML team, we're pushing the boundaries...  ..., all while running a low latency production service and keeping...  ...attention mechanisms, and inference optimization Strong software engineering... 
    Work experience placement
    Relocation

    Apple

    Santa Clara, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Compiler Intern: Optimize Latency & ML Models. Be the first to apply!