Machine Learning Engineer, Runtime & Optimization
$204k - $259kFull-time
Waymo
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states. The ML Platform team at Waymo provides a set of tools to support and automate the lifecycle of the machine learning workflow, including feature and experiment management, model development, optimization and monitoring. These efforts have resulted in making machine learning more accessible to teams at Waymo, including Perception, Planner, Research and Simulation. We are looking for engineers with ML software or ML systems expertise to help us improve compute performance on both cloud and car. You'll work across the entire ML stack from the system perspective, from efficient deep learning models, model compression, ML software (e.g. JAX, XLA, Triton, and CUDA), to . You will be pleasantly challenged with deploying Waymo ML models on limited computation resources. In this hybrid role, you will report to the Senior Manager of Runtime and Optimization. You Will
- Lead the collaboration with the world-class Waymo ML scientists in perception, planner, research and simulation. Identify opportunities in both systems and models to make ML workloads faster.
- Lead projects from proposals through execution by developing junior engineers.
- Analyze and improve ML system workloads on both cloud and self-driving cars .
- Apply model optimization, efficient deep learning techniques and ML software improvements to Waymo's ML systems.
- M.S. in CS, EE, Deep Learning or a related field
- 2+ years of experience as a technical lead, including writing project plans, engaging with customer teams, mentoring, responsible for goals & execution, reporting status.
- 5+ years of experience developing solutions in ML systems or ML software stack (Pytorch/JAX/TF, runtime libraries, ML compiler).
- Deep understanding of ML system architecture, performance analysis and tools.
- Strong Python or C++ programming skills
- PhD in CS, EE, Deep Learning or a related field.
- Familiarity with the HW architecture of ML hardware accelerators (e.g., GPU/TPU).
- Deep knowledge of model optimization or efficient deep learning techniques for foundation models or LLM.
- Experience with GPU HW or TPU HW and related system software.
Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, Runtime & Optimization in California vacancy
$159.05k - $199.3k
...the role We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production‑grade embedded runtime environments. You’ll work across the... ...Experience in working with deep learning frameworks (e.g., PyTorch, JAX, ONNX,...SuggestedFull timeFor contractorsFor subcontractor$140k - $250k
...support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to accelerate the training and...SuggestedTemporary work$278.1k - $347.6k
...games run inside a modern, browser-native runtime (built on technologies such as WebGPU... ...entirely within that runtime. As our Principal Engineer for On-Device AI Inference & Systems,... ...leaves research, through export, optimization, and kernel-level tuning, to a shipped feature...SuggestedWork at officeWorldwideRelocation package$170.5k - $315.49k
## Inference Optimization Engineer (local / edge runtime)Applylocations: US, California, Santa Clara: US, Oregon,... ...models run directly on the user's machine (AI PC, edge, on-prem, and beyond),... ...engines where it helps us# What you’ll learn / grow into*Curiosity is required....SuggestedInternshipLocal areaImmediate startShift work$150k - $200k
...collision and force guards, smoothing, and runtime monitors for safety‑critical deployment... ...‑speed robot autonomy software stack optimized for inference performance. Advance SOTA... ...PhD or MS degree in Computer Science, Machine Learning, Robotics, or equivalent technical...Suggested$170.5k - $315.49k
...run directly on the user's machine (AI PC, edge, on-prem, and beyond... ...We are seeking a Machine Learning Engineer / Data Scientist to join... ...intensive tasks. Debug and optimize training runs - Profile... ...model choices interact with runtime constraints on edge hardware...InternshipLocal areaImmediate startShift work$165.2k - $223.6k
...Description The Product: Amazon's Machine Learning accelerators are at the... ...includes an ML compiler, runtime and natively integrates into... ...including silicon engineering, hardware design and verification... ...Amazon Neuron team works to optimize the performance of complex neural...InternshipLocal areaFlexible hours$260k - $330k
...have: 3+ years of solid experience building machine learning, data science, ranking, prediction, recommendation, optimization, or large-scale data systems PubMatic is the... ...: We are looking for a Machine Learning Engineer to help build and improve performance optimization...Full timeWork at officeRemote work$110k - $300k
...data centers. Our talented team of engineers and industry-leading executives... ...positions become available. Develop, optimize, and deploy lightweight machine learning models for edge AI applications,... ...Understanding of ML compiler and runtime design. Experience working with...$208k - $263.5k
...Atoms is building the machines that power the next era... ..., operate them, learn from them, and improve... ...We are roboticists, engineers, operators, and builders... ...Research, Post-Training Optimization, and Data Engineering.... ...hardware-accelerated runtimes (e.g., TensorRT, CUDA)...Full timeInternshipWork at officeFlexible hours- ...generalist robotics a reality. We're looking for an Inference Optimization MLE to help build and operate the systems that make our... ...regressions across model versions Collaborate closely with research engineers to translate model innovations into optimized, deployment‑...
$160.36k - $240.54k
...team in Mountain View, CA. You will design, build, and deploy components for machine learning model life cycles, enhancing Nuro's autonomy stack. Candidates should have expertise in ML optimization techniques and programming skills in Python and C++. This role offers a...$181.1k - $318.4k
On-Device ML Infrastructure Engineer (CoreML Runtime), Graphics, Games & ML Cupertino, California, United States Machine Learning and AI Imagine being at the forefront of an evolution... ...to embedded systems, developing optimization toolkits for model compression and acceleration...Relocation$160.16k - $266.93k
...prototyping, test, manufacturing, and optimization. Our ~15,000 employees create... ...in over 100 countries. Learn more about what we do. Our... ...is hiring a ML Performance Engineer to make our training and... ...optimization, kernel fusion, runtime selection Write custom CUDA...Flexible hours- ...persona. Genies is looking for a Senior Machine Learning Engineer to join our Avatar Technology team,... ...will work across data, modeling, and runtime systems to ensure ML-driven animation... ...engines. Debug, profile, and optimize runtime performance across CPU, GPU,...Full timeWork experience placementWork at office
$215.28k - $364.32k
...Staff Machine Learning Engineer Santa Clara, CA XPENG is a leading smart technology company at... ...preparation, model training, evaluation, optimization, quantization, and deployment. You... ..., quantization accuracy drop, or runtime performance bottlenecks. Familiarity...Full time- ...career. THE ROLE: We are looking for a Principal Machine Learning Engineer to join our Models and Applications team. If you are excited... ...Improve the end-to-end training pipeline performance. Optimize the distributed training pipeline and algorithm to scale out...
- ...Staff Machine Learning Engineer As a Staff Machine Learning Engineer, you will be responsible for... ...inform research & product direction Optimize and validate models for integration... ...throughput optimization (e.g., TensorRT, ONNX Runtime, mixed precision), and building...Work at office
- Generalist is seeking an expert in machine learning optimizations to join our innovative team. You will own the optimizations for both training and on-robot inference stacks, focusing on achieving step-function gains. The ideal candidate should be proficient with the latest...
$181.1k - $318.4k
...commitment to environmental sustainability and optimal resource utilization. This team plays a... ...services. As a Sr. ML Optimization Engineer, you will work at the intersection of... ...strategy, applied analytics, machine learning, and large-scale optimization. You will...Relocation$19 - $65 per hour
PlusAI is seeking a Machine Learning Infrastructure Engineer Intern to work on high-performance kernels for BEV model training. In this role, you will... ...Triton. This internship also explores the use of LLMs to optimize code generation and performance profiling. The hourly...Hourly payInternship$181.1k - $318.4k
Senior Machine Learning Engineer, AI, SIML Cupertino, California, United States Machine Learning and AI Are you passionate about Generative... ...Preferred Qualifications Experience working on AI/ML-optimized runtime stacks. Familiarity with parallelization algorithms for...RelocationFlexible hours$160k - $240k
Tensec is looking for a Machine Learning Engineer in San Francisco to build algorithms and optimization systems that drive their autonomous decision engine. This role involves designing trading strategies, building execution layers, and deploying robust models that enhance...Relocation package- Qualcomm in San Diego is looking for a Machine Learning Engineer to implement graph optimization passes and develop AI solutions. The role emphasizes collaboration with engineering teams and requires experience in ML systems and Python programming. The position offers a...
$2,000 per month
...Machine Learning Research Engineer Cupertino, CA Etched is building AI chips that are hard-coded for... ...performance characterization tools, and runtime abstractions by implementing... ...allow for real-time image generation Optimize model instructions and scheduling algorithms...Work at officeRelocation package- jobr.pro is seeking a Staff Machine Learning Engineer to join our Vector Bidding Science team in Mountain View, California. In this critical... ...vision and develop advanced bidding systems using AI and optimization frameworks. You will design algorithms, analyze marketplace...
$178.4k - $267.6k
...Qualcomm Technologies, Inc. Job Area: Engineering Group, Engineering Group > Machine Learning Engineering General Summary:... ...design, develop and test model optimization techniques that include - but... ...like Huggingface Optimum, ONNX runtime and OpenVino. Proven hands-on experience...Work experience placementWork from home$150k - $300k
Director, Machine Learning EngineeringSkip to main contentGEICO uses cookies... ..., Machine Learning Engineering page is loaded## Director,... ...Great Careers.**Director - Runtime Intelligence & Personalization... ...environments**Observability & Optimization*** Establish observability...Hourly payTemporary workWork experience placementLocal area- The Role We're looking for someone who loves optimizing model inference to join us in building the core of ComfyUI - the most complex and bleeding-edge part of our engine. You'll be working on making AI models run faster and more efficiently than anyone thought possible...
- ...combines threat intelligence with machine learning, enables financial... ...scale. As a Senior Software Engineer, ML Infrastructure at TRM Labs... ...concurrent models and users. Optimize high-throughput inference.... ...stacks such as TensorRT, ONNX Runtime, vLLM, FlashAttention, and...Worldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer, Runtime & Optimization. Be the first to apply!
Related searches
- machine learning engineer California
- computer vision machine learning engineer California
- machine learning scientist California
- machine learning remote California
- machine learning California
- artificial intelligence - machine learning intern California
- internship machine learning California
- data engineer machine learning California
- machine learning intern California
- machine learning research scientist California

