Senior ML Inference Engineer - Platform

$128.7k - $261.3k

General Motors Proving Ground

About the Team The Model Deployment & Inference Solutions team in GM AV deploys machine learning models from training frameworks (e.g. PyTorch) onto autonomous vehicle hardware. Our mission is two‑fold: build the ML deployment platform that makes model rollouts fast and predictable, and optimize models so they meet the real‑time latency and memory budgets required to run on‑vehicle. Our work is on the critical path of GM's publicly committed launch of eyes‑off (hands‑free, eyes‑free) autonomous driving in 2028, debuting on the Cadillac Escalade IQ, building on Super Cruise's billion‑plus hands‑free miles. About the Role This role sits in the team's Platform pillar. We own the unified ML deployment platform that automates the path from a trained model to inference on the vehicle, along with the developer‑experience and agentic‑tooling layer that makes deployment self‑serve for every ML model development team at GM. Responsibilities Design, build, and operate the ML deployment platform that automates the path from trained model to on‑vehicle inference. Drive cross‑organization model deployments to the autonomous vehicle stack, partnering with model development teams to take high‑value models from training to production on‑vehicle. Build agentic tools that diagnose and fix deployment‑blocking issues, automating workflows currently performed manually by engineers. Build the developer experience that ML model development teams use day to day: tooling, dashboards, automation, and observability. Drive shift‑left validation that surfaces deployment risk (compile, runtime, parity, latency) early in the model development cycle. Build platform tools that integrate the work of our sister teams (kernels, compiler, reduced precision and parity) so their optimization wins land directly in the deployment workflow. Partner with the team's Performance pillar and model development teams across the AV organization. Required Qualifications BS, MS, or PhD in Computer Science or a related technical field. 3+ years of relevant industry experience. Strong fundamentals and excellent coding ability in Python. Experience building or operating production platform or infrastructure systems where reliability, observability, and extensibility matter. Experience with ML model deployment, inference integration, model optimization workflows, or model serving infrastructure, with at least one prior context where you owned the path from a trained model to a running inference workload. Experience using coding agents (Cursor, Claude Code, GitHub Copilot, or equivalent) as part of your engineering workflow. Experience designing clean, well‑tested software with clear interfaces and good abstractions. Strong cross‑team collaboration skills. Preferred Qualifications Experience building agentic or LLM‑powered developer tooling. Experience with ML or workflow orchestration frameworks (Airflow, Temporal, Flyte, Ray, Kubeflow, or equivalent). Familiarity with the NVIDIA GPU stack at the integration level (CUDA‑aware Python, TensorRT, Triton inference server, torch.compile, ONNX). Experience with inference‑serving frameworks (Triton, TorchServe, Ray Serve, vLLM) or edge‑deployment toolchains. Experience with low‑latency or real‑time systems. Experience in autonomous vehicles, robotics, or other safety‑critical ML deployment domains. Open‑source contributions to PyTorch, Ray, Airflow, Temporal, vLLM, TensorRT, or related projects. Compensation The salary range for this role is $128,700 to $261,300. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position. Bonus potential is offered through an incentive pay program based on company performance, job level, and individual performance. Benefits GM offers a variety of health and wellbeing benefit programs including medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation and holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more. Work Location and Travel This role is based remotely, but if the selected candidate lives within a specific mile radius of a GM hub, they will be expected to report to the location three times a week (or other frequency dictated by your manager). The selected candidate will be required to travel Non‑Discrimination and Equal Employment Opportunities (U.S.) General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. All employment decisions are made on a non‑discriminatory basis without regard to sex, race, color, national origin, citizenship status, religion, age, disability, pregnancy or maternity status, sexual orientation, gender identity, status as a veteran or protected veteran, or any other similarly protected status in accordance with federal, state and local laws. #J-18808-Ljbffr General Motors

Apply

Vacancy posted 23 hours ago

Similar jobs that could be interesting for youBased on the Senior ML Inference Engineer - Platform in Mountain View, CA vacancy

Senior ML Inference Engineer - Platform
$128.7k - $261.3k
The Model Deployment & Inference Solutions team in GM AV deploys machine learning models... .... Our mission is two-fold: build the ML deployment platform that makes model rollouts fast and... ..., or equivalent) as part of your engineering workflow. Experience designing clean,...
Senior
Flexible hours
General Motors
Sunnyvale, CA
4 days ago
Senior ML Inference Platform Engineer (Remote)
Israelvcforum is looking for a Senior ML Infrastructure Engineer in Mountain View, California. This position aims to build and scale robust platforms for ML inference workflows supporting GM’s AI efforts. You will collaborate with ML engineers and researchers to implement...
Senior
Remote job
Israelvcforum
Mountain View, CA
1 day ago
Remote Senior ML Inference Platform Engineer
General Motors is seeking a Senior ML Infrastructure Engineer to build and scale a robust platform for machine learning inference workflows. You will design backend software components, collaborate with ML engineers, and lead initiatives across GM's ML ecosystem. With over...
Senior
Remote job
General Motors
Sunnyvale, CA
3 days ago
Senior ML Deployment Engineer — Platform (Remote)
$128.7k - $261.3k
General Motors is seeking a skilled professional for the role focused on ML deployment for autonomous vehicles. This position involves designing platforms that automate model inference and collaborating across teams to enhance development workflows. The ideal candidate...
Senior
Remote job
General Motors
Mountain View, CA
3 days ago
ML Engineer — AI Platform & Multimodal Inference
...View is seeking a Machine Learning Engineer to build and optimize the infrastructure... ...for its Intelligence Composition Platform. The role involves designing and deploying ML models for multimodal data understanding, optimizing inference pipelines, and collaborating with...
Suggested
Corvic
Mountain View, CA
4 days ago
Staff ML Engineer, Inference Platform
$195k - $298k
...is eligible for relocation assistance. About the Team The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure... .... About the Role We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms for ML...
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
3 days ago
Senior Software Engineer, Inference Platform Palo Alto
We’re looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and AI-native... ...AI Platform organization and collaborate with ML researchers and engineers from our Voyage.ai acquisition...
Senior
Local area
Worldwide
MongoDB
Palo Alto, CA
23 hours ago
Senior ML Engineer, Agentic AI Platforms
$148.5k - $223.9k
Salesforce AI Research in Palo Alto is seeking a Machine Learning Engineer to develop next-generation agentic AI systems. You will closely collaborate with research scientists and product managers to innovate and design cutting-edge AI solutions. The ideal candidate possesses...
Senior
Centaur Labs
Palo Alto, CA
1 day ago
Senior ML Engineer - Agentic AI Platform & Customer Impact
A leading technology company is seeking a Machine Learning Engineer to develop next-generation agentic AI platforms. In this role, you will collaborate with research scientists and engineers to design and implement innovative AI solutions. Candidates should possess exceptional...
Senior
Salesforce, Inc..
Palo Alto, CA
1 day ago
Senior ML Infra Engineer for AI Validation Platform
General Motors is looking for a Senior ML Infrastructure Engineer to build robust compute platforms for AI validation. This role emphasizes driving efficiency and maximizing GPU utilization while improving platform reliability. You will collaborate with engineers to shape...
Senior
General Motors
Sunnyvale, CA
1 day ago
Senior ML Infra Engineer for Scalable API Platforms
$166k - $244k
Carlsbad Tech is actively seeking a Senior Software Engineer to work on the Gemini Live API in Sunnyvale, CA. This role involves building scalable... ...in software development, infrastructure management, and AI/ML technologies. Benefits include a competitive salary ranging...
Senior
Carlsbad Tech
Sunnyvale, CA
1 day ago
Senior Machine Learning Engineer - VETi Platform
...Engagement Technology and Imager - platform is an AI-enabled wearable... .... We are looking for a Senior Machine Learning Engineer to build the AI foundation... ...Integrate and optimize AI inference into the VETi platform's... ..., with at least one major ML framework (PyTorch,...
Senior
Kodiak Sciences Inc
Palo Alto, CA
2 days ago
Senior ML Deployment Platform Engineer
General Motors is seeking a Machine Learning Engineer for the Model Deployment & Inference Solutions team in Sunnyvale, California. The role involves building and optimizing a unified ML deployment platform to ensure efficient model rollouts for autonomous vehicles. Candidates...
Senior
General Motors
Sunnyvale, CA
4 days ago
Senior ML Compiler & Inference Systems Engineer
$152k - $287.5k
NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role... ...algorithms for their LPX inference and compiler stack, optimizing... ...neural network workloads on NVIDIA platforms. Ideal candidates will possess...
Senior
NVIDIA Gruppe
Santa Clara, CA
1 day ago
Senior Inference Platform Engineer — Low-Latency, Multi-Tenant
A leading data platform company in Palo Alto seeks a Senior Engineer to develop a cutting-edge inference platform supporting semantic search and AI-native experiences. The ideal... ...Go, Rust, or Python. You'll work alongside ML researchers to enhance infrastructure for real...
Senior
MongoDB
Palo Alto, CA
23 hours ago
Senior/Staff Software Engineer - Machine Learning Platform (Inference)
$236k - $339.25k
...Machine Learning Platform Team Member At Snowflake, we are powering... ..., and enable end-to-end ML workflows. We are on an early... ...collaboratively and proactively with senior architects, PMs, and team... ...in serving LLMs using inference engines like vLLM, TensorRT-LLM, TEI,...
Senior
Flexible hours
Streamlit
Menlo Park, CA
5 days ago
Senior ML Accelerator Engineer - GPU
$128.7k - $261.3k
...approaches to model export, kernel development, and performance engineering so that every cycle on our accelerators translates into... ...and custom libraries that sit at the heart of our on‑vehicle ML inference for ADAS and autonomous driving . We own making core AI workloads...
Senior
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
5 days ago
Senior ML Engineer - Embodied AI Onboard Autonomy
$158k - $241.9k
...future of transportation on a global scale. Role: As a Senior AI/ML Engineer within the Onboard Embodied AI organization, you will be a... ..., delivering end-to-end solutions capable of real-time inference and robust autonomous driving performance. Lead and architect...
Senior
Local area
Work from home
Relocation package
Flexible hours
General Motors
Mountain View, CA
2 days ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...development, and performance engineering so that every cycle on... ...into fast, reliable inference across GPUs powering... ...teams to co-design a platform that enables new ideas... .... The Role As a Senior Compiler Engineer on... ...reliable, and effortless for ML engineers across the...
Senior
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
5 days ago
Matterport - Senior ML Ops Engineer
$173k - $253k
...Senior MLOps Engineer Matterport is leading the digital transformation of... ...groundbreaking spatial computing platform turns buildings into data... ...You will work closely with ML R&D Engineers and other engineering... ...model performance, optimize inference speed and resource...
Senior
Work at office
Work from home
CoStar Group
Sunnyvale, CA
23 hours ago
Staff Inference ML Runtime Engineer
...-leading training and inference speeds and empowers machine... ...run large-scale ML applications, without... ...The Inference ML Engineering team at Cerebras Systems... ...full potential of our platform, leveraging its performance... ...and usability. As a Senior Software Engineer on...
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
3 days ago
Senior ML Engineer - Embodied AI Onboard Autonomy
$158k - $241.9k
...the future of transportation on a global scale. Role As a Senior AI/ML Engineer within the Onboard Embodied AI organization, you will be a... ...models, delivering end‑to‑end solutions capable of real‑time inference and robust autonomous driving performance. Lead and...
Senior
Local area
Relocation package
Flexible hours
Israelvcforum
Mountain View, CA
1 day ago
Senior ML Platform Engineer — Real-Time Fraud Detection
A leading AI-powered fraud detection platform in Mountain View is seeking experienced platform engineers to design and build advanced machine learning systems. You will engage in improving core detection algorithms, using unsupervised and supervised machine learning, and...
Senior
DataVisor
Mountain View, CA
1 day ago
Senior ML Engineer - Model Compression
$128.7k - $261.3k
...developers and deployment and infra engineers to ship numerically robust,... ...Mathematics, Data Science / ML, or a closely related... ...model compression / efficient inference or relevant experience ~ StrongproficiencyinPyTorchandexperience... ...(automotive SoCs, robotics platforms, or similar) ~ Published...
Senior
Local area
Remote work
Work from home
Relocation package
Flexible hours
General Motors
Mountain View, CA
5 days ago
Staff ML Engineer — Ultra-Low-Latency Inference
A tech company in Mountain View seeks talented engineers for a role emphasizing high-performance systems, inference optimization, and model acceleration. You will thrive in ambiguity, tackle unclear problems, and design impactful solutions. The position offers a competitive...
Inworld
Mountain View, CA
1 day ago
Staff ML Infra Engineer: Scalable Inference Platform (Hybrid)
A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in Sunnyvale, CA. The... ...skills in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and...
General Motors
Sunnyvale, CA
2 days ago
Senior MLOps Engineer: RAG & LLM Platform
AI Chopping Block, Inc. is looking for a Senior AI Engineer to develop Retrieval-Augmented Generation (RAG) systems. This role involves leading the architecture of AI-powered assistants to enhance industrial troubleshooting. Successful candidates will have a strong background...
Senior
AI Chopping Block, Inc.
Palo Alto, CA
1 day ago
Senior ML Platform Engineer (Autonomous Driving)
...everything is connected and moves autonomously through a self‑managing urban transportation operating system. At 42dot, our AD ML Platform Engineers build the core data platform and ML training / eval platform for the cutting edge algorithms in autonomous driving. We...
Senior
Full time
Work experience placement
42dot Inc.
Sunnyvale, CA
3 days ago
Senior ML Systems Engineer
...deliver industry‑leading training and inference speeds and empowers machine... ...users to effortlessly run large‑scale ML applications, without the hassle of... ...seeking a versatile and experienced engineer to join our SOTA Training Platform team. This team is responsible to rapidly...
Senior
Internship
Cerebras
Sunnyvale, CA
4 days ago
Senior Applied ML Research Engineer, X’s Moonshot for Specialized Professional Intelligence
$165k - $238k
...startup. As an innovation engine, X focuses on... ...driven team of experienced ML researchers, software engineers... ...About The Role As a Senior Applied Research... ...professional intelligence platform. You will focus on building... ...and high-fidelity inference over extremely large corpora...
Senior
Full time
Work at office
3 days per week
X, The Moonshot Factory
Mountain View, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Inference Engineer - Platform. Be the first to apply!