Senior ML Inference Engineer - Platform
$128.7k - $261.3k
General Motors Proving Ground
About the Team

The Model Deployment & Inference Solutions team in GM AV deploys machine learning models from training frameworks (e.g. PyTorch) onto autonomous vehicle hardware. Our mission is two‑fold: build the ML deployment platform that makes model rollouts fast and predictable, and optimize models so they meet the real‑time latency and memory budgets required to run on‑vehicle. Our work is on the critical path of GM's publicly committed launch of eyes‑off (hands‑free, eyes‑free) autonomous driving in 2028, debuting on the Cadillac Escalade IQ, building on Super Cruise's billion‑plus hands‑free miles.

About the Role

This role sits in the team's Platform pillar. We own the unified ML deployment platform that automates the path from a trained model to inference on the vehicle, along with the developer‑experience and agentic‑tooling layer that makes deployment self‑serve for every ML model development team at GM.

Responsibilities

- Design, build, and operate the ML deployment platform that automates the path from trained model to on‑vehicle inference.
- Drive cross‑organization model deployments to the autonomous vehicle stack, partnering with model development teams to take high‑value models from training to production on‑vehicle.
- Build agentic tools that diagnose and fix deployment‑blocking issues, automating workflows currently performed manually by engineers.
- Build the developer experience that ML model development teams use day to day: tooling, dashboards, automation, and observability.
- Drive shift‑left validation that surfaces deployment risk (compile, runtime, parity, latency) early in the model development cycle.
- Build platform tools that integrate the work of our sister teams (kernels, compiler, reduced precision and parity) so their optimization wins land directly in the deployment workflow.
- Partner with the team's Performance pillar and model development teams across the AV organization.
Required Qualifications

- BS, MS, or PhD in Computer Science or a related technical field.
- 3+ years of relevant industry experience.
- Strong fundamentals and excellent coding ability in Python.
- Experience building or operating production platform or infrastructure systems where reliability, observability, and extensibility matter.
- Experience with ML model deployment, inference integration, model optimization workflows, or model serving infrastructure, with at least one prior context where you owned the path from a trained model to a running inference workload.
- Experience using coding agents (Cursor, Claude Code, GitHub Copilot, or equivalent) as part of your engineering workflow.
- Experience designing clean, well‑tested software with clear interfaces and good abstractions.
- Strong cross‑team collaboration skills.

Preferred Qualifications

- Experience building agentic or LLM‑powered developer tooling.
- Experience with ML or workflow orchestration frameworks (Airflow, Temporal, Flyte, Ray, Kubeflow, or equivalent).
- Familiarity with the NVIDIA GPU stack at the integration level (CUDA‑aware Python, TensorRT, Triton inference server, torch.compile, ONNX).
- Experience with inference‑serving frameworks (Triton, TorchServe, Ray Serve, vLLM) or edge‑deployment toolchains.
- Experience with low‑latency or real‑time systems.
- Experience in autonomous vehicles, robotics, or other safety‑critical ML deployment domains.
- Open‑source contributions to PyTorch, Ray, Airflow, Temporal, vLLM, TensorRT, or related projects.

Compensation

The salary range for this role is $128,700 to $261,300. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position. Bonus potential is offered through an incentive pay program based on company performance, job level, and individual performance.
Benefits

GM offers a variety of health and wellbeing benefit programs including medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation and holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.

Work Location and Travel

This role is based remotely, but if the selected candidate lives within a specific mile radius of a GM hub, they will be expected to report to the location three times a week (or other frequency dictated by their manager). The selected candidate will be required to travel

Non‑Discrimination and Equal Employment Opportunities (U.S.)

General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. All employment decisions are made on a non‑discriminatory basis without regard to sex, race, color, national origin, citizenship status, religion, age, disability, pregnancy or maternity status, sexual orientation, gender identity, status as a veteran or protected veteran, or any other similarly protected status in accordance with federal, state and local laws.

