Staff ML Engineer, Inference Platform

$195k - $298k

General Motors Proving Ground

Hybrid

This role is categorized as hybrid. The successful candidate is expected to report to the Sunnyvale Technical Center, CA at least three times per week, or at another frequency dictated by the business.

This job is eligible for relocation assistance.

About the Team

The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, reliable, and cost-efficient platform that powers GM’s AI efforts. We’re proud to serve as the AI infrastructure platform for teams developing autonomous vehicles (L3/L4/L5), as well as other groups building AI-driven products for GM and its customers. We enable rapid innovation and feature development by optimizing for high-priority, ML-centric use cases.

Our platform supports the serving of state-of-the-art (SOTA) machine learning models for experimental and bulk inference, with a focus on performance, availability, concurrency, and scalability. We’re committed to maximizing GPU utilization across platforms (B200, H100, A100, and more) while maintaining reliability and cost efficiency.

About the Role

We are seeking a Staff ML Infrastructure Engineer to help build and scale robust compute platforms for ML workflows. In this role, you’ll work closely with ML engineers and researchers to ensure efficient model serving and inference in production for workflows such as data mining, labeling, model distillation, simulations, and more.

This is a high-impact opportunity to influence the future of AI infrastructure at GM. You will play a key role in shaping the architecture, roadmap, and user experience of a robust ML inference service supporting real-time, batch, and experimental inference needs.

The ideal candidate brings experience in designing distributed systems for ML, strong problem-solving skills, and a product mindset focused on platform usability and reliability.

What you’ll be doing

Design and implement core platform backend software components.
Collaborate with ML engineers and researchers to understand critical workflows, translate them into platform requirements, and deliver incremental value.
Lead technical decision-making on model serving strategies, orchestration, caching, model versioning, and auto-scaling mechanisms.
Drive the development of monitoring, observability, and metrics to ensure reliability, performance, and resource optimization of inference services.
Proactively research and integrate state-of-the-art model serving frameworks, hardware accelerators, and distributed computing techniques.
Lead large-scale technical initiatives across GM’s ML ecosystem.
Raise the engineering bar through technical leadership and by establishing best practices.
Contribute to open source projects; represent GM in relevant communities.

Minimum Requirements

8+ years of industry experience, with a focus on machine learning systems or high-performance backend services.
Expertise in Go, Python, C++, or another relevant programming language.
Expertise in ML inference and model serving frameworks (Triton, Ray Serve, vLLM, etc.).
Strong communication skills and a proven ability to drive cross-functional initiatives.
Experience working with cloud platforms such as GCP, Azure, or AWS.
Ability to thrive in a dynamic, multi-tasking environment with ever-evolving priorities.

Preferred Qualifications

Hands-on experience building ML infrastructure platforms for model serving/inference.
Experience working with or designing interfaces, APIs, and clients for ML workflows.
Experience with the Ray framework and/or vLLM.
Experience with distributed systems and large-scale data processing.
Familiarity with telemetry and other feedback loops to inform product improvements.
Familiarity with hardware acceleration (GPUs) and optimizations for inference workloads.
Contributions to open-source ML serving frameworks.

Why Join Us

If you’re excited to tackle some of today’s most complex engineering challenges, see the impact of your work in real-world AV applications, and help shape the future of AI infrastructure at GM, this is the team for you.

Compensation

The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.

Compensation: The expected base compensation for this role is $195,000 - $298,000. Actual base compensation within the identified range will vary based on factors relevant to the position.

Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.

Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation and holidays, tuition assistance programs, employee assistance program, GM vehicle discounts, and more.

Company Vehicle

Upon successful completion of a motor vehicle report review, you will be eligible to participate in a company vehicle evaluation program, through which you will be assigned a General Motors vehicle to drive and evaluate. Note: program participants are required to purchase or lease a qualifying GM vehicle every four years unless one of a limited number of exceptions applies.


Vacancy posted 1 day ago

