Staff ML Engineer, Inference Platform
$185.5k - $270kGeneral Motors Proving Ground
Job Description
Hybrid This role is categorized as hybrid. This means the successful candidate is expected to report to the Sunnyvale Tecnical Center, CA at least three times per week, at minimum or other frequency dictated by the business. This job is eligible for relocation assistance.
About the Team:
The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, reliable, and cost-efficient platform that powers GM's AI efforts. We're proud to serve as the AI infrastructure platform for teams developing autonomous vehicles (L3/L4/L5), as well as other groups building AI-driven products for GM and its customers. We enable rapid innovation and feature development by optimizing for high-priority, ML-centric use cases. Our platform supports the serving of state-of-the-art (SOTA) machine learning models for experimental and bulk inference, with a focus on performance, availability, concurrency, and scalability. We're committed to maximizing GPU utilization across platforms (B200, H100, A100, and more) while maintaining reliability and cost efficiency.
About the Role:
We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms for ML workflows. In this role, you'll work closely with ML engineers and researchers to ensure efficient model serving and inference in production, for their workflows such as data mining, labeling, model distillation, simulations and more. This is a high-impact opportunity to influence the future of AI infrastructure at GM. You will play a key role in shaping the architecture, roadmap and user-experience of a robust ML inference service supporting real-time, batch, and experimental inference needs. The ideal candidate brings experience in designing distributed systems for ML, strong problem-solving skills, and a product mindset focused on platform usability and reliability.
What you'll be doing:
Design and implement core platform backend software components.
Collaborate with ML engineers and researchers to understand critical workflows, parse them to platform requirements, and deliver incremental value.
Lead technical decision-making on model serving strategies, orchestration, caching, model versioning, and auto-scaling mechanisms.
Drive the development of monitoring, observability, and metrics to ensure reliability, performance, and resource optimization of inference services.
Proactively research and integrate state-of-the-art model serving frameworks, hardware accelerators, and distributed computing techniques.
Lead large-scale technical initiatives across GM's ML ecosystem.
Raise the engineering bar through technical leadership, establishing best practices.
Contribute to open source projects; represent GM in relevant communities.
Minimum Requirements
8+ years of industry experience, with focus on machine learning systems or high performance backend services.
Expertise in either Go, Python, C++ or other relevant coding languages.
Expertise in ML inference, model serving frameworks (triton, rayserve, vLLM etc).
Strong communication skills and a proven ability to drive cross-functional initiatives.
Experience working with cloud platforms such as GCP, Azure, or AWS.
Ability to thrive in a dynamic, multi-tasking environment with ever-evolving priorities.
Preferred Qualifications
Hands-on experience building ML infrastructure platforms for model serving/inference.
Experience working with or designing interfaces, apis and clients for ML workflows.
Experience with Ray framework, and/or vLLM.
Experience with distributed systems, and handling large-scale data processing.
Familiarity with telemetry, and other feedback loops to inform product improvements.
Familiarity with hardware acceleration (GPUs) and optimizations for inference workloads.
Contributions to open-source ML serving frameworks.
Why Join Us?
If you're excited to tackle some of today's most complex engineering challenges, see the impact of your work in real-world AV applications, and help shape the future of AI infrastructure at GM-this is the team for you.
Compensation: The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington
Compensation: The expected base compensation for this role is : $185,500 - $270,000 Actual base compensation within the identified range will vary based on factors relevant to the position.
Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.
Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.
Company Vehicle: Upon successful completion of a motor vehicle report review, you will be eligible to participate in a company vehicle evaluation program, through which you will be assigned a General Motors vehicle to drive and evaluate. Note: program participants are required to purchase/lease a qualifying GM vehicle every four years unless one of a limited number of exceptions applies.
#GM-AV-1
About GM
Our vision is a world with Zero Crashes, Zero Emissions and Zero Congestion and we embrace the responsibility to lead the change that will make our world better, safer and more equitable for all.
Why Join Us
We believe we all must make a choice every day - individually and collectively - to drive meaningful change through our words, our deeds and our culture. Every day, we want every employee to feel they belong to one General Motors team.
Benefits Overview
From day one, we're looking out for your well-being-at work and at home-so you can focus on realizing your ambitions. Learn how GM supports a rewarding career that rewards you personally by visiting Total Rewards resources ( .
Non-Discrimination and Equal Employment Opportunities (U.S.)
General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. We strongly believe that providing an inclusive workplace creates an environment in which our employees can thrive and develop better products for our customers.
All employment decisions are made on a non-discriminatory basis without regard to sex, race, color, national origin, citizenship status, religion, age, disability, pregnancy or maternity status, sexual orientation, gender identity, status as a veteran or protected veteran, or any other similarly protected status in accordance with federal, state and local laws.
We encourage interested candidates to review the key responsibilities and qualifications for each role and apply for any positions that match their skills and capabilities. Applicants in the recruitment process may be required, where applicable, to successfully complete a role-related assessment(s) and/or a pre-employment screening prior to beginning employment. To learn more, visit How we Hire ( .
Accommodations
General Motors offers opportunities to all job seekers including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email (View email address on click.appcast.io) us or call us at View phone number on click.appcast.io. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.
We are leading the change to make our world better, safer and more equitable for all through our actions and how we behave. Learn more about:
Our Company (
Our Culture
How we hire??????? (
Our diverse team of employees bring their collective passion for engineering, technology and design to deliver on our vision of a world with Zero Crashes, Zero Emissions and Zero Congestion. We are looking for adventure-seekers and imaginative thought leaders to help us transform mobility.
Explore our global locations (
We are determined to lead change for the world through technology, ingenuity and harnessing the creativity of our diverse team. Join us to help lead the change that will make our world better, safer and more equitable for all by becoming a member of GM's Talent Community (beamery.com) ( . As a part of our Talent Community, you will receive updates about GM, open roles, career insights and more.
Please note that filling out the form below will not add you to our Talent Community automatically; you will need to use the link above. If you are seeking to apply to a specific role, we encourage you to click "Apply Now" on the job posting of interest.
The policy of General Motors is to extend opportunities to qualified applicants and employees on an equal basis regardless of an individual's age, race, color, sex, religion, national origin, disability, sexual orientation, gender identity/expression or veteran status. Additionally, General Motors is committed to being an Equal Employment Opportunity Employer and offers opportunities to all job seekers including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email us at View email address on click.appcast.io .In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.
$195k - $298k
...eligible for relocation assistance. About the Team The ML Inference Platform is part of the AI Compute Platforms organization... ...and cost efficiency. About the Role We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms for...SuggestedRelocation packageFlexible hours- ...industry-leading training and inference speeds and empowers machine... ...effortlessly run large-scale ML applications, without the hassle... ...Role The Inference ML Engineering team at Cerebras Systems is dedicated... ...the full potential of our platform, leveraging its performance,...Suggested
$128.7k - $261.3k
...About the Team The Model Deployment & Inference Solutions team in GM AV deploys... .... Our mission is two-fold: build the ML deployment platform that makes model rollouts fast and predictable... ...currently performed manually by engineers. Build the developer experience...SuggestedLocal areaRemote workWork from homeRelocation packageFlexible hoursShift work- ...View is seeking a Machine Learning Engineer to build and optimize the infrastructure... ...for its Intelligence Composition Platform. The role involves designing and deploying ML models for multimodal data understanding, optimizing inference pipelines, and collaborating with...Suggested
$212k - $386.3k
...Senior Staff Machine Learning Engineer, Apple Search & Knowledge Platforms Apple is where individual imaginations gather together... ...cost-efficient LLM-based model inference. Integrate the Apple ecosystem... ...engineering/applied research/ML experiences in natural language...SuggestedTemporary workWorldwideRelocation$181.1k - $318.4k
...Staff Machine Learning Engineer: Platform Intelligence - Apple Maps Apple Maps and the thousands of applications... ...for on-device training and inference. You will partner with a variety of... ...technical design documents for new ML features is a core part of this role...Relocation$212k - $386.3k
...Sr Staff Machine Learning Engineer, ML Platform Work Locations (2) Submit Resume At Apple, we work every day to create products that enrich people... ...with foundational infrastructure in model training, inference and agentic AI, as well as associated data and application...Immediate startRelocation$170k - $300k
...and maintaining the tools and services engineers here at GM use every day to do their best... ...domain. The Role We are looking for a Staff Engineer with an extensive engineering... ...Team The Continuous Integration (CI) Platform team owns our CI infrastructure along...Work experience placementWork at officeLocal areaRemote workWork from homeRelocation packageFlexible hours3 days per week$220.5k - $298.5k
Intuit Inc. in Mountain View is seeking a Senior Staff Machine Learning Engineer to drive advanced AI research and innovation for the Intuit Business Platform. This role requires deep expertise in Generative AI and related fields, with a focus on prototyping and building...- General Motors seeks a Staff Engineer in Mountain View, CA, to enhance tools and services for development teams. The ideal candidate will have over 7 years of experience in designing scalable production systems, with expertise in Go and Python programming. This hybrid...
- An automotive leader seeks a Senior ML Infrastructure Engineer to build and enhance platforms for ML Inference workflows. The role involves collaborating with engineers to ensure optimal model serving and overseeing system performance. Candidates should have substantial...
$180k - $280k
...autonomous vehicle behavior across real-world scenarios. As a Staff AI/ML Engineer within the Onboard Embodied AI organization, you will be a... ..., delivering end-to-end solutions capable of real-time inference and robust autonomous driving performance. Lead and...Local areaWork from homeRelocationRelocation package$181.1k - $318.4k
...Sr. / Staff ML Engineer, FM Training Integration - ML Compute We are looking for a ML Engineer to join our ML Compute team to help improve... ..., scalability, and reliability of model training and inference workloads in the cloud. In this role, you will lead the integration...Relocation$185.1k - $335.3k
...development, and performance engineering so that every cycle on... ...into fast, reliable inference across GPUs powering GM... ...teams to co-design a platform that enables new ideas... ...driving. The Role As a Staff Compiler Engineer on... ..., and effortless for ML engineers across the AV...Local areaRemote workWork from homeRelocation packageFlexible hours$189.3k - $290.7k
...autonomous vehicle behavior across real-world scenarios. As a Staff ML Infra Engineer on the Offboard Perception team within the Embodied AI... ...and observability frameworks, including training metrics, inference performance metrics, data quality checks, and production monitoring...Local areaRemote workWork from homeRelocationRelocation packageFlexible hours- A leading AI technology company in Sunnyvale, California, is seeking a skilled software engineer to optimize its AI cloud platform for model training and inference. In this role, you'll enhance deployment efficiency and ensure system reliability and scalability. The ideal...
$181.1k - $318.4k
...Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model Work Locations (2)... ...execution of large-scale training and inference jobs. This role spans scheduling algorithms... ...Apple's distributed AI compute platform Minimum Qualifications ~7+...Relocation$212k - $386.3k
...Staff Machine Learning Engineer, Apple Search & Knowledge Platforms The AI, Search & Knowledge Platforms team builds amazing products and services for Apple's customers... ...the team's workflow. Working on the end-to-end ML lifecycle: algorithm design and implementation,...Relocation$230k - $300k
...NewsBreak is the Content Intelligence platform shaping the future content economy.... ...About the Role We are seeking a Staff Machine Learning Engineer to provide technical leadership for... ...experience building large-scale ML or recommendation systems. - Deep expertise...Full timeLocal areaWork from home$124k - $195.5k
...Machine Learning Applications and Compiler Engineer for New College Grad 2026 in Santa Clara,... ...will focus on developing algorithms for inference and compiler stack optimizations, working... ...development, and experience with ML frameworks like TensorFlow and PyTorch. A...$181.1k - $318.4k
...something! Description As a Senior/Staff Engineer on the Foundation Model... ...of large‑scale training and inference jobs. This role spans scheduling... ...systems for distributed ML workloads running on Kubernetes... ...Apple’s distributed AI compute platform Minimum Qualifications 7+...Relocation- Senior Staff Machine Learning Engineer, AI Platform & Emerging Capabilities Intuit is seeking a Senior Staff Machine Learning Engineer to help shape the future... ...the multi-year technical vision and roadmap for AI/ML platform capabilities, ensuring Intuit stays at the forefront...
- B Capital is seeking a Machine Learning Engineer in Palo Alto, California to work on their next-gen agentic AI platform. The role involves collaborating with teams to design and implement AI systems while innovating new solutions that greatly impact Salesforce customers...
$218.8k - $335.3k
...Job Description Staff AI/ML Engineer, AV ML Infra We're General Motors (GM), a company driving... ...includes: AI Validation & Inference: Ensures robust model performance by... ...Points: ~ Experience with Google Cloud Platform, Microsoft Azure, or Amazon Web Services...Local areaWork from homeFlexible hours$199.7k - $254.6k
...CX AI Incubation Team as a Senior AI/ML DevOps Engineer and help productionize LLM/SLM capabilities... ...observable AI services, optimizing inference performance from CPU and small GPUs to... ...to build and run production-grade AI platforms and services that transform customer engagement...Full timeTemporary workLocal areaFlexible hours- Apple Inc. is seeking a Machine Learning Engineer to join the ADP ML Data Platform team in Cupertino, California. The role encompasses designing and building scalable systems for ML data and embeddings, optimizing AI models for production, and driving automation and reliability...
$189k - $300k
...The team directly works on and delivers ML models to the product that successively go... ...collaborative, high-impact team of AI/ML engineers, data scientists and engineers who are passionate... ...-generation autonomous vehicles. As a Staff AI/ML Engineer in the Embodied AI Data...Local areaRemote workWork from homeRelocationRelocation packageFlexible hours$232k - $310k
...Fortune 500 organizations. Our platform is built from the ground up,... ..., and high standards. Our engineers, product leaders, and go-to-market... ...world works. About AII/ML Team Our AI/ML team is building... ...strategies (QLORA, DPO) and inference optimization (vLLM, TensorRT-...Work experience placementWork at officeRemote workFlexible hours3 days per week$189.3k - $320.7k
...deploying machine learning solutions that support safe and reliable autonomous vehicle behavior across real-world scenarios. As a Staff ML Engineer on the Prometheus team within the Embodied AI organization, you will be a senior engineer responsible for developing and...Local areaRemote workWork from homeRelocationRelocation packageFlexible hours$160.5k - $240.7k
...Technologies, Inc. Job Area: Engineering Group, Engineering Group... ...Qualcomm AI Hub is the platform for on-device AI - enabling developers... ..., optimize, and deploy ML models on Qualcomm devices. Qualcomm... ...IRs, operator fusion, shape inference, lowering passes, backend...Work experience placementWork from home
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff ML Engineer, Inference Platform. Be the first to apply!
- staff security engineer Sunnyvale, CA
- assistant engineer Sunnyvale, CA
- engineering aide Sunnyvale, CA
- staff engineer Sunnyvale, CA
- technology administrator Sunnyvale, CA
- senior staff systems engineer Sunnyvale, CA
- assistant mechanical engineer Sunnyvale, CA
- software engineer staff Sunnyvale, CA
- senior staff engineer Sunnyvale, CA
- machine learning ai engineer Sunnyvale, CA

