Staff Machine Learning Engineer - Autonomous Driving Model Quantization & Deployment

$215.28k - $364.32k

XPENG

XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent mobility, XPENG is dedicated to reshaping the future of transportation through cutting-edge R&D in AI, machine learning, and smart connectivity.

The Mission: The challenge of Vision-Language-Action (VLA) models and Foundation Models isn't just their intelligence; it's their real-time execution at the edge. We are seeking a high-caliber Staff Machine Learning Engineer to bridge the gap between massive research models and production-ready L4 autonomous driving systems. You will lead the effort to optimize and deploy our VLA models onto vehicle-grade compute platforms for our global fleet.

Key Responsibilities:
  • Lead Optimization Strategy: Own the end-to-end quantization and optimization roadmap for large-scale multimodal models (Transformers, VLMs).
  • Model Compression: Apply and innovate in PTQ (Post-Training Quantization), QAT (Quantization-Aware Training), and pruning techniques to fit VLA models into strict memory and power envelopes.
  • Hardware-Software Co-design: Collaborate directly with model researchers to ensure architectures are "deployment-friendly" and with platform teams to influence future hardware requirements.
  • Production Excellence: Develop and maintain robust, safety-critical deployment stacks in Modern C++, ensuring 24/7 stability and deterministic performance on the road.
Basic Qualifications:
  • Proven Track Record: 5-8 years of experience in model deployment, quantization, or high-performance computing (HPC).
  • Core Technical Skills: Mastery of Modern C++ and deep experience with CUDA or other hardware acceleration libraries.
  • Deep Learning Expertise: Strong familiarity with PyTorch and deep knowledge of inference engines like TensorRT, ONNX Runtime, or TVM.
  • Quantization Depth: Hands-on experience with INT8/FP8/INT4 quantization and knowledge of the unique challenges in quantizing Large Language Models (LLMs) or Transformers.
  • Platform Knowledge: Solid understanding of computer architecture (Cache, Memory Bandwidth, SIMD) and experience with embedded/edge compute constraints.
  • Systems Thinking: Ability to debug complex performance bottlenecks across the entire software stack.
Preferred Qualifications:
  • Experience with VLA/VLM or other Foundation Model deployment.
  • Background in autonomous driving, robotics, or real-time safety-critical systems.
  • Contributions to open-source inference or compiler projects.
What we provide:
  • A fun, supportive, and engaging environment
  • Infrastructure and computational resources to support your ML model development and research.
  • Opportunity to work on cutting-edge technologies with the top talent in the field.
  • Opportunity to make a significant impact on the transportation revolution by advancing autonomous driving
  • Competitive compensation package
  • Snacks, lunches, dinners, and fun activities

The base salary range for this full-time position is $215,280-$364,320, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.
Vacancy posted 19 hours ago