Staff Machine Learning Engineer - Autonomous Driving Model Quantization & Deployment
$215.28k - $364.32kXPENG
XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent mobility, XPENG is dedicated to reshaping the future of transportation through cutting-edge R&D in AI, machine learning, and smart connectivity. The Mission: The challenge of Vision-Language-Action (VLA) models and Foundation Models isn't just their intelligence-it's their real-time execution at the edge . We are seeking a high-caliber Staff Machine Learning Engineer to bridge the gap between massive research models and production-ready L4 autonomous driving systems. You will lead the effort to optimize and deploy our VLA models onto vehicle-grade compute platforms for our global fleet. Key Responsibilities:
- Lead Optimization Strategy: Own the end-to-end quantization and optimization roadmap for large-scale multimodal models (Transformers, VLMs).
- Model Compression: Apply and innovate in PTQ (Post-Training Quantization), QAT (Quantization-Aware Training), and pruning techniques to fit VLA models into strict memory and power envelopes.
- Hardware-Software Co-design: Collaborate directly with model researchers to ensure architectures are "deployment-friendly" and with platform teams to influence future hardware requirements.
- Production Excellence: Develop and maintain robust, safety-critical deployment stacks in Modern C++ , ensuring 24/7 stability and deterministic performance on the road.
- Proven Track Record: 5-8 years of experience in model deployment, quantization, or high-performance computing (HPC).
- Core Technical Skills: Mastery of Modern C++ and deep experience with CUDA or other hardware acceleration libraries.
- Deep Learning Expertise: Strong familiarity with PyTorch and deep knowledge of inference engines like TensorRT , ONNX Runtime, or TVM.
- Quantization Depth: Hands-on experience with INT8/FP8/INT4 quantization and knowledge of the unique challenges in quantizing Large Language Models (LLMs) or Transformers.
- Platform Knowledge: Solid understanding of computer architecture (Cache, Memory Bandwidth, SIMD) and experience with embedded/edge compute constraints.
- Systems Thinking: Ability to debug complex performance bottlenecks across the entire software stack.
- Experience with VLA /VLM or other Foundation Model deployment.
- Background in autonomous driving, robotics, or real-time safety-critical systems.
- Contributions to open-source inference or compiler projects.
- A fun, supportive and engaging environment
- Infrastructures and computational resources to support your ML model development/research.
- Opportunity to work on cutting edge technologies with the top talent in the field.
- Opportunity to make significant impact on transportation revolution by the means of advancing autonomous driving
- Competitive compensation package
- Snacks, lunches, dinners, and fun activities
Vacancy posted 19 hours ago
Similar jobs that could be interesting for youBased on the Staff Machine Learning Engineer - Autonomous Driving Model Quantization & Deployment in Santa Clara, CA vacancy
$124k
...just training models, we're... ...agents that autonomously operate computers... ..., we deploy these models... ...post-training quantization and quantization... ...massive deep learning models run... ...of miles of driving + robot interactions... ..., inference engine, and silicon... ...Science, Machine Learning,...SuggestedHourly payFull timeTemporary workImmediate startFlexible hours$244.14k - $413.16k
...Senior Staff Machine Learning Engineer - Foundation Model Santa Clara, CA XPENG is a leading... ...integrating advanced AI and autonomous driving technologies into its... ...to design, train, and deploy large-scale multi-... ..., including quantization, export, and latency–accuracy...SuggestedFull time$129.19k - $247.04k
...Company DiDi's autonomous driving unit was established... ...The Foundation Model Team focuses on building... ...and generalizable deep learning systems that enable safe... ...intersection of large-scale machine learning, autonomous... ..., and on-vehicle deployment Collaborate...Suggested$148.91k - $252k
...Machine Learning Engineer - LLM, AI & Robotics Santa Clara, CA XPENG is a leading... ...integrating advanced AI and autonomous driving technologies into its... ...small LLMs, weight sharing, model quantization, etc.) that can be deployed locally in Robots and Cars....SuggestedFull time$204k - $259k
...Waymo is an autonomous driving technology company... ...Driver. The DUE Machine Learning team will build... ...machine learning models to deliver... ...researchers and software engineers who are... ...development and deployment of cutting-edge... ...distillation and quantization. Build and scale...SuggestedFull time- ...and foundation models enable... ...of automated driving systems. Our... ...excellence, constantly learning and evolving... ...As an ML Engineer within the Application... ...model-based autonomous driving—both... ...and real-world deployment. You’ll... ...success as a Machine Learning Engineer...Full timeWork at officeWork from home
$170k - $216k
...Waymo is an autonomous driving technology company with the mission... ...the system which learns the spatial-temporal... ...of sensors, enabling engineers like you to (1) develop... ...data, to (2) develop models and model training at... ...years experience in Machine Learning and/or...Full timeRemote work$204k - $259k
...Waymo is an autonomous driving technology company with the mission... ...an advanced ML and engineering team that leverages... ...vision, deep learning, and generative AI to... ...vision / multimodal models (e.g., Gemini) to extract... ...prototyping to production deployment and scaling for...Full timeRemote work$128.7k - $261.3k
...everything we do in autonomous and assisted driving. The AV... ...new approaches to model export, kernel development... ...development, and performance engineering so that every cycle... ...integration, and deployment tooling, with a... ...developing and deploying machine learning models?...Local areaWork from homeRelocation packageFlexible hours$204k - $259k
...Waymo is an autonomous driving technology company with... ...the system which learns the spatial-... ...sensors, enabling engineers like you to (1) develop... ..., to (2) develop models and model... ...of experience in Machine Learning, with a... ...evaluation, and deployment in a production environment...Full timeRemote work$154.9k - $222.37k
...applications from automated driving to industrial robotics,... ...to 3D position, allowing autonomous devices like vehicles and... ...Overview: As an ML Engineer on our perception team, you... ...own the development and deployment of 3D perception models across object detection,...Flexible hours$153.2k - $234.1k
...people as we aim to make driving safer, smarter, and... ...the future of autonomous driving? Join the Embodied... ...that powers every machine learning engineer working on our... ...Autonomous Driving models. From foundational models... ...implementation, and deployment of scalable platforms...Work at officeLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$215.28k - $364.32k
...Staff Machine Learning Engineer - Ai Foundation Santa Clara, CA... ...integrating advanced AI and autonomous driving technologies into... ...large foundation model and accelerating... ...Implement and benchmark (Quantization, Knowledge... ...optimization, etc.). Deploy optimized models across...Full time$213k - $263k
...Waymo is an autonomous driving technology company... ...collaborative group of machine learning (ML) engineers, software... ...algorithms, to model the real world,... ...to a Senior Staff Engineering Manager... ...and deployment of cutting-edge... ...distillation and quantization. Build and scale...Full timeRemote work$238k - $302k
...Waymo is an autonomous driving technology company with the... ...Driver. The DUE Machine Learning team will build and... ...advanced machine learning models to deliver training... ...and software engineers who are passionate about... ...learning models and ML deployment at scale We...Full time$203.45k - $344.3k
...Senior Staff Physical AI Data Algorithm Engineer Santa Clara, CA XPENG... ...advanced AI and autonomous driving technologies into... ...-edge R&D in AI, machine learning, and smart connectivity... ...supports weekly model iteration, cross-... ...to on-board deployment and continuous optimization...Full timeTemporary workWork experience placement$140k - $230k
...-first automated driving technology and Toyota... ..., and business models that transform... ...state-of-the-art of machine learning (ML) for... ...software and hardware engineers and researchers to... ...mapping system for autonomous driving. Unlike... ...train, validate, and deploy new models,...Full timeTemporary workFlexible hours- ...hiring our Founding Machine Learning Engineer (MLE) with expertise... ...Development and Time-Series Modeling. You'll play a... ...you'll design, train, deploy, and monitor ML systems... ..., tool integration, autonomous workflows, memory/context... ...solutions and drive rapid iteration What...Visa sponsorship
$203.45k - $344.3k
...Senior Staff AI Data Infrastructure/Pipeline Engineer Santa Clara, CA XPENG... ...advanced AI and autonomous driving technologies into... ...-edge R&D in AI, machine learning, and smart connectivity... ...production → model training /... ...performance tuning and deployment experience. ~ Experience...Full timeOverseas- ...software for factory-built autonomous trucks.... ...Plus to accelerate the deployment of next-generation autonomous... ...a huge impact and drive the future of... ...are seeking a Senior Machine Learning Engineer with expertise in deep... ...vehicle simulation models tailored to various...
$156k - $387.6k
...recommendation, weakly-supervised learning, few-shot classification, video... ...more. We aim to succeed both in driving measurable business impact (e.g.... ...breakthroughs. 3. Drive engineering deployment and implementation, ensuring model stability, scalability, and efficiency...Temporary workLocal area$176k - $420k
...Tesla is looking for an experienced applied Machine Learning Engineers to help build models that deliver robust for robotics to drive the future of autonomy across all current... ..., ML model training, and on-vehicle deployment Experiment with data generation and fleet...Hourly payFull timeTemporary workFlexible hours- ...Team's Vision: Our Engineering team is shaping the future... ...the development of autonomous agents that don't just... ..., but take action—driving complex automation and... ...RAG, MCP, fine tuning models and prompt engineering... ...complex multi-region cloud deployments. Vector DB...Immediate start
$213k - $263k
...Waymo is an autonomous driving technology company... ...massive foundation models directly onto... ...collaboration to engineer robust, high-reliability... ...Vision, Machine Learning, Robotics, or a... ...frameworks and quantization techniques for real... ...building, training, or deploying Multimodal...Full timeRemote work$19 - $65 per hour
...for factory-built autonomous trucks. Headquartered... ...to accelerate the deployment of next-generation... ...a huge impact and drive the future of... ...to fine-tune the model. Connect the prototype... ...simulation engine. Develop metrics... ...Familiarity with deep learning frameworks (...Internship- ...seeking a highly skilled Machine Learning Engineer with deep expertise in developing... ...’s Eye View (BEV) fusion models using multimodal sensor... ...radar sensors to support autonomous driving and 3D scene... ...with research, data, and deployment engineers to refine models...
$19 - $65 per hour
...for factory-built autonomous trucks. Headquartered... ...to accelerate the deployment of next-generation... ...a huge impact and drive the future of autonomy... ...for planning models. Support Reinforcement Learning: Create the infrastructure... ...vision, and machine learning. Proficiency...Internship- ...Top 3 Required skills: Machine Learning, Gen AI, Python •... ...refine prompts (prompt engineering) including system... ...reliability. Implement autonomous agent workflows... ...control, and transparent model documentation.... ...and lead end-to-end deployment, optimization, and MLOps...Hourly payPermanent employmentWork at officeRemote work3 days per week
$212.8k
...Convert and compile ML models for execution on edge NPUs, and apply quantization mechanisms. - Profile and... ...Computer Science, Electrical Engineering, Computer Engineering,... ...industry experience in machine learning software engineering, model deployment, or ML systems for...Temporary workLocal area$160k - $200k
...factory-built autonomous trucks. Headquartered... ...accelerate the deployment of next-... ...huge impact and drive the future of autonomy... ...Infrastructure Engineer at Plus, you... ...for managing model versioning... ...of-the-art deep learning frameworks like... ...'s possible in machine learning infrastructure...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Machine Learning Engineer - Autonomous Driving Model Quantization & Deployment. Be the first to apply!
Related searches
- engineering aide Santa Clara, CA
- software engineer staff Santa Clara, CA
- technology administrator Santa Clara, CA
- staff engineer Santa Clara, CA
- senior staff engineer Santa Clara, CA
- assistant engineer Santa Clara, CA
- senior staff systems engineer Santa Clara, CA
- senior ml engineer Santa Clara, CA
- computer vision machine learning engineer Santa Clara, CA
- machine learning software engineer Santa Clara, CA



