Machine Learning Engineer, ML Runtime & Optimization

$140k - $250k

pony.ai

Fremont, CA

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and a pioneer in extending autonomous mobility technologies and services across a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai had accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public on NASDAQ in November 2024.

Responsibilities

The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring.

As a Machine Learning Engineer in ML Runtime & Optimization, you will develop technologies to accelerate the training and inference of AI models in autonomous driving systems.

This includes:

  • Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures.

  • Collaborating closely with diverse groups at Pony.ai, including both hardware and software teams, to optimize and craft core parallel algorithms and to influence the next-generation compute platform architecture design and software infrastructure.

  • Applying model optimization and efficient deep learning techniques to models and to optimized ML operator libraries.

  • Working across the entire ML framework/compiler stack (e.g., Torch, CUDA and TensorRT) and on system-efficient deep learning models.

Requirements

  • BS, MS or Ph.D. in computer science, electrical engineering or a related discipline.
  • Strong programming skills in C/C++ or Python.
  • Experience with model optimization, quantization or other efficient deep learning techniques.
  • Good understanding of hardware performance, including CPU or GPU execution models, threads, registers, caches and cost/performance trade-offs.
  • Experience with profiling, benchmarking and validating performance for complex computing architectures.
  • Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks.
  • Strong communication skills and the ability to work cross-functionally between software and hardware teams.

Preferred Qualifications:

Experience in one or more of the following areas is preferred:

  • Experience with parallel programming, ideally CUDA, OpenCL or OpenACC.
  • Experience in computer vision, machine learning and deep learning.
  • Strong knowledge of software design, programming techniques and algorithms.
  • Good knowledge of common deep learning frameworks and libraries.
  • Deep knowledge of system performance, GPU optimization or ML compilers.

Compensation and Benefits

Base Salary Range: $140,000 - $250,000 Annually

Compensation may fall outside this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of total compensation, and this role may be eligible for bonuses/incentives and restricted stock units.

We also provide the following benefits to eligible employees:

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (Traditional and Roth 401k)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Free Food & Snacks
Vacancy posted more than 2 months ago