Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior ML Engineer, Autonomy Evaluation

Bedrock Robotics Inc

Join the team bringing advanced autonomy to the built world At Bedrock, we’re moving AI out of the lab and into the real world. Our team is composed of industry veterans who helped launch Waymo, scaled Segment to a $3.2B acquisition, and grew Uber Freight to $5B in revenue. Today, we’re deploying autonomous systems on heavy construction machinery across the country, accelerating project schedules of billion-dollar infrastructure projects and improving safety on job sites. Backed by $350M in funding, we’re working quickly to close the gap between America’s surging demand for housing, data centers, manufacturing hubs, and the construction industry’s growing labor shortage. This is where algorithms meet steel‑toed boots. You’ll collaborate with construction veterans and world‑class engineers to solve physical‑world problems that simulations can’t touch. If you’re ready to apply cutting‑edge technology to solve meaningful problems alongside a talented team—we’d love to have you join us. Machine Learning Engineer: Evaluation Bedrock is bringing autonomy to the construction industry! We’re a group of veterans from the autonomous vehicle industry who are passionate about bringing the benefits of automation to areas in the construction industry currently underserved by the market. We’re looking for a highly motivated engineer with experience evaluating complex ML systems deployed in the real world. Your mission: translate the infinite nuance of the built world into actionable, AI‑native evaluations that accelerate Bedrock Operator adoption. The ideal candidate has hands‑on experience building evaluation systems and designing and executing statistical tests to gauge performance deltas between system iterations. More importantly, you’ve iterated on complex ML systems run in production environments, and you understand the complexities that come with it. What you’ll do: Design and maintain eval systems: Build pipelines for measuring system performance – across open loop and closed loop simulation, hardware‑in‑the‑loop systems, and field data from Bedrock Operator‑equipped machinery. Excite other teams to gain insights earlier in the development cycle through streamlined workflows. Develop metrics: Connect product goals and system behavior – by bridging real‑world specification to measurable indicators from logged data. Empower confident decision making from parameter tuning to program planning by slicing through the noise and delivering objective insights. Classify data sources for training and testing: Implement infrastructure and classifiers – to self‑annotate data and allow creation of datasets for a variety of training and evaluation use cases. Leverage models to source rich annotations for massive datasets to accelerate model iteration. Predict system performance: Model metrics and interpret results – from various sources ranging from raw sensor data to key leading indicators. Determine whether new construction sites pose hidden challenges and drive business decisions about deployment readiness. What we’re looking for: Engineers who are currently Senior or Staff level with 5+ years of professional software engineering, data science, or research experience 2+ years of professional experience analyzing modern ML or robotics system performance on real‑world problems Proficiency in Python and a data warehouse query language and comfort with development on infrastructure within parallelized cloud‑based frameworks Strong statistical analysis skills (classification, model fit bias determination, hypothesis testing, and uncertainty quantification) Experience working with large datasets Bonus points: We’re especially interested in engineers who have applied statistical backgrounds to ML research or real‑world robotics applications. Our roles are often flexible. If you don’t fit all the criteria, or are in another location (especially one where we have an office like SF or NY) please apply anyway! We’d love to consider you. #J-18808-Ljbffr

Vacancy posted 8 hours ago
Similar jobs that could be interesting for youBased on the Senior ML Engineer, Autonomy Evaluation in San Francisco, CA vacancy
  •  ...client, I am hiring for a Senior Machine Learning Engineer – End-to-End (E2E) to help...  ...efficient, and human-like autonomy solutions for real-world freight...  ...representations Train and evaluate models using large-scale...  ...Solid understanding of modern ML architectures used in end-... 
    Senior

    Hydrogen UK Ltd

    San Francisco, CA
    8 hours ago
  • $229k - $276k

     ...Zoox is seeking a Senior Machine Learning Engineer to enhance validation processes for their autonomous mobility solutions in San Francisco. This...  ...learning, programming in Python, and an understanding of autonomy challenges. A PhD or equivalent experience is preferred.... 
    Senior

    jobs.frontdoordefense.com - Jobboard

    San Francisco, CA
    8 hours ago
  •  ...Arena Intelligence, Inc. in San Francisco is seeking a Senior Machine Learning Engineer to enhance AI model evaluation systems. You will work on data pipelines, inference APIs, and new evaluation methods. The ideal candidate possesses strong programming skills, experience... 
    Senior

    Arena Intelligence, Inc.

    San Francisco, CA
    1 day ago
  • $229k - $276k

     ...Senior Machine Learning Engineer, Autonomy Validation Location: San Francisco Bay Area Compensation: $229,000 -...  ...validation problems at the intersection of ML and data science. You will serve as...  ..., including model training, evaluation, and optimization. Strong programming... 
    Senior
    Temporary work

    jobs.frontdoordefense.com - Jobboard

    San Francisco, CA
    7 hours ago
  • Airbnb, Inc. is hiring a Senior Staff Machine Learning Engineer, focusing on driving evaluation strategies and data infrastructure for CSxAI initiatives. This role requires...  ...PhD in a relevant field, extensive experience in ML/AI systems, and strong leadership in technical... 
    Senior
    Remote job

    airbnb, Inc.

    San Francisco, CA
    4 days ago
  •  ...Proficiency in Python and standard ML frameworks (e.g., JAX,...  ...designing and using metrics for evaluating complex AI systems , (...  ...technical leadership, influencing senior stakeholders, and driving innovation...  ...for researchers and software engineers who are passionate about... 
    Senior

    Waymo

    San Francisco, CA
    4 days ago
  • $204k - $259k

     ...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver...  ...or equivalent practical experience Experience in ML engineering and applied Deep Learning Experience with... 
    Senior
    Full time
    Temporary work
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $240.45k - $300.3k

     ...Senior Machine Learning Engineer - Model Evaluations, Public Sector The Public Sector ML team at Scale deploys advanced AI systems-including LLMs, agentic models, and multimodal pipelines-into mission-critical government environments. We build evaluation frameworks... 
    Senior
    Full time

    Scale AI

    San Francisco, CA
    7 days ago
  • $146k - $280k

     ...Waabi is seeking a Senior Applied Data Scientist in San Francisco to shape evaluation methodologies for autonomous driving technology. Responsibilities include designing production frameworks, prototyping analyses, and developing analytical models to correlate simulation... 
    Senior
    Flexible hours

    Waabi

    San Francisco, CA
    8 hours ago
  •  ...Join the team bringing advanced autonomy to the built world At Bedrock...  ...veterans and world-class engineers to solve physical-world problems...  ...datasets Build metrics to evaluate model performance in open loop...  ...infrastructure teams to integrate ML models into real-world... 
    Work at office
    Flexible hours

    Bedrock Robotics Inc

    San Francisco, CA
    8 hours ago
  •  ...Plaid Inc is looking for a Senior Research Scientist to lead applied research on their foundation model. The successful candidate will design model architectures and develop robust evaluation frameworks, making impactful contributions across Plaid's diverse financial... 
    Senior

    Plaid

    San Francisco, CA
    1 day ago
  •  ...scale.  About the Role We are looking for a visionary Senior ML Engineer who will bridge the gap between high-level architecture and...  ...models, embeddings; building clustering models; utilizing evaluation frameworks to quantify performance ~ Proficiency in agent... 
    Senior
    Shift work

    Palm Venture Studios

    San Francisco, CA
    7 days ago
  • $200k - $400k

     ...an innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning...  ...ll work across the full ML lifecycle, from structuring...  ...datasets to deploying, evaluating, and training models in...  ...who thrives on scale, autonomy, and cross‑functional collaboration... 
    Senior
    Work experience placement

    Troveo AI

    San Francisco, CA
    8 hours ago
  • $128.7k - $261.3k

     ...development, and performance engineering so that every cycle on our accelerators...  ...driving. The Role As a Senior Compiler Engineer on the AI...  ...reliable, and effortless for ML engineers across the AV...  ...engineering, and real‑world autonomy, this role puts your decisions... 
    Senior
    Local area
    Flexible hours

    Israelvcforum

    San Francisco, CA
    7 hours ago
  •  ...Senior ML/RL Engineer, Behavior Planning At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous...  ...cross-functional teams to design robust reward functions and evaluation metrics that balance safety, progress, and comfort.... 
    Senior
    Shift work

    Bot Auto

    San Francisco, CA
    1 day ago
  •  ...leading financial technology company in San Francisco is seeking a Senior Research Scientist to lead applied research on their...  ...background in machine learning, with experience in model performance evaluations and production system development. This role is crucial in... 
    Senior

    Plaid

    San Francisco, CA
    7 hours ago
  • $200k - $300k

     ...Zoox is looking for a Senior/Staff Machine Learning Engineer in San Francisco to enhance agent behaviors using...  ...solutions while working with safety and autonomy engineers. A BS, MS, or PhD in...  ...with experience in deep learning and ML pipelines. Salary ranges from $200,0... 
    Senior

    jobs.frontdoordefense.com - Jobboard

    San Francisco, CA
    7 hours ago
  •  ...A leading AI solutions company in San Francisco is seeking an ML Eval Engineer to design evaluation benchmarks and improve model performance. This role involves working with unstructured enterprise data and collaborating closely with the ML and engineering teams. You will... 

    Reducto

    San Francisco, CA
    8 hours ago
  • $152k - $228k

     ...Job Description Job Description Senior ML Engineer About Invoca Invoca is an AI-powered revenue execution platform that brings together...  ...maintain CI/CD pipelines for ML artifacts — including model evaluation, versioning, and automated deployment. Serve as the primary... 
    Senior
    Currently hiring
    Remote work
    Flexible hours

    Invoca

    San Francisco, CA
    18 days ago
  • Arcada Labs Incorporated is seeking an ML Research Engineer in San Francisco to lead evaluations of AI models based on human preferences. You will design experiments and analysis pipelines to enhance our understanding of AI capabilities and contribute to user-facing tools... 

    Arcada Labs Incorporated

    San Francisco, CA
    3 days ago
  •  ...MakerMaker.AI is seeking a Senior ML Engineer in San Francisco. In this role, you will build and maintain machine learning systems and pipelines...  ...researchers and owning the data pipelines for training and evaluation. If you have 6+ years of experience in this field, apply to... 
    Senior

    MakerMaker.AI

    San Francisco, CA
    7 hours ago
  •  ...with safety, privacy, and real-world responsibility in mind. Our ML team comes from a culture of academic research driven to...  ...privacy for the sake of ML advancement. Responsibilities Own LLM evaluation processes and methods with a focus on generating benchmarks representative... 
    Local area
    Shift work

    Capitolis

    San Francisco, CA
    4 days ago
  • $208k - $300k

     ...A leading AI company is seeking a Machine Learning Engineer in the Public Sector to develop automated evaluation pipelines for AI models. You will work on advanced...  ...a strong programming background and experience in ML evaluation frameworks. Competitive salary range is... 

    Scale AI

    San Francisco, CA
    7 hours ago
  •  ...information, please visit  Role Description: We are seeking a Senior ML Scientist/Engineer to design models that operate on multimodal time-series...  ...data analysis, feature engineering, model training, evaluation, and optimization.  Design and evaluate machine... 
    Senior

    Gridware

    San Francisco, CA
    a month ago
  •  ...Enterprise AI Customer Support startup with their search for senior/staff ML research engineers. The role will be onsite in their SF office. What...  ...supervised fine-tuning and RL, and working on evaluations and deployments. ~ Familiar with SFT, FL, DPO, PPO, GRPO... 
    Senior
    Work at office

    DRH Search

    San Francisco, CA
    7 hours ago
  • $141k - $249k

     ...memory to pinpoint performance bottlenecks. - Identify and evaluate emerging technologies that can be adopted into Waabi’s training...  ...compilation/deployment techniques. - Work with researchers and ML engineers on best-practices for optimal resource usage. - Create and... 
    Senior
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    2 days ago
  • $190k - $205k

     ...electrical, and visual signals Production Engineering Write clean, scalable, well-tested...  ...large shared codebase. Build end-to-end ML pipelines including data processing, feature extraction, training, evaluation, and deployment. Optimize models for performance... 
    Senior
    Live in

    Gridware

    San Francisco, CA
    3 days ago
  • $162.8k - $203.5k

     ...business for decades. To strengthen our efforts, we are hiring a Senior ML Engineer who will work end-to-end on creating and improving new...  ...production quality code to launch machine learning models at scale Evaluate machine learning systems against business goal Experience: B... 
    Senior
    Hourly pay
    Work experience placement
    Work at office
    Local area
    3 days per week

    Socotra

    San Francisco, CA
    8 hours ago
  • $172k - $229k

     ...matter most. Omnitag, our ML-powered multimodal data...  ...framework, is the engine that powers this discovery. As a Senior Machine Learning Engineer...  ...systems into our broader autonomy stack as experimental or...  ...practices for model training, evaluation, and deployment.... 
    Senior
    Work at office
    Remote work

    Motional

    San Francisco, CA
    18 days ago
  •  ...A leading robotics company in San Francisco is seeking a Machine Learning Engineer to lead the transition from teleoperation to autonomy. The ideal candidate will have extensive experience in machine learning, particularly in robotics and computer vision. Responsibilities... 

    Avatar Robotics

    San Francisco, CA
    7 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Engineer, Autonomy Evaluation. Be the first to apply!