Senior ML Engineer, Autonomy Evaluation

Bedrock Robotics Inc

Join the team bringing advanced autonomy to the built world

At Bedrock, we’re moving AI out of the lab and into the real world. Our team is composed of industry veterans who helped launch Waymo, scaled Segment to a $3.2B acquisition, and grew Uber Freight to $5B in revenue. Today, we’re deploying autonomous systems on heavy construction machinery across the country, accelerating the schedules of billion-dollar infrastructure projects and improving safety on job sites. Backed by $350M in funding, we’re working quickly to close the gap between America’s surging demand for housing, data centers, and manufacturing hubs and the construction industry’s growing labor shortage.

This is where algorithms meet steel-toed boots. You’ll collaborate with construction veterans and world-class engineers to solve physical-world problems that simulations can’t touch. If you’re ready to apply cutting-edge technology to solve meaningful problems alongside a talented team, we’d love to have you join us.

Machine Learning Engineer: Evaluation

Bedrock is bringing autonomy to the construction industry! We’re a group of veterans from the autonomous vehicle industry who are passionate about bringing the benefits of automation to areas in the construction industry currently underserved by the market.

We’re looking for a highly motivated engineer with experience evaluating complex ML systems deployed in the real world. Your mission: translate the infinite nuance of the built world into actionable, AI-native evaluations that accelerate Bedrock Operator adoption.

The ideal candidate has hands-on experience building evaluation systems and designing and executing statistical tests to gauge performance deltas between system iterations. More importantly, you’ve iterated on complex ML systems running in production environments, and you understand the complexities that come with that work.

What you’ll do:

Design and maintain eval systems: Build pipelines for measuring system performance across open-loop and closed-loop simulation, hardware-in-the-loop systems, and field data from Bedrock Operator-equipped machinery. Enable other teams to gain insights earlier in the development cycle through streamlined workflows.

Develop metrics: Connect product goals and system behavior by bridging real-world specifications to measurable indicators from logged data. Empower confident decision making, from parameter tuning to program planning, by slicing through the noise and delivering objective insights.

Classify data sources for training and testing: Implement infrastructure and classifiers to self-annotate data and allow creation of datasets for a variety of training and evaluation use cases. Leverage models to source rich annotations for massive datasets to accelerate model iteration.

Predict system performance: Model metrics and interpret results from sources ranging from raw sensor data to key leading indicators. Determine whether new construction sites pose hidden challenges and drive business decisions about deployment readiness.

What we’re looking for:

Engineers who are currently Senior or Staff level with 5+ years of professional software engineering, data science, or research experience
2+ years of professional experience analyzing modern ML or robotics system performance on real-world problems
Proficiency in Python and a data warehouse query language, and comfort with development on infrastructure within parallelized cloud-based frameworks
Strong statistical analysis skills (classification, model fit bias determination, hypothesis testing, and uncertainty quantification)
Experience working with large datasets

Bonus points:

We’re especially interested in engineers who have applied statistical backgrounds to ML research or real-world robotics applications.

Our roles are often flexible. If you don’t fit all the criteria, or are in another location (especially one where we have an office, like SF or NY), please apply anyway! We’d love to consider you.

Vacancy posted 8 hours ago

