Senior ML Engineer, Autonomy Evaluation
Bedrock Robotics Inc
Join the team bringing advanced autonomy to the built world

At Bedrock, we’re moving AI out of the lab and into the real world. Our team is composed of industry veterans who helped launch Waymo, scaled Segment to a $3.2B acquisition, and grew Uber Freight to $5B in revenue. Today, we’re deploying autonomous systems on heavy construction machinery across the country, accelerating project schedules of billion-dollar infrastructure projects and improving safety on job sites.

Backed by $350M in funding, we’re working quickly to close the gap between America’s surging demand for housing, data centers, and manufacturing hubs and the construction industry’s growing labor shortage.

This is where algorithms meet steel-toed boots. You’ll collaborate with construction veterans and world-class engineers to solve physical-world problems that simulations can’t touch. If you’re ready to apply cutting-edge technology to solve meaningful problems alongside a talented team, we’d love to have you join us.

Machine Learning Engineer: Evaluation

Bedrock is bringing autonomy to the construction industry! We’re a group of veterans from the autonomous vehicle industry who are passionate about bringing the benefits of automation to areas in the construction industry currently underserved by the market.

We’re looking for a highly motivated engineer with experience evaluating complex ML systems deployed in the real world. Your mission: translate the infinite nuance of the built world into actionable, AI-native evaluations that accelerate Bedrock Operator adoption.

The ideal candidate has hands-on experience building evaluation systems and designing and executing statistical tests to gauge performance deltas between system iterations. More importantly, you’ve iterated on complex ML systems run in production environments, and you understand the complexities that come with them.
What you’ll do:
- Design and maintain eval systems: Build pipelines for measuring system performance across open loop and closed loop simulation, hardware-in-the-loop systems, and field data from Bedrock Operator-equipped machinery. Excite other teams to gain insights earlier in the development cycle through streamlined workflows.
- Develop metrics: Connect product goals and system behavior by bridging real-world specification to measurable indicators from logged data. Empower confident decision making from parameter tuning to program planning by slicing through the noise and delivering objective insights.
- Classify data sources for training and testing: Implement infrastructure and classifiers to self-annotate data and allow creation of datasets for a variety of training and evaluation use cases. Leverage models to source rich annotations for massive datasets to accelerate model iteration.
- Predict system performance: Model metrics and interpret results from various sources ranging from raw sensor data to key leading indicators. Determine whether new construction sites pose hidden challenges and drive business decisions about deployment readiness.

What we’re looking for:
- Engineers who are currently Senior or Staff level with 5+ years of professional software engineering, data science, or research experience
- 2+ years of professional experience analyzing modern ML or robotics system performance on real-world problems
- Proficiency in Python and a data warehouse query language and comfort with development on infrastructure within parallelized cloud-based frameworks
- Strong statistical analysis skills (classification, model fit bias determination, hypothesis testing, and uncertainty quantification)
- Experience working with large datasets

Bonus points:
- We’re especially interested in engineers who have applied statistical backgrounds to ML research or real-world robotics applications.

Our roles are often flexible. If you don’t fit all the criteria, or are in another location (especially one where we have an office like SF or NY), please apply anyway! We’d love to consider you.


