Technical Lead, Evaluation Infrastructure
Nuro
Job Description
Job Description
Who We Are
Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver™, to support a wide range of applications, from robotaxis and commercial fleets to personally owned vehicles. With technology proven over years of self-driving deployments, Nuro gives the automakers and mobility platforms a clear path to AVs at commercial scale, empowering a safer, richer, and more connected future.
About the RoleEvaluation Infrastructure plays a critical role at Nuro, directly enabling L4 driverless deployment. The team supports two demanding workloads: day-to-day Autonomy Evaluation that powers rapid software iteration, and large-scale Driverless Safety Validation that produces the rigorous evidence required to deploy autonomy on public roads.
The Evaluation Infrastructure team builds the metrics framework, evaluation pipelines, introspection tooling, and analysis products that turn raw on-road and simulation logs into actionable insight. Our metrics stack spans both heuristic and ML-based approaches, covering everything from low-level component accuracy to end-to-end behavior quality. The platform empowers autonomy and Systems & Safety teams to run complex evaluations and validations across a wide range of configurations and scales, producing the high-fidelity metrics that drive both short-term iteration and long-term release confidence — in close partnership with Simulation and the broader AI Platform.
As the Technical Lead, you will lead the team with deep technical guidance and rigor, setting the technical bar, shortening the time-to-signal for evaluation and the time-to-confidence for validation, so that both autonomy and Systems & Safety teams can iterate fast while deploying software safely.
About the Work- Build and own a unified metrics, evaluation, and validation platform — pipelines, introspection tooling, and analysis products that turn on-road and simulation logs into high-fidelity signals for autonomy iteration and driverless safety validation
- Drive the technical bar for metric quality across both heuristic and ML-based approaches; invest in the scale, reliability, and CI/CD of the evaluation stack to shorten time-to-signal for evaluation and time-to-confidence for validation, and to meet high SLAs for downstream stakeholders
- Mentor and grow the Evaluation Infrastructure team, and champion AI-native engineering practices that compound team velocity and code quality
- Partner with Product, Autonomy, Systems & Safety, and Simulation teams to define and execute the vision and strategy for evaluation at Nuro
- You have a degree in B.Sc or M.Sc., plus 4 years of relevant work experience
- Domain experience: Strong fluency in distributed systems, large-scale data and ML evaluation pipelines, metrics frameworks (heuristic and/or ML-based), and analytics platforms
- Engineering leadership: Experience setting technical vision, roadmap, and prioritization for a team operating at the intersection of autonomy, safety, and data infrastructure; a clear, concise communicator who partners effectively with PMs, engineers, and cross-functional stakeholders across Autonomy, Systems & Safety, and Simulation
- Technical excellence: Ability and willingness to deep-dive into implementation; sets the technical bar for metric quality, pipeline rigor, and safety-critical engineering practice across the broader software organization; strong proficiency in Python, C++, or similar languages
- AI-native mindset: Daily user of modern AI coding assistants and agentic tools (Claude Code, Cursor, and similar), with strong intuition for where they accelerate engineering work and where they don't; eager to apply LLMs and ML systems to evaluation problems, from automated triage and metric generation to natural-language analysis of fleet behavior; raises the team's productivity, code quality, and signal density through thoughtful AI integration
- Knowledge of data engineering, and its tooling and best practices
- Knowledge of batch and streaming data processing, warehousing, and analytics solutions
- Experience with data workflow orchestration platforms
- Prior experience building evaluation, validation, or analytics platforms, ideally in autonomy, robotics, or safety-critical systems
At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $193,930,200 and $291,150/year for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.
At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics.
- ...safer, richer, and more connected future. About the Role Evaluation Infrastructure plays a critical role at Nuro, directly enabling L4... ...partnership with Simulation and the broader AI Platform. As the Technical Lead, you will lead the team with deep technical guidance and...SuggestedTemporary workWork experience placement
$235.03k - $352.29k
...Technical Lead Manager, Autonomy Evaluation and Intelligence Mountain View, California (HQ) Who We Are Nuro is a self-driving technology company... ...Work cross-functionally with Autonomy and Infrastructure engineers to set a roadmap that unifies evaluation frameworks...Suggested- ...Tech Lead, AI Compute Infrastructure Los Angeles, Palo Alto, San Francisco, Toronto, Singapore About... .... We are seeking a seasoned Technical Leader to build and scale the foundational... ...model training, and continuous evaluation/benchmarking. Enhance Observability...SuggestedFull time
$235.03k - $352.29k
...Technical Lead Manager, ML Platform Infrastructure Mountain View, California (HQ) Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge...Suggested$237k - $329k
Robotics Automation Technical Lead Manager, Platforms Infrastructure Google Sunnyvale, CA, USA Qualifications Bachelor’s degree or equivalent practical experience. 8 years of experience in leading cross-functional teams, and robotics product development. Experience...SuggestedFull timeContract workRemote workWorldwideFlexible hours$238k - $302k
...Senior Software Engineer, ML Evaluation Infra and Efficiency Waymo is an autonomous driving technology company with the mission to... ...requirements and scenarios, and improve DevX of the evaluation infrastructure. Improve runtime goodput of ML inference workload and...Full timeRemote work$170k - $216k
...Software Engineer, Quantitative Evaluations Waymo is an autonomous driving technology company with the mission to be the world's... ...C++ Experience with ML Experience with A/B experiment infrastructure Experience building and validating metrics to measure quality...Full timeRemote work$332k - $421k
...Principal Software Engineer, ML Flywheel Technical Lead Waymo is an autonomous driving... ..., (auto-)labeling, model training and evaluation, all the way to model deployment and monitoring... ...in close collaboration with Waymo's infrastructure, modeling and evaluation teams. They...Full timeRemote work$238k - $302k
...Software Engineer, Simulator Evaluation Waymo is an autonomous... ...Software Engineer to act as the Technical Architect for this domain.... ...Manager and act as a Technical Lead, bridging the gap between... ...evaluation platforms, or back-end infrastructure. ~ Expertise in designing...Full timeRemote workShift work$160.36k - $240.54k
...Software Engineer, Offboard Infrastructure Mountain View, California (HQ) Nuro is a self... ...areas: Data Platform, Simulation, and Technical Infrastructure. Data Platform: The... ..., the platform supports the autonomy evaluation infrastructure by providing detailed introspection...$153k - $222k
...Valley company is creating the digital infrastructure needed to bring intelligence to every moving... ..., training frameworks, compute, evaluation, and deployment) and work directly with... ...encourage all engineers to take ownership over technical and product decisions, closely interact...Full timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift$170k - $216k
...Onboard Infrastructure Software Engineer Waymo is an autonomous driving technology company... ...position reporting to a Staff Engineer, Tech Lead Manager. In this hybrid role, you... ...environments ~ Experience developing evaluation systems and metrics ~ Experience...Full timeRemote work$170k - $216k
...across 15+ U.S. states. The Simulation Infrastructure team creates reliable, scalable, and cost... ...effective Simulation-based products that evaluate the Waymo Driver's software stack at a massive scale. We solve complex technical challenges to build services and tools...Full timeRemote work$152k - $228k
...leverages many different bench-top systems to evaluate and regression test different aspects... ...it reaches the road. You will own the infrastructure that makes this possible. Our... ...the entire autonomy stack. You'll be the technical DRI for the platform — setting the roadmap...Temporary work$160.36k - $240.54k
...Software Engineer, Onboard Infrastructure Mountain View, California (HQ) Nuro is a self-driving technology company on a mission to... ...with internal stakeholders and external suppliers to define, evaluate, and integrate the next-generation HW platform for Nuro's products...$157k - $235k
...Snap Engineering teams build fun and technically sophisticated products that reach hundreds... ...a critical role in scaling our ML Infrastructure, optimizing training and inference systems... ...perform scalable ML model training, evaluation, and inference in the cloud...Live inWork at officeLocal area$240k - $320k
...Define and drive execution of the technical roadmap and strategy for the E2E AI... ...with other functional tech leads (e.g. data engineering, infrastructure) to define and drive the overall architecture... ...framework that enables fast evaluation and integration of emerging E2E AI...Full timeWork experience placementLocal areaFlexible hours$174k - $252k
Senior Software Engineer, AI/ML, AI and Infrastructure Apply X Note: By applying to this... ...infrastructure (e.g., model deployment, model evaluation, optimization, data processing,... ...or PhD in Computer Science or related technical field. 5 years of experience with data...Full timeWorldwide$235.03k - $352.29k
...Technical Lead, Behavior & Triage Labeling Mountain View, California (HQ) Nuro is a... ...quantity and diversity of its training and evaluation data. The team plays a crucial role... ...execution framework, supporting infrastructure, and a suite of data annotation tools....$107.4k - $143.2k
...is currently looking for a dedicated Technical Program Lead for a high visibility technology and... ...program execution across complex lab and infrastructure environments. The ideal candidate... .../proposal at the project outset. Evaluate documents and communicate the client'...Full timeContract workTemporary workLocal area$158.9k - $238.3k
...centralizing the management of Infrastructure, Technology, and Data. The... ...SLAs for platform services; lead blameless post-mortems and... ...evolving R&D requirements. Evaluate and integrate new private... ...technologies and tooling, providing technical recommendations and proof-of...Permanent employmentLocal area$150k - $170k
...currently seeking a DevOps Engineer - Infrastructure to join our growing team working out of... ...View, CA HQ. This position will be the technical escalation point for Global End Users and... ...of applications. As part of the evaluation process, we provide Endorsed with job requirements...Temporary workWork at officeRemote workVisa sponsorshipFlexible hours$153k - $222k
About the role We are looking for both infrastructure engineers with expertise in machine learning... ..., training frameworks, compute, evaluation, and deployment) and work directly with... ...encourage all engineers to take ownership over technical and product decisions, closely interact...Full timeFor contractorsFor subcontractor$207k - $300k
...On-Device Machine Learning Infrastructure corporate_fare Google place... ...g., model deployment, model evaluation, data processing, debugging,... ...Computer Science, or a related technical field. 8 years of... ...device ML infrastructure with leading performance, enabling framework...Full timeShift work- A leading technology company in Sunnyvale is seeking a Software Engineer for its Edge Engineering team. This role involves evaluating, testing, developing, and maintaining software solutions for the Edge Infrastructure, which supports content distribution for Apple's services...
$147.4k - $220.9k
Software Engineer (Edge Services), Infrastructure Services Sunnyvale,... ...amazing technology to industry-leading environmental efforts. Join... ...Infrastructure. Responsibilities Evaluate, test, develop, and maintain... ...at scale and driving technical innovation. Experience with...Relocation$231.9k - $298.1k
...Valley company is creating the digital infrastructure needed to bring intelligence to every... ...States and Europe. We are looking for a Technical Lead Manager to own the perception model at... .... You will lead the team that trains, evaluates, and ships this model, and you will be...Odd jobFull timeFor contractorsFor subcontractorCasual workWork at officeRemote workDay shift- ...We're looking for a Tech Lead Manager (TLM) to own and drive... ...of your time on hands-on technical work and 30% on people management... ...Tech Lead Manager, AI 1 evaluation, and set the bar for applied... ...training and inference infrastructure, set standards for offline/...Remote workFlexible hours
$140k - $252k
...broader AI ecosystem. We're seeking an ML/RL Infra Engineer to build scalable, reliable infrastructure that powers these agents and enables seamless, high-volume rollouts for model evaluation & RL training. Top candidates will have deep experience in large-scale ML systems,...Hourly payFull timeTemporary workFlexible hours$124k - $420k
...Engineer for the Optimus team, you will build the tools and infrastructure to make and measure improvements to neural network architecture... ...with exporting and deploying neural networks to the bot, and evaluate experimental results. You will help us automate the entire...Hourly payFull timeTemporary workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Technical Lead, Evaluation Infrastructure. Be the first to apply!

