Technical Lead, Evaluation Infrastructure

Nuro

Job Description

Who We Are

Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver™, to support a wide range of applications, from robotaxis and commercial fleets to personally owned vehicles. With technology proven over years of self-driving deployments, Nuro gives automakers and mobility platforms a clear path to AVs at commercial scale, empowering a safer, richer, and more connected future.

About the Role

Evaluation Infrastructure plays a critical role at Nuro, directly enabling L4 driverless deployment. The team supports two demanding workloads: day-to-day Autonomy Evaluation that powers rapid software iteration, and large-scale Driverless Safety Validation that produces the rigorous evidence required to deploy autonomy on public roads.

The Evaluation Infrastructure team builds the metrics framework, evaluation pipelines, introspection tooling, and analysis products that turn raw on-road and simulation logs into actionable insight. Our metrics stack spans both heuristic and ML-based approaches, covering everything from low-level component accuracy to end-to-end behavior quality. The platform empowers autonomy and Systems & Safety teams to run complex evaluations and validations across a wide range of configurations and scales, producing the high-fidelity metrics that drive both short-term iteration and long-term release confidence — in close partnership with Simulation and the broader AI Platform.

As the Technical Lead, you will lead the team with deep technical guidance and rigor, setting the technical bar and shortening both the time-to-signal for evaluation and the time-to-confidence for validation, so that autonomy and Systems & Safety teams can iterate fast while deploying software safely.

About the Work
  • Build and own a unified metrics, evaluation, and validation platform — pipelines, introspection tooling, and analysis products that turn on-road and simulation logs into high-fidelity signals for autonomy iteration and driverless safety validation
  • Drive the technical bar for metric quality across both heuristic and ML-based approaches; invest in the scale, reliability, and CI/CD of the evaluation stack to shorten time-to-signal for evaluation and time-to-confidence for validation, and to meet high SLAs for downstream stakeholders
  • Mentor and grow the Evaluation Infrastructure team, and champion AI-native engineering practices that compound team velocity and code quality
  • Partner with Product, Autonomy, Systems & Safety, and Simulation teams to define and execute the vision and strategy for evaluation at Nuro

About You
  • You have a B.Sc. or M.Sc. degree, plus 4 years of relevant work experience
  • Domain experience: Strong fluency in distributed systems, large-scale data and ML evaluation pipelines, metrics frameworks (heuristic and/or ML-based), and analytics platforms
  • Engineering leadership: Experience setting technical vision, roadmap, and prioritization for a team operating at the intersection of autonomy, safety, and data infrastructure; a clear, concise communicator who partners effectively with PMs, engineers, and cross-functional stakeholders across Autonomy, Systems & Safety, and Simulation
  • Technical excellence: Ability and willingness to deep-dive into implementation; sets the technical bar for metric quality, pipeline rigor, and safety-critical engineering practice across the broader software organization; strong proficiency in Python, C++, or similar languages
  • AI-native mindset: Daily user of modern AI coding assistants and agentic tools (Claude Code, Cursor, and similar), with strong intuition for where they accelerate engineering work and where they don't; eager to apply LLMs and ML systems to evaluation problems, from automated triage and metric generation to natural-language analysis of fleet behavior; raises the team's productivity, code quality, and signal density through thoughtful AI integration

Bonus Points
  • Knowledge of data engineering, and its tooling and best practices
  • Knowledge of batch and streaming data processing, warehousing, and analytics solutions
  • Experience with data workflow orchestration platforms
  • Prior experience building evaluation, validation, or analytics platforms, ideally in autonomy, robotics, or safety-critical systems

At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $193,930,200 and $291,150/year for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.

At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics.

Vacancy posted 3 days ago