Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Scientist, Behavior Evaluation

Zoox Inc.

Job Description

Job Description

As a Data Scientist on the Behavior Evaluation team, you will be the statistical anchor ensuring our autonomous driving systems navigate highway environments with world-class safety, efficiency, and comfort. Highway evaluation presents a unique industry challenge: verifying vehicle behavior at high velocities where the margin for error is razor-thin, and critical edge cases are buried in petabytes of data.

In this role, you will bridge advanced statistical methodology with scalable software engineering. You will design the mathematical frameworks, statistical tests, and data-driven metrics that evaluate our planner's decisions. Working directly with large-scale simulation and real-world fleet data, your insights will define our validation pipelines, identify behavioral regressions, and directly shape the software powering our next-generation autonomous fleet.

In this role, you will:

  • Design Advanced Experimental Frameworks: Formulate robust statistical models, hypothesis testing frameworks, and quasi-experimental designs (such as synthetic controls or matching) to rigorously validate highway planner behavior in simulation and shadow-mode deployments.

 

  • Model Tail Risks & Rare Events: Use Surrogate Safety Measures (e.g., TTC, PET) to accurately model and predict low-frequency, high-severity edge cases that traditional mean-based statistics miss.

 

  • Architect Scenario-Based Metrics: Own and mature critical behavioral KPIs, utilizing data stratification to analyze complex driving scenarios (e.g., high-speed merging, cut-ins) while proactively identifying statistical anomalies like Simpson’s Paradox.

 

  • Surface Statistical Edge Cases: Apply data mining and advanced statistical techniques to isolate low-frequency, high-severity edge cases and systemic Autonomy engineering debt.

  • Drive Cross-Functional Alignment: Translate complex statistical findings and multi-source evaluations into clear, actionable technical recommendations, collaborating closely with Autonomy Software Engineers, Safety Systems, and Product teams.
Qualifications:

  • Education: Bachelor’s or Master’s degree in a highly quantitative field (e.g., Statistics, Mathematics, Data Science, Operations Research, or a related field with a strong statistical focus).

 

  • Experience: 3–6+ years of professional experience as a Data Scientist or Quantitative Engineer, with a proven track record of landing data-driven impact.

 

  • Strong Statistical Foundations: Deep understanding of hypothesis testing, experimental design, regression analysis, non- parametric/resampling methods (e.g., bootstrapping, permutation tests), and time-series analysis handling autocorrelated data.

 

  • Strong Programming: High proficiency in Python (Pandas, NumPy, SciPy, scikit-learn) and the ability to write highly complex, optimized SQL queries for massive distributed databases.

  • Communication: Exceptional ability to articulate complex mathematical methodologies and statistical results to cross-functional engineering partners.
Bonus Qualifications

  • Robotics or Autonomy Background: Experience analyzing spatial-temporal data, sensor logs, or vehicle telemetry from robotics, autonomous vehicles, or aviation systems.

 

  • Simulation-Based Testing: Familiarity with validating software systems using empty-world or simulation platforms at scale.

  • Modern Data Stack: Experience with workflow orchestration tools (e.g., Airflow) and building advanced data visualization layers (e.g., Superset).

Base Salary Range

 

 

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

 

Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

About Zoox

Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Follow us on LinkedIn

Accommodations

If you need an accommodation to participate in the application or interview process please reach out to View email address on ziprecruiter.com or your assigned recruiter.

A Final Note:

You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Data Scientist, Behavior Evaluation in Boston, MA vacancy
  • $176k - $240k

     ...capacity to measure and understand autonomous vehicle behavior. We are looking for a Data Scientist to join our Autonomy Integration Metrics team, where...  ...focus on metric design and development or performance evaluation ~ Strong proficiency in Python in production... 
    Suggested
    Temporary work
    Relocation package

    Zoox

    Boston, MA
    2 days ago
  •  ...capacity to measure and understand autonomous vehicle behavior. We are looking for a Data Scientist to join our Autonomy Integration Metrics team, where...  ...focus on metric design and development or performance evaluation ~ Strong proficiency in Python in production... 
    Suggested
    Temporary work
    Relocation package

    Zoox

    Boston, MA
    3 days ago
  • Apple Inc. is seeking an Annotation Data Scientist for the Evaluation Integrity team in Cambridge, Massachusetts. This role focuses on designing human-in-the-loop (HITL) annotation projects that evaluate the quality of Siri interactions and systems. The ideal candidate... 
    Suggested

    Apple Inc.

    Cambridge, MA
    2 days ago
  • $154.6k - $274.9k

    Annotation Data Scientist, Evaluation Integrity (Siri) Cambridge, Massachusetts, United States — Machine Learning and AI Play a part in the ongoing revolution in human-computer interaction. Siri is evolving — and the way we evaluate it has to evolve with it. Join the... 
    Suggested
    Relocation

    Apple Inc.

    Cambridge, MA
    1 day ago
  •  ...About the Role We're looking for a Data Scientist to own the quality, reliability, and trustworthiness of our clinical AI outputs. You'll...  ...systems that ensure our AI "knows what it doesn't know"—developing evaluation frameworks, calibrated confidence scoring, and automated... 
    Suggested

    Bioscope.ai, Inc.

    Boston, MA
    3 days ago
  • $60 per hour

    A leading AI development company is seeking experienced quantitative professionals to evaluate and shape AI-generated analyses. This fully remote position offers flexibility in projects and competitive hourly pay up to $60 USD. Candidates should have 2+ years in quantitative... 
    Remote job
    Hourly pay

    DataAnnotation

    Boston, MA
    3 days ago
  • $172k - $229k

     ...petabytes of multimodal sensor data. Our next-generation...  ...prompting, and goal-directed behavior. Integrate such systems into...  ...Collaborate : Work closely with ML scientists, data engineers, and autonomy...  ...practices for model training, evaluation, and deployment. What We're... 
    Work at office
    Remote work

    Motional

    Boston, MA
    4 days ago
  •  ...seeking a highly skilled and experienced Data Scientist to join our Perception Verification and...  ...stack as well as end-to-end system behavior. In this role, You will... Define...  ...Create and maintain datasets used to evaluate the perception stack and end-to-end driving... 
    Temporary work
    Relocation package

    Zoox

    Boston, MA
    more than 2 months ago
  • $150k - $210k

     ...bodies and daily lives. Our data science algorithms teams are...  ...build on physiological and behavioral data streams. This role emphasizes...  ...in collaboration with data scientists and MLOps engineers...  ...ML development (frameworks, evaluation criteria, performance validation... 
    Full time
    Work at office
    Relocation

    WHOOP

    Boston, MA
    2 days ago
  •  ...Senior Software Engineer - AI & Data Engineering Location:...  ...enhancements LLM evaluation and response tuning Participate...  ...with AI engineers, data scientists, and cloud architects to optimize...  ...experience preferred. Behaviors & Abilities Strong ability... 

    Saviance

    Boston, MA
    4 days ago
  • $159k - $207k

     ...Senior Data Engineer Boston, MA September 24, 2025 We are seeking a highly skilled...  ...our large-scale AI model and software evaluation framework – Ground Truth Regression. The...  ...which can impact the autonomous vehicle behavior. The GTRegression team validates the... 

    Venturefizz Product Management Community

    Boston, MA
    1 day ago
  • $55 - $72 per hour

     ...About the job Data Scientist Why CiviTronix? At CiviTronix, we believe that data...  ...engineering solutions, predict system behaviors, and support strategic business decisions...  ...across projects and teams. Evaluate and fine-tune models over time to ensure... 
    Remote work
    Monday to Friday
    Shift work

    CPJ Recruitment

    Boston, MA
    1 day ago
  • $142.3k - $195.7k

     ..., and control to develop and evaluate agents that can set, adapt, and...  ...-step, self-directed agent behavior Develop evaluation frameworks...  ...in SQL, Python, and data analysis/data mining tools....  ...engineering or an applied research scientist position preferably with a focus... 
    Bi-weekly pay
    Full time
    Temporary work
    Apprenticeship
    Work at office
    Remote work
    Work from home
    Home office

    Humana Inc

    Boston, MA
    11 hours ago
  • $175.5k - $180k

    Tech & AI Senior Data Engineer I - QuantumBlack, AI by McKinsey Job ID: 109245...  ...functional Agile teams alongside Data Scientists, Machine Learning Engineers, and industry...  ...end, with sound judgment around model behavior, evaluation, reliability, guardrails, and the... 
    Apprenticeship
    Work at office
    Local area
    Easy work

    McKinsey & Company

    Boston, MA
    1 day ago
  • $144.5k - $170k

     ...fully supported. We are looking for a Data Protection Engineer (L4) to help...  ...Information Event Management - SIEM, User Behavioral Analytics - UBA, Data Loss Prevention -...  ...day period. We encourage you to carefully evaluate how your skills and interests align with... 
    Local area

    Coinbase

    Boston, MA
    4 days ago
  • $60 per hour

     ...experienced quantitative professionals for fully remote work. You'll evaluate AI-generated analyses and help shape future AI models. The ideal...  ...projects. Join the team to contribute to advancements in AI systems focusing on data and analytics. #J-18808-Ljbffr DataAnnotation
    Remote job

    DataAnnotation

    Boston, MA
    4 days ago
  •  ...We are seeking a Senior or Principal Data Scientist/Machine Learning Scientist to lead product...  ...Identify success metrics and build evaluation frameworks for both model and product performance...  ...A/B testing, product metrics and user behavior analytics Preferred Qualifications... 
    Work at office

    Datalign Advisory, Inc.

    Cambridge, MA
    21 hours ago
  • $75.28k - $109.55k

     ...Responsible for designing and implementing data analysis and reporting solutions to meet...  ..."looks like" by specifying which behaviors are most critical for successful performance...  ...success. These competencies are used to evaluate performance, make hiring decisions, identify... 
    Remote work
    Shift work

    Brigham and Women's Hospital

    Somerville, MA
    1 day ago
  • $153k - $222k

     ...or support role. Experience with "Big Data" technologies or concepts (e.g., analytics...  ...dynamics, and customer buying behavior. About the Job When leading companies choose...  ...from use case identification, technical evaluation, and through customer ramp. Combine business... 
    Full time

    Google

    Cambridge, MA
    3 days ago
  • $172k - $229k

     ...Level Technical Lead Manager to build and lead our new AI Data Engine team. This pivotal role will drive the strategy,...  ...generating ML datasets for large computer vision and behavioral models. Identify and evaluate cutting-edge technologies and methodologies for... 
    Work at office
    Remote work

    Motional

    Boston, MA
    11 days ago
  • $73.8k - $107.4k

     ...defines what effective leadership “looks like” by specifying which behaviors are most critical for successful performance at each job level...  ...and career success. These competencies are used to evaluate performance, make hiring decisions, identify development needs... 
    Work at office
    Remote work
    Flexible hours
    Shift work
    2 days per week
    1 day per week

    Mass General Brigham Health Plan, Inc.

    Somerville, MA
    3 days ago
  • $89.4k - $183.5k

     ...role for you. We are seeking a Senior Data Scientist to join our innovation hub-a small,...  ...predictive models to explain events, forecast behaviors, identify risk, or perform segmentation...  ..., and aligned with business needs. Evaluate alternative approaches and select... 
    Temporary work
    Worldwide
    Flexible hours

    Unum Group

    Boston, MA
    3 days ago
  • $117k - $167k

     ...Spotify is seeking a Data Scientist II to join Product Trust Insights (PTI) within Trust &...  ...work spans pre-launch risk assessment, evaluation of AI features and agentic systems, longitudinal...  ...LLM-based evaluation approaches, behavioral instrumentation, and measurement... 
    Work from home
    Flexible hours

    Spotify.space

    Boston, MA
    3 days ago
  • $98.34k - $201.9k

     ...Summary: We're looking for a Senior Data Scientist who can bridge the gap between our most...  ...prescriptive models to explain outcomes, forecast behavior, and identify risks and opportunities....  ...and rapid prototyping to evaluate emerging capabilities. Embed analytics... 
    Temporary work
    Worldwide
    Flexible hours

    Unum Group

    Boston, MA
    4 days ago
  •  ...development. As an Individual Contributor on the Data Studio team, you will play a key role in...  ...datasets that power model training, evaluation, and customer delivery. This role is...  ...needing new data. Influence model behavior by supplying representative engineering... 
    For contractors

    Foundationllm

    Boston, MA
    1 day ago
  •  ...serve as ground truth for training and evaluating AI systems. Define and maintain standards...  ...requirements and influence model behavior through representative engineering examples...  ...quality, and ensuring timely delivery of data assets What we’re looking for Bachelor... 
    For contractors

    Foundationllm

    Boston, MA
    4 days ago
  •  ...technology solutions that support complex data and operational workflows within a...  ...stakeholders to understand requirements, evaluate solution options, and recommend scalable...  ...including unit testing and domain-driven or behavior-driven development Familiarity with cloud... 

    Huxley

    Boston, MA
    3 days ago
  • $129k - $203.1k

     ...Learning expert for the position of Senior Scientist, Data Science within our Pharmacokinetics,...  ...automated report generation, quality evaluation, consistency checks, process monitoring...  ...monitoring for model drift, data quality, agent behavior, and downstream impact. Contribute to... 

    Merck & Co. Inc

    Boston, MA
    3 days ago
  • $137.8k - $206.6k

     ...responsible for architecting analytical data models, designing scalable cloud-based data...  ...with passion by modeling EMD Serono behaviors and working with a sense of urgency to achieve...  ..., and cost efficiency. Continuously evaluate and adopt new BI, analytics, and data-warehouse... 
    Temporary work
    Flexible hours

    EMD Millipore

    Boston, MA
    6 days ago
  • $166k - $200k

     ...enhancing the investment process through data driven insights, portfolio analytics, and...  ...relevant to company fundamentals and market behavior Designs and implements back testing...  ...to all applicants and employees, and we evaluate qualified applicants without regard to ancestry... 
    Local area
    Flexible hours

    Franklin Templeton

    Boston, MA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Scientist, Behavior Evaluation. Be the first to apply!