Data Scientist, Behavior Evaluation
Zoox Inc.
Job Description
Job Description
As a Data Scientist on the Behavior Evaluation team, you will be the statistical anchor ensuring our autonomous driving systems navigate highway environments with world-class safety, efficiency, and comfort. Highway evaluation presents a unique industry challenge: verifying vehicle behavior at high velocities where the margin for error is razor-thin, and critical edge cases are buried in petabytes of data.
In this role, you will bridge advanced statistical methodology with scalable software engineering. You will design the mathematical frameworks, statistical tests, and data-driven metrics that evaluate our planner's decisions. Working directly with large-scale simulation and real-world fleet data, your insights will define our validation pipelines, identify behavioral regressions, and directly shape the software powering our next-generation autonomous fleet.
In this role, you will:Design Advanced Experimental Frameworks: Formulate robust statistical models, hypothesis testing frameworks, and quasi-experimental designs (such as synthetic controls or matching) to rigorously validate highway planner behavior in simulation and shadow-mode deployments.
Model Tail Risks & Rare Events: Use Surrogate Safety Measures (e.g., TTC, PET) to accurately model and predict low-frequency, high-severity edge cases that traditional mean-based statistics miss.
Architect Scenario-Based Metrics: Own and mature critical behavioral KPIs, utilizing data stratification to analyze complex driving scenarios (e.g., high-speed merging, cut-ins) while proactively identifying statistical anomalies like Simpson’s Paradox.
Surface Statistical Edge Cases: Apply data mining and advanced statistical techniques to isolate low-frequency, high-severity edge cases and systemic Autonomy engineering debt.
- Drive Cross-Functional Alignment: Translate complex statistical findings and multi-source evaluations into clear, actionable technical recommendations, collaborating closely with Autonomy Software Engineers, Safety Systems, and Product teams.
Education: Bachelor’s or Master’s degree in a highly quantitative field (e.g., Statistics, Mathematics, Data Science, Operations Research, or a related field with a strong statistical focus).
Experience: 3–6+ years of professional experience as a Data Scientist or Quantitative Engineer, with a proven track record of landing data-driven impact.
Strong Statistical Foundations: Deep understanding of hypothesis testing, experimental design, regression analysis, non- parametric/resampling methods (e.g., bootstrapping, permutation tests), and time-series analysis handling autocorrelated data.
Strong Programming: High proficiency in Python (Pandas, NumPy, SciPy, scikit-learn) and the ability to write highly complex, optimized SQL queries for massive distributed databases.
- Communication: Exceptional ability to articulate complex mathematical methodologies and statistical results to cross-functional engineering partners.
Robotics or Autonomy Background: Experience analyzing spatial-temporal data, sensor logs, or vehicle telemetry from robotics, autonomous vehicles, or aviation systems.
Simulation-Based Testing: Familiarity with validating software systems using empty-world or simulation platforms at scale.
- Modern Data Stack: Experience with workflow orchestration tools (e.g., Airflow) and building advanced data visualization layers (e.g., Superset).
Base Salary Range
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.
Follow us on LinkedIn
Accommodations
If you need an accommodation to participate in the application or interview process please reach out to View email address on ziprecruiter.com or your assigned recruiter.
A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
$176k - $240k
...capacity to measure and understand autonomous vehicle behavior. We are looking for a Data Scientist to join our Autonomy Integration Metrics team, where... ...focus on metric design and development or performance evaluation ~ Strong proficiency in Python in production...SuggestedTemporary workRelocation package- ...capacity to measure and understand autonomous vehicle behavior. We are looking for a Data Scientist to join our Autonomy Integration Metrics team, where... ...focus on metric design and development or performance evaluation ~ Strong proficiency in Python in production...SuggestedTemporary workRelocation package
- Apple Inc. is seeking an Annotation Data Scientist for the Evaluation Integrity team in Cambridge, Massachusetts. This role focuses on designing human-in-the-loop (HITL) annotation projects that evaluate the quality of Siri interactions and systems. The ideal candidate...Suggested
$154.6k - $274.9k
Annotation Data Scientist, Evaluation Integrity (Siri) Cambridge, Massachusetts, United States — Machine Learning and AI Play a part in the ongoing revolution in human-computer interaction. Siri is evolving — and the way we evaluate it has to evolve with it. Join the...SuggestedRelocation- ...About the Role We're looking for a Data Scientist to own the quality, reliability, and trustworthiness of our clinical AI outputs. You'll... ...systems that ensure our AI "knows what it doesn't know"—developing evaluation frameworks, calibrated confidence scoring, and automated...Suggested
$60 per hour
A leading AI development company is seeking experienced quantitative professionals to evaluate and shape AI-generated analyses. This fully remote position offers flexibility in projects and competitive hourly pay up to $60 USD. Candidates should have 2+ years in quantitative...Remote jobHourly pay$172k - $229k
...petabytes of multimodal sensor data. Our next-generation... ...prompting, and goal-directed behavior. Integrate such systems into... ...Collaborate : Work closely with ML scientists, data engineers, and autonomy... ...practices for model training, evaluation, and deployment. What We're...Work at officeRemote work- ...seeking a highly skilled and experienced Data Scientist to join our Perception Verification and... ...stack as well as end-to-end system behavior. In this role, You will... Define... ...Create and maintain datasets used to evaluate the perception stack and end-to-end driving...Temporary workRelocation package
$150k - $210k
...bodies and daily lives. Our data science algorithms teams are... ...build on physiological and behavioral data streams. This role emphasizes... ...in collaboration with data scientists and MLOps engineers... ...ML development (frameworks, evaluation criteria, performance validation...Full timeWork at officeRelocation- ...Senior Software Engineer - AI & Data Engineering Location:... ...enhancements LLM evaluation and response tuning Participate... ...with AI engineers, data scientists, and cloud architects to optimize... ...experience preferred. Behaviors & Abilities Strong ability...
$159k - $207k
...Senior Data Engineer Boston, MA September 24, 2025 We are seeking a highly skilled... ...our large-scale AI model and software evaluation framework – Ground Truth Regression. The... ...which can impact the autonomous vehicle behavior. The GTRegression team validates the...$55 - $72 per hour
...About the job Data Scientist Why CiviTronix? At CiviTronix, we believe that data... ...engineering solutions, predict system behaviors, and support strategic business decisions... ...across projects and teams. Evaluate and fine-tune models over time to ensure...Remote workMonday to FridayShift work$142.3k - $195.7k
..., and control to develop and evaluate agents that can set, adapt, and... ...-step, self-directed agent behavior Develop evaluation frameworks... ...in SQL, Python, and data analysis/data mining tools.... ...engineering or an applied research scientist position preferably with a focus...Bi-weekly payFull timeTemporary workApprenticeshipWork at officeRemote workWork from homeHome office$175.5k - $180k
Tech & AI Senior Data Engineer I - QuantumBlack, AI by McKinsey Job ID: 109245... ...functional Agile teams alongside Data Scientists, Machine Learning Engineers, and industry... ...end, with sound judgment around model behavior, evaluation, reliability, guardrails, and the...ApprenticeshipWork at officeLocal areaEasy work$144.5k - $170k
...fully supported. We are looking for a Data Protection Engineer (L4) to help... ...Information Event Management - SIEM, User Behavioral Analytics - UBA, Data Loss Prevention -... ...day period. We encourage you to carefully evaluate how your skills and interests align with...Local area$60 per hour
...experienced quantitative professionals for fully remote work. You'll evaluate AI-generated analyses and help shape future AI models. The ideal... ...projects. Join the team to contribute to advancements in AI systems focusing on data and analytics. #J-18808-Ljbffr DataAnnotationRemote job- ...We are seeking a Senior or Principal Data Scientist/Machine Learning Scientist to lead product... ...Identify success metrics and build evaluation frameworks for both model and product performance... ...A/B testing, product metrics and user behavior analytics Preferred Qualifications...Work at office
$75.28k - $109.55k
...Responsible for designing and implementing data analysis and reporting solutions to meet... ..."looks like" by specifying which behaviors are most critical for successful performance... ...success. These competencies are used to evaluate performance, make hiring decisions, identify...Remote workShift work$153k - $222k
...or support role. Experience with "Big Data" technologies or concepts (e.g., analytics... ...dynamics, and customer buying behavior. About the Job When leading companies choose... ...from use case identification, technical evaluation, and through customer ramp. Combine business...Full time$172k - $229k
...Level Technical Lead Manager to build and lead our new AI Data Engine team. This pivotal role will drive the strategy,... ...generating ML datasets for large computer vision and behavioral models. Identify and evaluate cutting-edge technologies and methodologies for...Work at officeRemote work$73.8k - $107.4k
...defines what effective leadership “looks like” by specifying which behaviors are most critical for successful performance at each job level... ...and career success. These competencies are used to evaluate performance, make hiring decisions, identify development needs...Work at officeRemote workFlexible hoursShift work2 days per week1 day per week$89.4k - $183.5k
...role for you. We are seeking a Senior Data Scientist to join our innovation hub-a small,... ...predictive models to explain events, forecast behaviors, identify risk, or perform segmentation... ..., and aligned with business needs. Evaluate alternative approaches and select...Temporary workWorldwideFlexible hours$117k - $167k
...Spotify is seeking a Data Scientist II to join Product Trust Insights (PTI) within Trust &... ...work spans pre-launch risk assessment, evaluation of AI features and agentic systems, longitudinal... ...LLM-based evaluation approaches, behavioral instrumentation, and measurement...Work from homeFlexible hours$98.34k - $201.9k
...Summary: We're looking for a Senior Data Scientist who can bridge the gap between our most... ...prescriptive models to explain outcomes, forecast behavior, and identify risks and opportunities.... ...and rapid prototyping to evaluate emerging capabilities. Embed analytics...Temporary workWorldwideFlexible hours- ...development. As an Individual Contributor on the Data Studio team, you will play a key role in... ...datasets that power model training, evaluation, and customer delivery. This role is... ...needing new data. Influence model behavior by supplying representative engineering...For contractors
- ...serve as ground truth for training and evaluating AI systems. Define and maintain standards... ...requirements and influence model behavior through representative engineering examples... ...quality, and ensuring timely delivery of data assets What we’re looking for Bachelor...For contractors
- ...technology solutions that support complex data and operational workflows within a... ...stakeholders to understand requirements, evaluate solution options, and recommend scalable... ...including unit testing and domain-driven or behavior-driven development Familiarity with cloud...
$129k - $203.1k
...Learning expert for the position of Senior Scientist, Data Science within our Pharmacokinetics,... ...automated report generation, quality evaluation, consistency checks, process monitoring... ...monitoring for model drift, data quality, agent behavior, and downstream impact. Contribute to...$137.8k - $206.6k
...responsible for architecting analytical data models, designing scalable cloud-based data... ...with passion by modeling EMD Serono behaviors and working with a sense of urgency to achieve... ..., and cost efficiency. Continuously evaluate and adopt new BI, analytics, and data-warehouse...Temporary workFlexible hours$166k - $200k
...enhancing the investment process through data driven insights, portfolio analytics, and... ...relevant to company fundamentals and market behavior Designs and implements back testing... ...to all applicants and employees, and we evaluate qualified applicants without regard to ancestry...Local areaFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Scientist, Behavior Evaluation. Be the first to apply!
- entry level data scientist remote Boston, MA
- senior data scientist Boston, MA
- entry level data scientist Boston, MA
- work from home data scientist Boston, MA
- healthcare data scientist Boston, MA
- python data scientist Boston, MA
- data scientist (hedge fund) Boston, MA
- energy data scientist Boston, MA
- ai data scientist Boston, MA
- part time data scientist Boston, MA


