Data Scientist, Behavior Evaluation
Zoox Inc.
Job Description
As a Data Scientist on the Behavior Evaluation team, you will be the statistical anchor ensuring our autonomous driving systems navigate highway environments with world-class safety, efficiency, and comfort. Highway evaluation presents a unique industry challenge: verifying vehicle behavior at high velocities where the margin for error is razor-thin, and critical edge cases are buried in petabytes of data.
In this role, you will bridge advanced statistical methodology with scalable software engineering. You will design the mathematical frameworks, statistical tests, and data-driven metrics that evaluate our planner's decisions. Working directly with large-scale simulation and real-world fleet data, your insights will define our validation pipelines, identify behavioral regressions, and directly shape the software powering our next-generation autonomous fleet.
In this role, you will:
- Design Advanced Experimental Frameworks: Formulate robust statistical models, hypothesis testing frameworks, and quasi-experimental designs (such as synthetic controls or matching) to rigorously validate highway planner behavior in simulation and shadow-mode deployments.
- Model Tail Risks & Rare Events: Use Surrogate Safety Measures (e.g., TTC, PET) to accurately model and predict low-frequency, high-severity edge cases that traditional mean-based statistics miss.
- Architect Scenario-Based Metrics: Own and mature critical behavioral KPIs, utilizing data stratification to analyze complex driving scenarios (e.g., high-speed merging, cut-ins) while proactively identifying statistical anomalies like Simpson’s Paradox.
- Surface Statistical Edge Cases: Apply data mining and advanced statistical techniques to isolate low-frequency, high-severity edge cases and systemic Autonomy engineering debt.
- Drive Cross-Functional Alignment: Translate complex statistical findings and multi-source evaluations into clear, actionable technical recommendations, collaborating closely with Autonomy Software Engineers, Safety Systems, and Product teams.
- Education: Bachelor’s or Master’s degree in a highly quantitative field (e.g., Statistics, Mathematics, Data Science, Operations Research, or a related field with a strong statistical focus).
- Experience: 3–6+ years of professional experience as a Data Scientist or Quantitative Engineer, with a proven track record of landing data-driven impact.
- Strong Statistical Foundations: Deep understanding of hypothesis testing, experimental design, regression analysis, non-parametric/resampling methods (e.g., bootstrapping, permutation tests), and time-series analysis handling autocorrelated data.
- Strong Programming: High proficiency in Python (Pandas, NumPy, SciPy, scikit-learn) and the ability to write highly complex, optimized SQL queries for massive distributed databases.
- Communication: Exceptional ability to articulate complex mathematical methodologies and statistical results to cross-functional engineering partners.
- Robotics or Autonomy Background: Experience analyzing spatial-temporal data, sensor logs, or vehicle telemetry from robotics, autonomous vehicles, or aviation systems.
- Simulation-Based Testing: Familiarity with validating software systems using empty-world or simulation platforms at scale.
- Modern Data Stack: Experience with workflow orchestration tools (e.g., Airflow) and building advanced data visualization layers (e.g., Superset).
Base Salary Range
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.
Accommodations
If you need an accommodation to participate in the application or interview process, please reach out to your assigned recruiter.
A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.


