Staff Data Scientist - Evaluations & LLM Quality (Hybrid)
$210k - $385kPantera Capital
A leading financial technology firm seeks a Data Scientist to architect evaluation pipelines and improve answer quality through innovative evaluations. Candidates should possess a PhD or MS in a technical field with at least 4 years of relevant experience, strong skills in Python and SQL, and familiarity with AWS and Databricks. The role offers a hybrid work environment and a competitive salary package ranging from $210K to $385K along with comprehensive benefits. #J-18808-Ljbffr Pantera Capital
$238k - $302k
...states. Rigorous behavioral evaluation of the Waymo Driver is a... ...initiatives for measuring the quality and trustworthiness of the behavior... ...priorities. In this hybrid role you will report to the DS... ...industry experience solving data science problems Solid statistical...QualityFull timeRemote work$210k - $385k
...Employment Type Full time Location Type Hybrid Department Data Science Compensation $210K - $3... ...users daily with reliable, high-quality answers grounded in an LLM-first search engine and our... ...Architect and maintain automated evaluation pipelines to assess answer...QualityFull timeLocal area- ...Full time Location Type Hybrid Department Data Science Perplexity... ...daily with reliable, high-quality answers grounded in an LLM‑first search engine and... ...our users. As a Data Scientist/Engineer on this team,... ...alignment with ground‑truth evaluations Analyze experimental...QualityFull time
$300k
...Senior Staff Data Scientist Grindr is an AI-native platform powering how... ...to production and improve quality of life for our millions of... ...live full lives. This is a hybrid role based in our Bay Area... ...strategy for designing and evaluating experiments to measure the...QualityWork at officeImmediate startWorldwideFlexible hours$175k - $235k
...maps and strategies with data-driven analytics and... .... As a senior/staff data scientist focused on product analytics... ...-making * Design and evaluate product experiments... ...distributed data, validating quality and delivering... ...stored in the United States. #LI-Hybrid #LI-KW1QualityWork at office3 days per week$200k - $235k
...LiveRamp is the data collaboration platform of choice... ...global network of top-quality partners. Hundreds... ...requirements. Staff Data Scientist LiveRamp is the data... ...the performance of our LLM-based agentic system.... ...and implement rigorous evaluation methodologies to measure...QualityWork at officeWork from homeFlexible hoursNight shift$204k - $259k
...Learning Engineer – VLM/LLM Evaluation Waymo is an... ...This role follows a hybrid work schedule and you... ...will report to a Senior Staff Software Engineer.... ...), for evaluating the quality, safety, and realism of... ...and extend large scale data and evaluation pipelines...QualityFull timeTemporary workRemote work$200k - $365k
...the highest standards of data security and privacy... ...systems, data pipelines, or evaluation harnesses that can run... ...like for a Speech LLM, translating capabilities... ...transcription accuracy, audio quality, and reasoning of audio... ...of gender. Hybrid Office: Minimum of 3x in...QualityFull timeWork at officeWorldwide$264.8k - $331k
...As the leading data and evaluation partner for frontier AI companies... ...building industry-leading LLM evals, setting new... ...AI labs to provide high quality data and accelerate progress... ...in GenAI research. As a Staff Machine Learning Research Scientist on the LLM Evals team, you...QualityFull time$179.5k - $269.5k
...2010. We're looking for a Staff Data Scientist, Finance to own the data science... ...AI/ML solutions—including LLM-based workflows—to automate... ...reporting cycles, and improve data quality at scale. Own the financial... ...assistance for things like hybrid work, family planning, along...QualityFull timeWork at officeFlexible hours- A leading AI evaluation company is looking for a Staff Machine Learning Research Scientist to advance LLM evaluation methodologies. This role involves designing benchmarks, collaborating with teams, and mentoring others. Ideal candidates have significant experience in...
$179.5k - $269.5k
...10. We're looking for a Staff Data Scientist (Pricing) to serve as the senior... ...and coding agents (e.g., LLM-based assistants, autonomous... ..., including monitoring, evaluation, and iteration in live environments... ...assistance for things like hybrid work, family planning, along...Full timeWork at officeLocal areaRemote workFlexible hoursShift work- ...research for their next generation of LLM products. Join us if you: Wish to... .... Responsibilities Own LLM evaluation processes and methods with a focus on... ...safety vulnerabilities. Generate high quality synthetic data, curate labels, and conduct rigorous...QualityLocal areaShift work
$212k - $265k
...Staff Data Scientist - Product Analytics 1 in 4 people in the US have a treatable mental health... ...will influence what we build, how we evaluate it, and how we scale what works. This... ..., online experiments, and human-quality signals when needed. Build scalable...QualityWork from homeFlexible hours- ...Staff Data Scientist The Data team at Imprint builds the data foundation that powers smarter, faster... ...and long-term strategy, enabling high-quality insights into customer behavior,... ...Location & Work Model This is a hybrid role requiring 2–3 days per week onsite...QualityRelocationFlexible hours2 days per week3 days per week
$155k - $189k
...Staff Data Scientist - Time Products San Francisco, CA - Hybrid; New York, NY - Hybrid; At Gusto, we're on a mission to grow the small business economy. We... ...offs and limitations based on sample size and data quality. Execution: Deliver multiple high-impact...QualityFull timeWork at officeLocal area2 days per week3 days per week- A leading AI contracting platform is seeking an experienced Data Scientist to optimize product analytics and enhance contract management solutions... ...and is familiar with tools such as BigQuery and Looker. This hybrid role requires in-office attendance twice a week in San...Contract workWork at office
$160k - $200k
The Opportunity As a Staff Data Scientist on the Clinical Performance team, you will be the lead... ...providers. While your primary focus is evaluative, you will be a key player in the broader... ...forecasting engines that predict our quality performance across various value-based...Quality- ...at About the Role We’re looking for a Data & LLM Systems Engineer to help us design, build... ...for monitoring, debugging, and evaluating LLM behavior in production Analysis & Insight... ...interactions to identify failure modes, drift, and quality issues and help ensure overall...QualityHome officeFlexible hours
- ...focused on ensuring the accuracy and reliability of Veeva AI Agents. The position involves defining evaluation strategies, assessing LLM outputs, and creating high-quality datasets through rigorous validation methodologies. Successful candidates will have strong...QualityFlexible hours
$158k - $187k
...performance? As an Innovation Data Scientist within Innovation... ...that enhance quality, speed, and decision-making... ...large language model (LLM)-powered and classical... ...success criteria, and evaluation plans aligned to business... ...prohibited by applicable law. #LI-Hybrid #LI-LC1QualityContract workWorldwideFlexible hours$133k - $150k
...role for you. Impact. Data Scientists sit within a group of... ...layout analysis, LLM-based information extraction... ...("LPT") leadership to evaluate and test internal... ...evaluating LLM output quality through benchmarking,... ...prohibited by applicable law. #LI-Hybrid #LI-LB1 #LI-RemoteQualityLocal areaRemote workWorldwideFlexible hours$185k - $221.4k
...develop and critically evaluate therapeutic hypotheses.... ...Build and own production data infrastructure. Design,... ...Model and curate high-quality data assets. Perform... ...that GenAI agents and LLM orchestrators depend on... ...healthcare, generous PTO, and hybrid/remote options aligned...QualityRemote workFlexible hours- ...Brisk Teaching Data Scientist Opportunity Brisk Teaching... ...automated pipelines, and LLM-powered tooling can... ...model generation, data quality monitoring, experiment... ...Bring rigor to how we evaluate features and make ship... ...times a week via our hybrid model in either our San...QualityShift work
$180k - $210k
...STAFF DATA SCIENTIST, PRODUCT - EXPERIMENTATION & CAUSAL INFERENCE $180K - $210K BASE REMOTE... ...and engineering leaders Design and evaluate experiments across web and in-app experiences... ...AI tools (e.g., coding copilots, LLM-assisted workflows) THE BENEFITS...Remote work$175k - $234k
...Future Neighbors As Data Scientists at Nextdoor, you will... ...environment that embraces a hybrid employment model,... ...increasingly relies on LLM-driven query... ...query understanding and evaluation. Your responsibilities... ...Care deeply about data quality and empower employees...QualityWork at officeLocal areaWork from home$249.6k - $312k
...Senior Staff Data Scientist - Bayesian Experimentation and Causal Inference New York, New York, United States; San Francisco, California,... ...serve as a core leader for standards, frameworks, and decision quality across Product, Growth, Ops, and Finance. Your work will...QualityWork from homeFlexible hours$212k - $265k
...Staff Data Scientist - Growth New York, New York, United States; San Francisco, California, United States; Seattle, Washington, United States... ...assumptions, offline conversion integration, and data quality monitoring so measurement is robust. Advocate for minimal, decision...QualityTemporary workWork from homeFlexible hoursShift work$160k - $260k
...FinTech unicorn in San Francisco is looking for a Data Engineer. In this hybrid role, you'll collaborate with data scientists to design and build data models and scalable... ...for optimizing datasets and ensuring data quality management. This position offers a competitive...Quality- ...financial technology company in San Francisco seeks a Data Engineer to design and maintain data models and... ...with teams to craft data solutions and uphold high data quality standards. The position allows for a hybrid work arrangement with flexible remote options and competitive...QualityRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Data Scientist - Evaluations & LLM Quality (Hybrid). Be the first to apply!
- assistant scientist San Francisco, CA
- python data scientist San Francisco, CA
- healthcare data scientist San Francisco, CA
- junior data scientist remote San Francisco, CA
- data scientist San Francisco, CA
- ai data scientist San Francisco, CA
- data scientist (hedge fund) San Francisco, CA
- entry level data scientist remote San Francisco, CA
- entry level data scientist San Francisco, CA
- energy data scientist San Francisco, CA


