Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Data Scientist - Evaluations & LLM Quality (Hybrid)

$210k - $385k

Pantera Capital

A leading financial technology firm seeks a Data Scientist to architect evaluation pipelines and improve answer quality through innovative evaluations. Candidates should possess a PhD or MS in a technical field with at least 4 years of relevant experience, strong skills in Python and SQL, and familiarity with AWS and Databricks. The role offers a hybrid work environment and a competitive salary package ranging from $210K to $385K along with comprehensive benefits. #J-18808-Ljbffr Pantera Capital

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff Data Scientist - Evaluations & LLM Quality (Hybrid) in San Francisco, CA vacancy
  • $238k - $302k

     ...states. Rigorous behavioral evaluation of the Waymo Driver is a...  ...initiatives for measuring the quality and trustworthiness of the behavior...  ...priorities. In this hybrid role you will report to the DS...  ...industry experience solving data science problems Solid statistical... 
    Quality
    Full time
    Remote work

    Waymo

    San Francisco, CA
    19 days ago
  • $210k - $385k

     ...Employment Type Full time Location Type Hybrid Department Data Science Compensation $210K - $3...  ...users daily with reliable, high-quality answers grounded in an LLM-first search engine and our...  ...Architect and maintain automated evaluation pipelines to assess answer... 
    Quality
    Full time
    Local area

    Pantera Capital

    San Francisco, CA
    4 days ago
  •  ...Full time Location Type Hybrid Department Data Science Perplexity...  ...daily with reliable, high-quality answers grounded in an LLM‑first search engine and...  ...our users. As a Data Scientist/Engineer on this team,...  ...alignment with ground‑truth evaluations Analyze experimental... 
    Quality
    Full time

    Pantera Capital

    San Francisco, CA
    4 days ago
  • $300k

     ...Senior Staff Data Scientist Grindr is an AI-native platform powering how...  ...to production and improve quality of life for our millions of...  ...live full lives. This is a hybrid role based in our Bay Area...  ...strategy for designing and evaluating experiments to measure the... 
    Quality
    Work at office
    Immediate start
    Worldwide
    Flexible hours

    Grindr

    San Francisco, CA
    5 days ago
  • $175k - $235k

     ...maps and strategies with data-driven analytics and...  .... As a senior/staff data scientist focused on product analytics...  ...-making * Design and evaluate product experiments...  ...distributed data, validating quality and delivering...  ...stored in the United States. #LI-Hybrid #LI-KW1
    Quality
    Work at office
    3 days per week

    Gallup

    San Francisco, CA
    4 days ago
  • $200k - $235k

     ...LiveRamp is the data collaboration platform of choice...  ...global network of top-quality partners. Hundreds...  ...requirements. Staff Data Scientist LiveRamp is the data...  ...the performance of our LLM-based agentic system....  ...and implement rigorous evaluation methodologies to measure... 
    Quality
    Work at office
    Work from home
    Flexible hours
    Night shift

    LiveRamp

    San Francisco, CA
    3 days ago
  • $204k - $259k

     ...Learning Engineer – VLM/LLM Evaluation Waymo is an...  ...This role follows a hybrid work schedule and you...  ...will report to a Senior Staff Software Engineer....  ...), for evaluating the quality, safety, and realism of...  ...and extend large scale data and evaluation pipelines... 
    Quality
    Full time
    Temporary work
    Remote work

    Waymo

    San Francisco, CA
    23 hours ago
  • $200k - $365k

     ...the highest standards of data security and privacy...  ...systems, data pipelines, or evaluation harnesses that can run...  ...like for a Speech LLM, translating capabilities...  ...transcription accuracy, audio quality, and reasoning of audio...  ...of gender. Hybrid Office: Minimum of 3x in... 
    Quality
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    2 days ago
  • $264.8k - $331k

     ...As the leading data and evaluation partner for frontier AI companies...  ...building industry-leading LLM evals, setting new...  ...AI labs to provide high quality data and accelerate progress...  ...in GenAI research. As a Staff Machine Learning Research Scientist on the LLM Evals team, you... 
    Quality
    Full time

    Scale AI

    San Francisco, CA
    4 days ago
  • $179.5k - $269.5k

     ...2010. We're looking for a Staff Data Scientist, Finance to own the data science...  ...AI/ML solutions—including LLM-based workflows—to automate...  ...reporting cycles, and improve data quality at scale. Own the financial...  ...assistance for things like hybrid work, family planning, along... 
    Quality
    Full time
    Work at office
    Flexible hours

    GoFundMe

    San Francisco, CA
    1 day ago
  • A leading AI evaluation company is looking for a Staff Machine Learning Research Scientist to advance LLM evaluation methodologies. This role involves designing benchmarks, collaborating with teams, and mentoring others. Ideal candidates have significant experience in... 

    Scale AI, Inc.

    San Francisco, CA
    1 day ago
  • $179.5k - $269.5k

     ...10. We're looking for a Staff Data Scientist (Pricing) to serve as the senior...  ...and coding agents (e.g., LLM-based assistants, autonomous...  ..., including monitoring, evaluation, and iteration in live environments...  ...assistance for things like hybrid work, family planning, along... 
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours
    Shift work

    GoFundMe

    San Francisco, CA
    3 days ago
  •  ...research for their next generation of LLM products. Join us if you: Wish to...  .... Responsibilities Own LLM evaluation processes and methods with a focus on...  ...safety vulnerabilities. Generate high quality synthetic data, curate labels, and conduct rigorous... 
    Quality
    Local area
    Shift work

    Dynamo AI

    San Francisco, CA
    4 days ago
  • $212k - $265k

     ...Staff Data Scientist - Product Analytics 1 in 4 people in the US have a treatable mental health...  ...will influence what we build, how we evaluate it, and how we scale what works. This...  ..., online experiments, and human-quality signals when needed. Build scalable... 
    Quality
    Work from home
    Flexible hours

    Headway - Design & Development

    San Francisco, CA
    4 days ago
  •  ...Staff Data Scientist The Data team at Imprint builds the data foundation that powers smarter, faster...  ...and long-term strategy, enabling high-quality insights into customer behavior,...  ...Location & Work Model This is a hybrid role requiring 2–3 days per week onsite... 
    Quality
    Relocation
    Flexible hours
    2 days per week
    3 days per week

    Imprint.com

    San Francisco, CA
    4 days ago
  • $155k - $189k

     ...Staff Data Scientist - Time Products San Francisco, CA - Hybrid; New York, NY - Hybrid; At Gusto, we're on a mission to grow the small business economy. We...  ...offs and limitations based on sample size and data quality. Execution: Deliver multiple high-impact... 
    Quality
    Full time
    Work at office
    Local area
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    3 days ago
  • A leading AI contracting platform is seeking an experienced Data Scientist to optimize product analytics and enhance contract management solutions...  ...and is familiar with tools such as BigQuery and Looker. This hybrid role requires in-office attendance twice a week in San... 
    Contract work
    Work at office

    Ironclad

    San Francisco, CA
    1 day ago
  • $160k - $200k

    The Opportunity As a Staff Data Scientist on the Clinical Performance team, you will be the lead...  ...providers. While your primary focus is evaluative, you will be a key player in the broader...  ...forecasting engines that predict our quality performance across various value-based... 
    Quality

    Pearl Health

    San Francisco, CA
    1 day ago
  •  ...at About the Role We’re looking for a Data & LLM Systems Engineer to help us design, build...  ...for monitoring, debugging, and evaluating LLM behavior in production Analysis & Insight...  ...interactions to identify failure modes, drift, and quality issues and help ensure overall... 
    Quality
    Home office
    Flexible hours

    Allspice, Inc.

    San Francisco, CA
    1 day ago
  •  ...focused on ensuring the accuracy and reliability of Veeva AI Agents. The position involves defining evaluation strategies, assessing LLM outputs, and creating high-quality datasets through rigorous validation methodologies. Successful candidates will have strong... 
    Quality
    Flexible hours

    Veeva Systems, Inc.

    San Francisco, CA
    4 days ago
  • $158k - $187k

     ...performance? As an Innovation Data Scientist within Innovation...  ...that enhance quality, speed, and decision-making...  ...large language model (LLM)-powered and classical...  ...success criteria, and evaluation plans aligned to business...  ...prohibited by applicable law. #LI-Hybrid #LI-LC1
    Quality
    Contract work
    Worldwide
    Flexible hours

    Kirkland & Ellis

    San Francisco, CA
    3 days ago
  • $133k - $150k

     ...role for you. Impact. Data Scientists sit within a group of...  ...layout analysis, LLM-based information extraction...  ...("LPT") leadership to evaluate and test internal...  ...evaluating LLM output quality through benchmarking,...  ...prohibited by applicable law. #LI-Hybrid #LI-LB1 #LI-Remote
    Quality
    Local area
    Remote work
    Worldwide
    Flexible hours

    Kirkland & Ellis

    San Francisco, CA
    2 days ago
  • $185k - $221.4k

     ...develop and critically evaluate therapeutic hypotheses....  ...Build and own production data infrastructure. Design,...  ...Model and curate high-quality data assets. Perform...  ...that GenAI agents and LLM orchestrators depend on...  ...healthcare, generous PTO, and hybrid/remote options aligned... 
    Quality
    Remote work
    Flexible hours

    Foresite Labs

    San Francisco, CA
    2 days ago
  •  ...Brisk Teaching Data Scientist Opportunity Brisk Teaching...  ...automated pipelines, and LLM-powered tooling can...  ...model generation, data quality monitoring, experiment...  ...Bring rigor to how we evaluate features and make ship...  ...times a week via our hybrid model in either our San... 
    Quality
    Shift work

    Brisk Teaching

    San Francisco, CA
    3 days ago
  • $180k - $210k

     ...STAFF DATA SCIENTIST, PRODUCT - EXPERIMENTATION & CAUSAL INFERENCE $180K - $210K BASE REMOTE...  ...and engineering leaders Design and evaluate experiments across web and in-app experiences...  ...AI tools (e.g., coding copilots, LLM-assisted workflows) THE BENEFITS... 
    Remote work

    Harnham

    San Francisco, CA
    23 hours ago
  • $175k - $234k

     ...Future Neighbors As Data Scientists at Nextdoor, you will...  ...environment that embraces a hybrid employment model,...  ...increasingly relies on LLM-driven query...  ...query understanding and evaluation. Your responsibilities...  ...Care deeply about data quality and empower employees... 
    Quality
    Work at office
    Local area
    Work from home

    Nextdoor

    San Francisco, CA
    9 days ago
  • $249.6k - $312k

     ...Senior Staff Data Scientist - Bayesian Experimentation and Causal Inference New York, New York, United States; San Francisco, California,...  ...serve as a core leader for standards, frameworks, and decision quality across Product, Growth, Ops, and Finance. Your work will... 
    Quality
    Work from home
    Flexible hours

    Headway - Design & Development

    San Francisco, CA
    4 days ago
  • $212k - $265k

     ...Staff Data Scientist - Growth New York, New York, United States; San Francisco, California, United States; Seattle, Washington, United States...  ...assumptions, offline conversion integration, and data quality monitoring so measurement is robust. Advocate for minimal, decision... 
    Quality
    Temporary work
    Work from home
    Flexible hours
    Shift work

    Headway - Design & Development

    San Francisco, CA
    4 days ago
  • $160k - $260k

     ...FinTech unicorn in San Francisco is looking for a Data Engineer. In this hybrid role, you'll collaborate with data scientists to design and build data models and scalable...  ...for optimizing datasets and ensuring data quality management. This position offers a competitive... 
    Quality

    Kikoff Inc.

    San Francisco, CA
    2 days ago
  •  ...financial technology company in San Francisco seeks a Data Engineer to design and maintain data models and...  ...with teams to craft data solutions and uphold high data quality standards. The position allows for a hybrid work arrangement with flexible remote options and competitive... 
    Quality
    Remote work
    Flexible hours

    Brex Inc.

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Data Scientist - Evaluations & LLM Quality (Hybrid). Be the first to apply!