Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Member of Technical Staff, AI Quality

$176k - $253k

Harper

Senior Member of Technical Staff, AI Quality Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI onto insurance — we're rebuilding the entire business as software, on a simple bet: turning expert human judgment into compute is one of the largest transitions left to make, and a trillion-dollar industry still run 90% by hand is the place to prove it. We've grown ~100x in the last year and we move at that speed — on-site, in person, long days, very high standards. Almost no one joins Harper for insurance; they join to build the company that replaces how it works. The role Turning judgment into compute only compounds if the company can tell whether the compute is getting better. Today that's mostly vibes: an engineer ships a prompt change, a tool change, or a new model and judges it by feel — "seems better," "the demo passed." Vibes don't survive Series B, and they definitely don't survive an agent that's quoting real coverage for real businesses. Your job is to turn agent quality from a vibe into a number. Harper's agents handle intake, sales, service, voice, and submission packaging; every one needs to be evaluated, regression-tested, and monitored in production. You'll work alongside the engineer setting AI-quality direction and own a specific agent surface end-to-end — so that when the agent improves we know, and when it regresses we know before the customer does. That's how we scale judgment without scaling headcount. What you'll do Build capability + regression eval suites for your assigned agents — intake, submissions, placements, renewals, CRM, or voice. Curate golden datasets from real failure modes: real transcripts, real underwriter back-and-forth, real call recordings. 20–50 sharp cases per agent, not thousands of synthetic ones. Design graders. Deterministic first (string match, state check, tool-call assertions); LLM-as-judge where deterministic fails; human calibration on samples. Ship pre-merge eval gates. Every PR touching an agent, prompt, or tool runs the relevant suite in CI. Below threshold, it's blocked. Wire production trajectory monitoring. Online evaluators score live trajectories; drift gets caught within hours. Turn ops findings into permanent tests. Every flagged failure becomes a regression case; every repeat issue becomes a test that catches it forever. What we're looking for 3–6 years building software, with hands-on production LLM/agent eval experience — capability + regression suite design, LLM-as-judge graders, golden datasets. You can describe a specific regression an eval suite you built caught — and exactly how it would have leaked otherwise. You've designed an LLM-as-judge rubric that survived human calibration, and you debug a hallucination by reading transcripts, not aggregate dashboards. Familiar with at least one major eval framework; strong written communication (rubric docs, failure-mode taxonomies). You write code with AI daily and have real opinions on which agent behaviors actually matter. Bonus: open-source eval-framework contributions; red-team/adversarial testing; voice eval (latency, interruption, transcription accuracy); ML eval/observability background. The reality On-site in San Francisco, in person, long days, high standards. AI quality is the discipline that decides whether the whole bet holds, which means the work is scrutinized and the bar is high — your evals are what let everyone else ship fast without flying blind. The right person wants that leverage and that pace. Logistics Compensation (OTE): $176,000–$253,000 cash (base + target performance bonus), plus competitive equity. Location: San Francisco, in-office. Based here or willing to relocate. Benefits: Uber commuter benefits; breakfast, lunch, and dinner provided; snacks and coffee stocked; free gym membership; health, dental, and vision. Process: Founder call (15 min) → Tech Lead deep-dive (60 min, eval architecture and real failure modes) → Super Day on-site → founder + Tech Lead offer. No committee. Best offer, first. To apply: If you've turned vibes into a number — built an eval suite that caught a regression a model upgrade silently introduced — send your resume, the framework, and a transcript of a failure you found that nobody else did. #J-18808-Ljbffr Harper

Vacancy posted 16 hours ago
Similar jobs that could be interesting for youBased on the Senior Member of Technical Staff, AI Quality in San Francisco, CA vacancy
  •  ...developers and enterprises who are building AI systems to power magical experiences...  ...that matter, and join the team. As a Member of Technical Staff with a focus on Multimodal AI, you...  ...and culture, fitness and well-being, quality time, and workspace improvement Remote... 
    Quality
    Senior
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    16 hours ago
  • $148.5k - $223.9k

     ...About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive...  ...systems with customers. With your strong technical competence, strategic thinking and customer...  ...and/or Java Build and ship high-quality, production-grade software using modern engineering... 
    Quality
    Senior

    Salesforce.Com Inc

    San Francisco, CA
    1 day ago
  • $220k - $250k

    Our client, an AI-driven healthcare company focused on personalizing patient care, is hiring a Member of Technical Staff to join their team remotely. The successful candidate will help...  ...medical information. ~Enhance search quality, citation accuracy and data lineage across... 
    Quality
    Senior
    Permanent employment
    Remote work

    Alldus International Consulting Ltd

    San Francisco, CA
    17 days ago
  •  ...Senior Member of Technical Staff Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI onto insurance - we're rebuilding the entire business as software, on a simple bet: turning expert human judgment into compute is one of the... 
    Senior
    Work at office
    Relocation

    Harper Group

    San Francisco, CA
    4 days ago
  • Member of Technical Staff - Infrastructure Security We're partnering with a frontier AI research company that is building next-generation open-weight foundation models with the mission of making advanced AI broadly accessible. Their team includes researchers, engineers,... 
    Senior

    Xcede

    San Francisco, CA
    7 hours ago
  •  ...building the next generation of AI infrastructure: large-scale...  ...Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern...  ...techniques to drive performance and quality optimizations across the...  ..., work alongside highly technical engineers, and help shape both... 
    Quality
    Internship

    Gimlet Labs

    San Francisco, CA
    1 day ago
  • $200k

     ...company's most important decisions. As a Member of Technical Staff on Evals, you will build both the...  ...evaluations, data ablations, and dataset quality analysis, while designing and...  ...collaborative team working on frontier AI systems Magic strives to be the place... 
    Quality
    Visa sponsorship
    Relocation package

    Magic AI Corp.

    San Francisco, CA
    1 day ago
  • $200k - $300k

     ...Member of Technical Staff (Platform) Title of Role: Member of Technical Staff (Platform) Location...  .... With a focus on building robust AI-driven systems, this organization is...  ...through to production, ensuring high-quality code and rapid deployment. Build and... 
    Quality
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    3 days ago
  •  ...servicing with the industry's most advanced AI credit-servicing agents. We are...  ..., Product Hunt), Charlie Songhurst (Board Member, Meta), and Michael Jones (Former Chair, Huntington...  ...features Write tests, enforce code quality, and participate in design/review... 
    Quality
    Full time
    Work experience placement
    Internship
    Worldwide

    Krew Research

    San Francisco, CA
    4 days ago
  •  ...Member Of Technical Staff @ Lotus AI Lotus AI is a groundbreaking primary care app that integrates your medical records, AI, and real doctors to...  ...and maintain data curation pipelines that produce high-quality training and evaluation datasets from clinical interactions... 
    Quality

    Lotus Health

    San Francisco, CA
    8 days ago
  • $200k - $350k

     ...Member Of Technical Staff, Inference & Serving Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially...  ...than today's LLMs, with best-in-class quality. We are the AI researchers and... 
    Quality
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    1 day ago
  • $200k - $350k

     ...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world...  ...efficiency than today's LLMs, with best-in-class quality. We are the AI researchers and engineers behind such... 
    Quality
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    3 days ago
  •  ...Member Of Technical Staff – Applied AI, Frontend Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster...  ...processes, tools, and systems that scale productivity, code quality, and team impact. Recruit, mentor, and grow a high-... 
    Quality
    Full time
    Flexible hours

    Stuut

    San Francisco, CA
    2 days ago
  •  ...and implement intuitive, high-quality frontend experiences - with strong...  ...layers) Interest in AI systems and enthusiasm for building...  ...on company direction and technical decisions High ownership and...  ...sponsorship As a founding member, you'll help define the... 
    Quality
    Visa sponsorship
    Flexible hours

    NeoSigma

    San Francisco, CA
    1 day ago
  • We're partnering with a frontier AI research company on a search for a Member of Technical Staff focused on AI Safet y. The company is building next-generation open-weight foundation models with a mission to make advanced AI broadly accessible. Their team includes researchers... 
    Senior

    Xcede

    San Francisco, CA
    7 hours ago
  •  ...foundational infrastructure to train specialized AI agents. We turn real-world data...  ...feel like one seamless system. As a Member of Technical Staff, Infrastructure / DevOps, you will own...  ...that directly shapes the quality, speed, and reliability of Plato's research... 
    Quality

    Plato

    San Francisco, CA
    1 day ago
  • $300k

    Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans...  ...Design metrics and baselines for reward quality, including alignment with intended... 
    Quality
    Work at office
    Local area

    Vmax

    San Francisco, CA
    16 hours ago
  • $300k

    Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all...  ...where reliability, ownership, and code quality were central. Evidence of strong... 
    Quality
    Work at office
    Local area

    Vmax

    San Francisco, CA
    2 days ago
  • $150k - $300k

     ...team. About the role We’re hiring a Member of Technical Staff - Research to help advance the frontier...  ...for training & evaluating autonomous AI agents Investigating failure modes of...  ...experience shipping reliable, production-quality code Have a track record of... 
    Quality
    Full time
    Work at office

    Cervin

    San Francisco, CA
    3 days ago
  • Member of Technical Staff, Post-Training Location: SF Bay Area or Tokyo, Japan Type: Full-time About...  ...Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed...  ...Help define, curate, or generate high‑quality post‑training datasets, including expert... 
    Quality
    Full time

    Radical Numerics

    San Francisco, CA
    4 days ago
  • $200k - $330k

     ...is seeking an intrepid, polymathic Member of Technical Staff to take on one of the AI industry's most unique engineering...  ...work directly with Perplexity's senior leadership to spearhead a broad portfolio...  ...standards for the craft and quality of your work. Strong communication... 
    Quality

    Perplexity

    San Francisco, CA
    7 hours ago
  •  ...developers and enterprises who are building AI systems to power magical experiences...  ...on our mission and shape the future! Member of Technical Staff, Search Why this role? We are looking...  ...-edge search models. Gather high-quality retrieval datasets and optimize data pipelines... 
    Quality
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    16 hours ago
  • Full-Stack TypeScript | React | AI Agents | Forward-Deployed Engineering Location...  ...Apple, Ramp, Stripe, and Meta. As a Member of Technical Staff , you will own products end‑to‑end...  ...ambiguous environments Balance speed, quality, scalability, and customer impactHelp... 
    Quality
    Work at office

    Love Freedom Solution

    San Francisco, CA
    16 hours ago
  • $200k

    Join to apply for the Member of Technical Staff role at Listen Labs . TL;DR: We are...  ...: Listen Labs is an AI‑powered research platform that...  ...making to be more nuanced than senior user researchers. Distributed...  .... You have a high bar for quality: In a fast‑moving startup, output... 
    Quality
    Flexible hours

    Listen Labs

    San Francisco, CA
    1 day ago
  • Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location...  ...Numerics Radical Numerics is an AI lab bringing the rigor of distributed...  ...architectural ideas that improve representation quality and downstream performance. Study... 
    Quality
    Full time

    Radical Numerics

    San Francisco, CA
    1 day ago
  • Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress toward human...  ..., shaping the industry standard for robust, high‑quality environments. Your work will inform how frontier labs... 
    Quality

    Patronus AI, Inc.

    San Francisco, CA
    4 days ago
  •  ...looking for someone highly technical (our current team...  ...Background Listen Labs is an AI-powered research...  ...effective than the most senior user researchers. This...  ...You have a high bar for quality: In a startup, moving fast...  ...to Grow: As an early member of the team, you’ll have... 
    Quality
    Flexible hours

    Listen Labs

    San Francisco, CA
    1 day ago
  • $227.5k - $401k

     ...individuals who tackle unique technical challenges at scale and...  ...Adyen is building a top-tier AI engineering team in San...  ...financial technology sector. As a Member of Technical Staff, you will operate with a...  ...alignment by setting high quality standards. Optimize and Scale... 
    Quality
    Work at office
    Immediate start
    Relocation
    Flexible hours

    Adyen

    San Francisco, CA
    16 hours ago
  •  ...exceptional mid-level to senior engineer to join our...  ...architecture reviews for other members of the team Help...  ...needs Requirements Technical 2+ YOE: 2+ years of full...  ...ship quickly with high quality. You should have a track...  ...own company Interest in AI/ML systems and... 
    Quality
    Full time
    Work experience placement
    Relocation
    Relocation package
    Shift work

    PetsApp

    San Francisco, CA
    3 days ago
  • Member of Technical Staff, Document Understanding Join us and help shape the future of AI by architecting next-generation knowledge systems. Join us and help shape the future...  ...evaluation frameworks, benchmarks, or data quality pipelines Experience with model serving frameworks... 
    Quality
    Work at office
    Remote work

    LlamaIndex, Inc.

    San Francisco, CA
    16 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Member of Technical Staff, AI Quality. Be the first to apply!