Senior Member of Technical Staff, AI Quality

$176k - $253k

Harper

Senior Member of Technical Staff, AI Quality Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI onto insurance — we're rebuilding the entire business as software, on a simple bet: turning expert human judgment into compute is one of the largest transitions left to make, and a trillion-dollar industry still run 90% by hand is the place to prove it. We've grown ~100x in the last year and we move at that speed — on-site, in person, long days, very high standards. Almost no one joins Harper for insurance; they join to build the company that replaces how it works. The role Turning judgment into compute only compounds if the company can tell whether the compute is getting better. Today that's mostly vibes: an engineer ships a prompt change, a tool change, or a new model and judges it by feel — "seems better," "the demo passed." Vibes don't survive Series B, and they definitely don't survive an agent that's quoting real coverage for real businesses. Your job is to turn agent quality from a vibe into a number. Harper's agents handle intake, sales, service, voice, and submission packaging; every one needs to be evaluated, regression-tested, and monitored in production. You'll work alongside the engineer setting AI-quality direction and own a specific agent surface end-to-end — so that when the agent improves we know, and when it regresses we know before the customer does. That's how we scale judgment without scaling headcount. What you'll do Build capability + regression eval suites for your assigned agents — intake, submissions, placements, renewals, CRM, or voice. Curate golden datasets from real failure modes: real transcripts, real underwriter back-and-forth, real call recordings. 20–50 sharp cases per agent, not thousands of synthetic ones. Design graders. Deterministic first (string match, state check, tool-call assertions); LLM-as-judge where deterministic fails; human calibration on samples. Ship pre-merge eval gates. Every PR touching an agent, prompt, or tool runs the relevant suite in CI. Below threshold, it's blocked. Wire production trajectory monitoring. Online evaluators score live trajectories; drift gets caught within hours. Turn ops findings into permanent tests. Every flagged failure becomes a regression case; every repeat issue becomes a test that catches it forever. What we're looking for 3–6 years building software, with hands-on production LLM/agent eval experience — capability + regression suite design, LLM-as-judge graders, golden datasets. You can describe a specific regression an eval suite you built caught — and exactly how it would have leaked otherwise. You've designed an LLM-as-judge rubric that survived human calibration, and you debug a hallucination by reading transcripts, not aggregate dashboards. Familiar with at least one major eval framework; strong written communication (rubric docs, failure-mode taxonomies). You write code with AI daily and have real opinions on which agent behaviors actually matter. Bonus: open-source eval-framework contributions; red-team/adversarial testing; voice eval (latency, interruption, transcription accuracy); ML eval/observability background. The reality On-site in San Francisco, in person, long days, high standards. AI quality is the discipline that decides whether the whole bet holds, which means the work is scrutinized and the bar is high — your evals are what let everyone else ship fast without flying blind. The right person wants that leverage and that pace. Logistics Compensation (OTE): $176,000–$253,000 cash (base + target performance bonus), plus competitive equity. Location: San Francisco, in-office. Based here or willing to relocate. Benefits: Uber commuter benefits; breakfast, lunch, and dinner provided; snacks and coffee stocked; free gym membership; health, dental, and vision. Process: Founder call (15 min) → Tech Lead deep-dive (60 min, eval architecture and real failure modes) → Super Day on-site → founder + Tech Lead offer. No committee. Best offer, first. To apply: If you've turned vibes into a number — built an eval suite that caught a regression a model upgrade silently introduced — send your resume, the framework, and a transcript of a failure you found that nobody else did. #J-18808-Ljbffr Harper

Apply

Vacancy posted 16 hours ago

Similar jobs that could be interesting for youBased on the Senior Member of Technical Staff, AI Quality in San Francisco, CA vacancy

Senior Member of Technical Staff, Multimodal AI
...developers and enterprises who are building AI systems to power magical experiences... ...that matter, and join the team. As a Member of Technical Staff with a focus on Multimodal AI, you... ...and culture, fitness and well-being, quality time, and workspace improvement Remote...
Quality
Senior
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
16 hours ago
Senior Member of Technical Staff, AI Research
$148.5k - $223.9k
...About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive... ...systems with customers. With your strong technical competence, strategic thinking and customer... ...and/or Java Build and ship high-quality, production-grade software using modern engineering...
Quality
Senior
Salesforce.Com Inc
San Francisco, CA
1 day ago
Senior Member of Technical Staff
$220k - $250k
Our client, an AI-driven healthcare company focused on personalizing patient care, is hiring a Member of Technical Staff to join their team remotely. The successful candidate will help... ...medical information. ~Enhance search quality, citation accuracy and data lineage across...
Quality
Senior
Permanent employment
Remote work
Alldus International Consulting Ltd
San Francisco, CA
17 days ago
Senior Member of Technical Staff
...Senior Member of Technical Staff Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI onto insurance - we're rebuilding the entire business as software, on a simple bet: turning expert human judgment into compute is one of the...
Senior
Work at office
Relocation
Harper Group
San Francisco, CA
4 days ago
Senior Member of Technical Staff - Infrastructure Security
Member of Technical Staff - Infrastructure Security We're partnering with a frontier AI research company that is building next-generation open-weight foundation models with the mission of making advanced AI broadly accessible. Their team includes researchers, engineers,...
Senior
Xcede
San Francisco, CA
7 hours ago
Member of Technical Staff - AI Research (Intern)
...building the next generation of AI infrastructure: large-scale... ...Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern... ...techniques to drive performance and quality optimizations across the... ..., work alongside highly technical engineers, and help shape both...
Quality
Internship
Gimlet Labs
San Francisco, CA
1 day ago
Member of Technical Staff, Evals
$200k
...company's most important decisions. As a Member of Technical Staff on Evals, you will build both the... ...evaluations, data ablations, and dataset quality analysis, while designing and... ...collaborative team working on frontier AI systems Magic strives to be the place...
Quality
Visa sponsorship
Relocation package
Magic AI Corp.
San Francisco, CA
1 day ago
Member of Technical Staff (Platform)
$200k - $300k
...Member of Technical Staff (Platform) Title of Role: Member of Technical Staff (Platform) Location... .... With a focus on building robust AI-driven systems, this organization is... ...through to production, ensuring high-quality code and rapid deployment. Build and...
Quality
Work at office
Recruiting from Scratch
San Francisco, CA
3 days ago
Member of Technical Staff (Forward Deployed)
...servicing with the industry's most advanced AI credit-servicing agents. We are... ..., Product Hunt), Charlie Songhurst (Board Member, Meta), and Michael Jones (Former Chair, Huntington... ...features Write tests, enforce code quality, and participate in design/review...
Quality
Full time
Work experience placement
Internship
Worldwide
Krew Research
San Francisco, CA
4 days ago
Member of Technical Staff
...Member Of Technical Staff @ Lotus AI Lotus AI is a groundbreaking primary care app that integrates your medical records, AI, and real doctors to... ...and maintain data curation pipelines that produce high-quality training and evaluation datasets from clinical interactions...
Quality
Lotus Health
San Francisco, CA
8 days ago
Member of Technical Staff, Inference & Serving
$200k - $350k
...Member Of Technical Staff, Inference & Serving Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially... ...than today's LLMs, with best-in-class quality. We are the AI researchers and...
Quality
Immediate start
Flexible hours
Inception LLC
San Francisco, CA
1 day ago
Member of Technical Staff, Training Infra
$200k - $350k
...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world... ...efficiency than today's LLMs, with best-in-class quality. We are the AI researchers and engineers behind such...
Quality
Immediate start
Flexible hours
Inception LLC
San Francisco, CA
3 days ago
Member of the Technical Staff Applied AI, Frontend
...Member Of Technical Staff – Applied AI, Frontend Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster... ...processes, tools, and systems that scale productivity, code quality, and team impact. Recruit, mentor, and grow a high-...
Quality
Full time
Flexible hours
Stuut
San Francisco, CA
2 days ago
Member of Technical Staff - Full-stack
...and implement intuitive, high-quality frontend experiences - with strong... ...layers) Interest in AI systems and enthusiasm for building... ...on company direction and technical decisions High ownership and... ...sponsorship As a founding member, you'll help define the...
Quality
Visa sponsorship
Flexible hours
NeoSigma
San Francisco, CA
1 day ago
Senior Member of Technical Staff - Model Safety
We're partnering with a frontier AI research company on a search for a Member of Technical Staff focused on AI Safet y. The company is building next-generation open-weight foundation models with a mission to make advanced AI broadly accessible. Their team includes researchers...
Senior
Xcede
San Francisco, CA
7 hours ago
Member of Technical Staff, Infrastructure / DevOps
...foundational infrastructure to train specialized AI agents. We turn real-world data... ...feel like one seamless system. As a Member of Technical Staff, Infrastructure / DevOps, you will own... ...that directly shapes the quality, speed, and reliability of Plato's research...
Quality
Plato
San Francisco, CA
1 day ago
Member of Technical Staff - Mechanistic Interpretability
$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans... ...Design metrics and baselines for reward quality, including alignment with intended...
Quality
Work at office
Local area
Vmax
San Francisco, CA
16 hours ago
Member of Technical Staff - RL Infrastructure
$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all... ...where reliability, ownership, and code quality were central. Evidence of strong...
Quality
Work at office
Local area
Vmax
San Francisco, CA
2 days ago
Member of Technical Staff - Research
$150k - $300k
...team. About the role We’re hiring a Member of Technical Staff - Research to help advance the frontier... ...for training & evaluating autonomous AI agents Investigating failure modes of... ...experience shipping reliable, production-quality code Have a track record of...
Quality
Full time
Work at office
Cervin
San Francisco, CA
3 days ago
Member of Technical Staff, Post-Training
Member of Technical Staff, Post-Training Location: SF Bay Area or Tokyo, Japan Type: Full-time About... ...Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed... ...Help define, curate, or generate high‑quality post‑training datasets, including expert...
Quality
Full time
Radical Numerics
San Francisco, CA
4 days ago
Member of Technical Staff (AI Policy and Strategic Initiatives)
$200k - $330k
...is seeking an intrepid, polymathic Member of Technical Staff to take on one of the AI industry's most unique engineering... ...work directly with Perplexity's senior leadership to spearhead a broad portfolio... ...standards for the craft and quality of your work. Strong communication...
Quality
Perplexity
San Francisco, CA
7 hours ago
Member of Technical Staff, Search
...developers and enterprises who are building AI systems to power magical experiences... ...on our mission and shape the future! Member of Technical Staff, Search Why this role? We are looking... ...-edge search models. Gather high-quality retrieval datasets and optimize data pipelines...
Quality
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
16 hours ago
Member of Technical Staff
Full-Stack TypeScript | React | AI Agents | Forward-Deployed Engineering Location... ...Apple, Ramp, Stripe, and Meta. As a Member of Technical Staff , you will own products end‑to‑end... ...ambiguous environments Balance speed, quality, scalability, and customer impactHelp...
Quality
Work at office
Love Freedom Solution
San Francisco, CA
16 hours ago
Member of Technical Staff
$200k
Join to apply for the Member of Technical Staff role at Listen Labs . TL;DR: We are... ...: Listen Labs is an AI‑powered research platform that... ...making to be more nuanced than senior user researchers. Distributed... .... You have a high bar for quality: In a fast‑moving startup, output...
Quality
Flexible hours
Listen Labs
San Francisco, CA
1 day ago
Member of Technical Staff, Pretraining Science
Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location... ...Numerics Radical Numerics is an AI lab bringing the rigor of distributed... ...architectural ideas that improve representation quality and downstream performance. Study...
Quality
Full time
Radical Numerics
San Francisco, CA
1 day ago
Member of Technical Staff - Applied Research
Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress toward human... ..., shaping the industry standard for robust, high‑quality environments. Your work will inform how frontier labs...
Quality
Patronus AI, Inc.
San Francisco, CA
4 days ago
Member of Technical Staff, Platform
...looking for someone highly technical (our current team... ...Background Listen Labs is an AI-powered research... ...effective than the most senior user researchers. This... ...You have a high bar for quality: In a startup, moving fast... ...to Grow: As an early member of the team, you’ll have...
Quality
Flexible hours
Listen Labs
San Francisco, CA
1 day ago
Member of Technical Staff
$227.5k - $401k
...individuals who tackle unique technical challenges at scale and... ...Adyen is building a top-tier AI engineering team in San... ...financial technology sector. As a Member of Technical Staff, you will operate with a... ...alignment by setting high quality standards. Optimize and Scale...
Quality
Work at office
Immediate start
Relocation
Flexible hours
Adyen
San Francisco, CA
16 hours ago
Member of Technical Staff - Platform
...exceptional mid-level to senior engineer to join our... ...architecture reviews for other members of the team Help... ...needs Requirements Technical 2+ YOE: 2+ years of full... ...ship quickly with high quality. You should have a track... ...own company Interest in AI/ML systems and...
Quality
Full time
Work experience placement
Relocation
Relocation package
Shift work
PetsApp
San Francisco, CA
3 days ago
04 Member of Technical Staff, Document Understanding San Francisco
Member of Technical Staff, Document Understanding Join us and help shape the future of AI by architecting next-generation knowledge systems. Join us and help shape the future... ...evaluation frameworks, benchmarks, or data quality pipelines Experience with model serving frameworks...
Quality
Work at office
Remote work
LlamaIndex, Inc.
San Francisco, CA
16 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Member of Technical Staff, AI Quality. Be the first to apply!