Senior Member of Technical Staff, AI Quality
$176k - $253kHarper
Senior Member of Technical Staff, AI Quality Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI onto insurance — we're rebuilding the entire business as software, on a simple bet: turning expert human judgment into compute is one of the largest transitions left to make, and a trillion-dollar industry still run 90% by hand is the place to prove it. We've grown ~100x in the last year and we move at that speed — on-site, in person, long days, very high standards. Almost no one joins Harper for insurance; they join to build the company that replaces how it works. The role Turning judgment into compute only compounds if the company can tell whether the compute is getting better. Today that's mostly vibes: an engineer ships a prompt change, a tool change, or a new model and judges it by feel — "seems better," "the demo passed." Vibes don't survive Series B, and they definitely don't survive an agent that's quoting real coverage for real businesses. Your job is to turn agent quality from a vibe into a number. Harper's agents handle intake, sales, service, voice, and submission packaging; every one needs to be evaluated, regression-tested, and monitored in production. You'll work alongside the engineer setting AI-quality direction and own a specific agent surface end-to-end — so that when the agent improves we know, and when it regresses we know before the customer does. That's how we scale judgment without scaling headcount. What you'll do Build capability + regression eval suites for your assigned agents — intake, submissions, placements, renewals, CRM, or voice. Curate golden datasets from real failure modes: real transcripts, real underwriter back-and-forth, real call recordings. 20–50 sharp cases per agent, not thousands of synthetic ones. Design graders. Deterministic first (string match, state check, tool-call assertions); LLM-as-judge where deterministic fails; human calibration on samples. Ship pre-merge eval gates. Every PR touching an agent, prompt, or tool runs the relevant suite in CI. Below threshold, it's blocked. Wire production trajectory monitoring. Online evaluators score live trajectories; drift gets caught within hours. Turn ops findings into permanent tests. Every flagged failure becomes a regression case; every repeat issue becomes a test that catches it forever. What we're looking for 3–6 years building software, with hands-on production LLM/agent eval experience — capability + regression suite design, LLM-as-judge graders, golden datasets. You can describe a specific regression an eval suite you built caught — and exactly how it would have leaked otherwise. You've designed an LLM-as-judge rubric that survived human calibration, and you debug a hallucination by reading transcripts, not aggregate dashboards. Familiar with at least one major eval framework; strong written communication (rubric docs, failure-mode taxonomies). You write code with AI daily and have real opinions on which agent behaviors actually matter. Bonus: open-source eval-framework contributions; red-team/adversarial testing; voice eval (latency, interruption, transcription accuracy); ML eval/observability background. The reality On-site in San Francisco, in person, long days, high standards. AI quality is the discipline that decides whether the whole bet holds, which means the work is scrutinized and the bar is high — your evals are what let everyone else ship fast without flying blind. The right person wants that leverage and that pace. Logistics Compensation (OTE): $176,000–$253,000 cash (base + target performance bonus), plus competitive equity. Location: San Francisco, in-office. Based here or willing to relocate. Benefits: Uber commuter benefits; breakfast, lunch, and dinner provided; snacks and coffee stocked; free gym membership; health, dental, and vision. Process: Founder call (15 min) → Tech Lead deep-dive (60 min, eval architecture and real failure modes) → Super Day on-site → founder + Tech Lead offer. No committee. Best offer, first. To apply: If you've turned vibes into a number — built an eval suite that caught a regression a model upgrade silently introduced — send your resume, the framework, and a transcript of a failure you found that nobody else did. #J-18808-Ljbffr Harper
- ...developers and enterprises who are building AI systems to power magical experiences... ...that matter, and join the team. As a Member of Technical Staff with a focus on Multimodal AI, you... ...and culture, fitness and well-being, quality time, and workspace improvement Remote...QualitySeniorFull timeWork at officeRemote workFlexible hours
$148.5k - $223.9k
...About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive... ...systems with customers. With your strong technical competence, strategic thinking and customer... ...and/or Java Build and ship high-quality, production-grade software using modern engineering...QualitySenior$220k - $250k
Our client, an AI-driven healthcare company focused on personalizing patient care, is hiring a Member of Technical Staff to join their team remotely. The successful candidate will help... ...medical information. ~Enhance search quality, citation accuracy and data lineage across...QualitySeniorPermanent employmentRemote work- ...Senior Member of Technical Staff Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI onto insurance - we're rebuilding the entire business as software, on a simple bet: turning expert human judgment into compute is one of the...SeniorWork at officeRelocation
- Member of Technical Staff - Infrastructure Security We're partnering with a frontier AI research company that is building next-generation open-weight foundation models with the mission of making advanced AI broadly accessible. Their team includes researchers, engineers,...Senior
- ...building the next generation of AI infrastructure: large-scale... ...Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern... ...techniques to drive performance and quality optimizations across the... ..., work alongside highly technical engineers, and help shape both...QualityInternship
$200k
...company's most important decisions. As a Member of Technical Staff on Evals, you will build both the... ...evaluations, data ablations, and dataset quality analysis, while designing and... ...collaborative team working on frontier AI systems Magic strives to be the place...QualityVisa sponsorshipRelocation package$200k - $300k
...Member of Technical Staff (Platform) Title of Role: Member of Technical Staff (Platform) Location... .... With a focus on building robust AI-driven systems, this organization is... ...through to production, ensuring high-quality code and rapid deployment. Build and...QualityWork at office- ...servicing with the industry's most advanced AI credit-servicing agents. We are... ..., Product Hunt), Charlie Songhurst (Board Member, Meta), and Michael Jones (Former Chair, Huntington... ...features Write tests, enforce code quality, and participate in design/review...QualityFull timeWork experience placementInternshipWorldwide
- ...Member Of Technical Staff @ Lotus AI Lotus AI is a groundbreaking primary care app that integrates your medical records, AI, and real doctors to... ...and maintain data curation pipelines that produce high-quality training and evaluation datasets from clinical interactions...Quality
$200k - $350k
...Member Of Technical Staff, Inference & Serving Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially... ...than today's LLMs, with best-in-class quality. We are the AI researchers and...QualityImmediate startFlexible hours$200k - $350k
...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world... ...efficiency than today's LLMs, with best-in-class quality. We are the AI researchers and engineers behind such...QualityImmediate startFlexible hours- ...Member Of Technical Staff – Applied AI, Frontend Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster... ...processes, tools, and systems that scale productivity, code quality, and team impact. Recruit, mentor, and grow a high-...QualityFull timeFlexible hours
- ...and implement intuitive, high-quality frontend experiences - with strong... ...layers) Interest in AI systems and enthusiasm for building... ...on company direction and technical decisions High ownership and... ...sponsorship As a founding member, you'll help define the...QualityVisa sponsorshipFlexible hours
- We're partnering with a frontier AI research company on a search for a Member of Technical Staff focused on AI Safet y. The company is building next-generation open-weight foundation models with a mission to make advanced AI broadly accessible. Their team includes researchers...Senior
- ...foundational infrastructure to train specialized AI agents. We turn real-world data... ...feel like one seamless system. As a Member of Technical Staff, Infrastructure / DevOps, you will own... ...that directly shapes the quality, speed, and reliability of Plato's research...Quality
$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans... ...Design metrics and baselines for reward quality, including alignment with intended...QualityWork at officeLocal area$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing AI capable of open-ended learning. We are building systems to exceed humans in all... ...where reliability, ownership, and code quality were central. Evidence of strong...QualityWork at officeLocal area$150k - $300k
...team. About the role We’re hiring a Member of Technical Staff - Research to help advance the frontier... ...for training & evaluating autonomous AI agents Investigating failure modes of... ...experience shipping reliable, production-quality code Have a track record of...QualityFull timeWork at office- Member of Technical Staff, Post-Training Location: SF Bay Area or Tokyo, Japan Type: Full-time About... ...Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed... ...Help define, curate, or generate high‑quality post‑training datasets, including expert...QualityFull time
$200k - $330k
...is seeking an intrepid, polymathic Member of Technical Staff to take on one of the AI industry's most unique engineering... ...work directly with Perplexity's senior leadership to spearhead a broad portfolio... ...standards for the craft and quality of your work. Strong communication...Quality- ...developers and enterprises who are building AI systems to power magical experiences... ...on our mission and shape the future! Member of Technical Staff, Search Why this role? We are looking... ...-edge search models. Gather high-quality retrieval datasets and optimize data pipelines...QualityFull timeWork at officeRemote workFlexible hours
- Full-Stack TypeScript | React | AI Agents | Forward-Deployed Engineering Location... ...Apple, Ramp, Stripe, and Meta. As a Member of Technical Staff , you will own products end‑to‑end... ...ambiguous environments Balance speed, quality, scalability, and customer impactHelp...QualityWork at office
$200k
Join to apply for the Member of Technical Staff role at Listen Labs . TL;DR: We are... ...: Listen Labs is an AI‑powered research platform that... ...making to be more nuanced than senior user researchers. Distributed... .... You have a high bar for quality: In a fast‑moving startup, output...QualityFlexible hours- Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location... ...Numerics Radical Numerics is an AI lab bringing the rigor of distributed... ...architectural ideas that improve representation quality and downstream performance. Study...QualityFull time
- Member of Technical Staff - Applied Research Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress toward human... ..., shaping the industry standard for robust, high‑quality environments. Your work will inform how frontier labs...Quality
- ...looking for someone highly technical (our current team... ...Background Listen Labs is an AI-powered research... ...effective than the most senior user researchers. This... ...You have a high bar for quality: In a startup, moving fast... ...to Grow: As an early member of the team, you’ll have...QualityFlexible hours
$227.5k - $401k
...individuals who tackle unique technical challenges at scale and... ...Adyen is building a top-tier AI engineering team in San... ...financial technology sector. As a Member of Technical Staff, you will operate with a... ...alignment by setting high quality standards. Optimize and Scale...QualityWork at officeImmediate startRelocationFlexible hours- ...exceptional mid-level to senior engineer to join our... ...architecture reviews for other members of the team Help... ...needs Requirements Technical 2+ YOE: 2+ years of full... ...ship quickly with high quality. You should have a track... ...own company Interest in AI/ML systems and...QualityFull timeWork experience placementRelocationRelocation packageShift work
- Member of Technical Staff, Document Understanding Join us and help shape the future of AI by architecting next-generation knowledge systems. Join us and help shape the future... ...evaluation frameworks, benchmarks, or data quality pipelines Experience with model serving frameworks...QualityWork at officeRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Member of Technical Staff, AI Quality. Be the first to apply!
- IT assistant San Francisco, CA
- desktop support analyst San Francisco, CA
- senior IT support technician San Francisco, CA
- personal computer support technician San Francisco, CA
- technical analyst San Francisco, CA
- customer support technician San Francisco, CA
- tech assistant San Francisco, CA
- technical support assistant San Francisco, CA
- customer support analyst San Francisco, CA
- remote (work from home) technical support representative San Francisco, CA

