Staff Engineer, Engineering Productivity & AI Quality
$253k - $308k
Harper
The Problem
36 million businesses in America need insurance - it's not optional. 77% are underinsured. 40% have no coverage at all. The distribution system failed them: too slow, too opaque, too confusing. Over 90% of commercial insurance is still human-led. We're building the inverse: 90%+ AI-led, pushing toward the higher 90s. Not by patching legacy workflows - by building AI that makes humans more effective, improves the customer experience, and eliminates friction at every step.

We're adding ~1,000 customers per month. We've grown 100x since last year. We're scaling toward Series B. AI-generated code volume has pulled forward the scaling problem - even with a 20-person engineering team, our coding agents create surface area, review burden, and architectural drift that look like a 100-person org. Build the rails before AI code volume turns every service into a rework trap. If we don't build the rails, the CTO becomes the rail. That doesn't scale.

The Thesis
Every great AI company ends up building the same invisible machine: the harnesses, tests, instructions, and review loops that let a small team ship with impossible leverage. At Harper, that machine is existential. Our agents write code, serve customers, assemble submissions, and make decisions that move revenue. If the rails are strong, twenty engineers can operate like one hundred. If the rails are weak, velocity turns into drag.

This is the founding seat for that machine. You'll turn the CTO's taste into systems: PR preflight, integration tests, architecture rules, agent instructions, eval gates, and feedback loops every engineer feels every day. The mission is simple: make the right way the easy way, and make Harper's engineering org compound with every ship.

The Role
Harper operates like a factory with a series of modules spanning the full lifecycle from intake through renewals.
Across them we run a stack of internal AI systems covering operator guidance, the operational backbone that matches risks to underwriters, autonomous communications, and voice AI for customer interactions. You own the rails underneath the factory - the CI gates, integration test harnesses, agent instructions, PR preflight, architecture linting, dev environment reliability, and dead-code cleanup that the entire engineering team builds against.

Three sub-disciplines live under this function:
- Harness Engineering - the meta-harness on top of our frontier coding agents, OpenClaw, Hermes, and our internal agents
- Developer Experience - CI/CD gates, build caching, merge queues, dev/staging/CI parity, internal developer platform, eval framework infrastructure
- AI Quality - eval suite design, golden datasets, LLM-as-judge graders, production trajectory monitoring, drift detection, anti-slop guardrails

What You'll Own
- CI/CD quality gates across Harper's most critical services - Define the minimum bar before code can merge
- Integration test harnesses anchored to real failure modes - Every repeated operational failure becomes a regression test, a validation, or an architecture rule
- The agent harness substrate - Sandbox lifecycle, tool routing, prompt/context layer, model-provider abstraction, multi-agent coordination
- Repo-level agent instructions and context hygiene - AGENTS.md per repo, canonical data model docs, banned patterns. The information environment our coding agents read.
- Automated PR preflight - Service impact summary, tests run, missing tests, model/migration changes, critical-path warnings. The robot that reviews every PR before a human does.
- Architecture-rule enforcement - Custom lints and structural tests that encode the CTO's taste mechanically. Once a rule is written down, it never has to be argued in PR comments again.
- Eval framework infrastructure - Pre-merge eval gating, experiment runs against curated datasets, production trajectory monitoring. All three wired together.
- Engineering metrics that matter - Rework rate, escaped defects, flaky test count, deploy rollbacks, time-to-confident-ship, AI-generated PR quality. Anti-vanity. Anti-LOC.

You Might Be a Fit If…
- You've built or scaled developer productivity, platform, build/test, CI/CD, or internal tooling systems at a high-growth startup or AI-infrastructure company
- You can write and review production code at a Staff level - this is not a process or PM role
- You have strong opinions about maintainability, architecture, testability, and developer experience - and you back them up with mechanical enforcement, not lectures
- You're excited by AI coding agents but skeptical enough to build the guardrails they need
- You can describe a specific lint rule, integration test, or eval-harness pattern you built that prevented a class of bugs from reaching production again
- You write code with AI daily and routinely manage 3+ parallel coding sessions
- You like creating leverage for other engineers more than owning a single product surface
- You're 8–12 years into your career, with 3+ years at the Senior+ level

If "Engineering Productivity" sounds like dashboards and roadmaps to you, this isn't it. We measure ourselves on rework prevented and confident-ship time, not artifacts produced.

Requirements
- 8+ years software engineering experience, including senior+ scope at a high-growth company
- Track record of building developer productivity, platform, CI/CD, build systems, test infrastructure, or internal tooling that other engineers actually adopted
- Production AI/ML systems experience - agent harness, eval frameworks, LLM-as-judge graders, prompt/context engineering - even if not your primary stack
- Strong written communication - RFCs, architecture-rule docs, lint-rule rationale, internal playbooks
- Based in San Francisco or willing to relocate

Nice to Have
- Built or contributed to eval-framework infrastructure (open-source or internal)
- Built developer platforms at an AI-native or high-growth company
- Custom lint-rule / structural-test authoring at scale
- Built or operated agent harnesses (sandboxing, isolation, agent execution environments)
- Worked alongside a CTO whose architectural taste needed to be encoded into mechanical rules

Compensation
- OTE: $253,000–$308,000 cash compensation (base salary + target performance bonus)
- Equity: competitive equity, so you share in the company you are helping build
- Location: San Francisco, in-office

Benefits
- Health, dental, and vision insurance
- Commuter benefits
- Team meals and snacks

The Process
- Founder call (15 min) - Mission, pace, scope you'd own
- CTO deep-dive (60 min) - Architecture-rule taste, eval-harness depth, real-world examples
- Super Day on-site - full-day simulation of working at Harper: code review, eval-harness design, dev-environment debug, cross-functional sessions, and founder/CTO time
- Founder + CTO offer conversation - No committee. Best offer, first.

To Apply
If you want to be the engineer whose lint rules, test harnesses, and PR preflight checks let a 100-person engineering org run on a 25-person team - send your resume, link to a developer platform / eval harness / lint rule system you built, and tell us about an architectural drift you stopped before it reached production.


