Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer, AI Evals

$240k - $280k

Sentry

Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster so we can get back to enjoying technology. With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney, Microsoft, and Atlassian spend less time fixing bugs and more time building products. Sentry embraces a hybrid work model across our global hubs, with Mondays, Tuesdays, and Thursdays set as in-office anchor days to encourage meaningful collaboration. If you like to selfishly build things that make your digital life better, come help us build the next generation of software monitoring tools. About the role As a Senior Software Engineer on Sentry’s AI/ML team, you’ll be responsible for building the evaluation infrastructure that measures the accuracy, reliability, and real‑world performance of our AI systems. This role is critical to ensuring that our debugging agents and AI‑powered features behave correctly, safely, and predictably as they scale. You’ll design datasets, benchmarks, and test harnesses that turn ambiguous AI behavior into measurable signals, helping the team ship AI with confidence. In this role you will Design and build robust evaluation frameworks to measure accuracy, reliability, regressions, and edge cases in AI systems. Create and curate high‑quality datasets, golden test cases, and benchmarks grounded in real production data. Build automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and agentic workflows. Partner closely with applied AI engineers and product leaders to define what “good” looks like and translate it into measurable criteria. Own the evaluation lifecycle for major AI initiatives, from early experimentation through production monitoring. You’ll love this job if you Care deeply about correctness, rigor, and measurement in AI systems. Enjoy turning fuzzy product goals and model behavior into concrete tests and metrics. Like building foundational infrastructure that unlocks faster iteration and higher confidence for the entire AI team. Thrive in cross‑functional environments and enjoy influencing model design through better evaluation. Qualifications Minimum 5+ years of professional experience with a Bachelor’s degree in computer science, machine learning, or a related field. Experience building testing, evaluation, or data infrastructure for complex systems (AI/ML experience strongly preferred). Comfort writing production‑quality code (we use Python and TypeScript). Experience working with structured and unstructured datasets, labeling workflows, or data quality pipelines. Familiarity with modern ML systems and evaluation techniques (e.g., offline metrics, online evaluation, regression testing for models or prompts). Bonus: experience evaluating LLMs, agentic systems, or AI‑assisted developer tools. The base salary range (or hourly wage range, if applicable) that Sentry reasonably expects to pay for this position is $240,000 to $280,000. A successful candidate’s actual base salary (or hourly wage) amount will be determined by a variety of relevant factors including, without limitation, the candidate’s work location, education, work and other relevant experience, skills, and job‑related knowledge. A successful candidate will be eligible to participate in Sentry’s employee benefit plans/programs applicable to the candidate’s position (including incentive compensation, equity grants, paid time off, and group health insurance coverage). See Sentry Benefits for more details about the Company’s benefit plans/programs. Equal Opportunity at Sentry Sentry is committed to providing equal employment opportunities to its employees and candidates for employment regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other legally‑protected characteristic. This commitment includes the provision of reasonable accommodations to employees and candidates for employment with physical or mental disabilities who require such accommodations in order to (a) perform the essential functions of their jobs, or (b) seek employment with Sentry. We strive to build a diverse team, with an inclusive culture where every teammate can thrive. Sentry is an open‑source company because we believe that everyone, everywhere, should have the ability and tools to make great software. Software should be accessible. That starts with making our industry accessible. #J-18808-Ljbffr Sentry

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer, AI Evals in San Francisco, CA vacancy
  • At Commure, we're building the AI Operating System for...  ...interactions by partnering with engineering and product teams on how models...  ...extensive experience Professional software development industry experience...  ...applications, including designing evals and improving performance... 
    Senior
    Immediate start

    Monograph

    San Francisco, CA
    11 hours ago
  •  ...The role Watershed is building the AI suite for companies to measure their emissions...  ...their business. We’re looking for software engineers to help build the AI platform that powers...  ...and improve agent behavior at scale Build evals, harnesses, and guardrails that turn agent... 
    Senior
    Work at office
    Remote work

    WaterSHED

    San Francisco, CA
    1 day ago
  •  ...freedom. About the Role We are hiring Senior Software Engineers to help shape the future of January’s platform...  ...reality. Consumer Voice – Owns Voice AI conversation flows for inbound and...  ..., plus the LLM platform infrastructure—evals, consumer context, and guardrails—that... 
    Senior
    Currently hiring
    Work at office

    January Service Company

    San Francisco, CA
    2 days ago
  •  ...type : Full-time Department : Engineering and Development Workplace...  ...Office Experience : 0 years Senior Software Engineer (Startup) About the...  ...intersection of web apps and modern AI systems. That means we`re...  ...depend on LLMs—prompt pipelines, evals, guardrails, retrieval, and... 
    Senior
    Full time
    Work at office
    Flexible hours
    Weekend work

    SproutsAI

    San Francisco, CA
    1 day ago
  • Monograph is seeking talented individuals to join their Ambient AI team in San Francisco, California. The role focuses on revolutionizing healthcare technology through AI solutions that enhance the entire care journey, from clinical documentation to billing automation.... 
    Senior

    Monograph

    San Francisco, CA
    4 days ago
  • $320k

     ...interpretable, and steerable AI systems. We want AI to be safe...  ...group of committed researchers, engineers, policy experts, and business...  ...measurement gaps, and evolve evals so they remain unsaturated and...  ...Qualifications 6+ years of industry software engineering experience.... 
    Work at office
    Visa sponsorship
    Flexible hours

    Aimling

    San Francisco, CA
    2 days ago
  • $155k - $195k

     ...intelligent agents ubiquitous. We help developers build mission-critical AI applications across the entire agent development lifecycle. Our...  ...their organization. Founded in 2023, LangChain powers top engineering teams at companies like Replit, Lovable, Clay, Klarna, LinkedIn... 
    Senior

    LangChain

    San Francisco, CA
    2 days ago
  • $160k - $180k

     ...platform to help professionals scale. Role Summary As a Senior Software Engineer, you’ll own major parts of our AI stack. You’ll prototype zero-to-one workflows,...  ...with retrieval, embeddings, experimentation, and evals Ability to design multi-step pipelines and agentic... 
    Senior
    Full time
    Contract work
    Temporary work
    Work experience placement
    Work at office

    Filevine

    San Francisco, CA
    2 days ago
  • $200k - $240k

    Your Role You are the engineering anchor of Zip’s new Internal AI team. The Internal AI team operates in a hub-and...  ...accelerate AI adoption across every non-software engineering use case at Zip and we...  ...patterns, data handling, identity, evals, observability. Be the single... 
    Senior
    Home office
    Flexible hours

    ZipHQ, Inc.

    San Francisco, CA
    3 days ago
  • You’ll build Traba’s product on top of AI agents—taking frontier models and existing...  ...on millions of shifts This is a product-engineering role first, so you’ll go where the priorities...  ...real product—prompting, tool wiring, evals. You’ve shipped AI features, even if you... 
    Senior
    Flexible hours
    Shift work

    Traba

    San Francisco, CA
    1 day ago
  • $190k - $230k

    At Sanity, we’re building the future of AI-powered Content Operations. Our AI Content...  ...a designer, product manager, and fellow engineers to move fast, experiment constantly, and...  ...with minimal human guidance. Design and run evals - build evaluation suites using... 
    Senior
    Work at office
    Flexible hours
    2 days per week

    Sanity CMS

    San Francisco, CA
    3 days ago
  • $180k - $220k

    Ironclad is the leading AI contracting platform that transforms...  ...Team: a small, high-leverage engineering group focused on fast-moving,...  ...modern AI systems, including evals, orchestration, prompting, retrieval...  ...to build and ship production software in fast‑paced environments.... 
    Senior
    Full time
    Contract work
    Work at office

    Ironclad Inc

    San Francisco, CA
    2 days ago
  • $226k - $306k

    About the Team The AI Platform team is a newly formed team at the center of Mixpanel...  ...system for Mixpanel AI that uses evals and metrics from production to optimize...  ...looking for an experienced and driven Senior Software Engineer to join our AI Platform team. You will... 
    Senior
    Remote work

    Jobr

    San Francisco, CA
    4 days ago
  • $153k - $246.4k

     ...physical lab space, R&D capabilities, AI/ML tools, and decades of enterprise learning...  ...from scratch. We’re looking for a Senior Software Engineer to be one of the early engineers on this...  ...the practical challenges: reliability, evals, context management, and when not to... 
    Senior
    Full time
    H1b
    Visa sponsorship
    Work visa
    Flexible hours

    Initial Therapeutics, Inc.

    San Francisco, CA
    1 hour ago
  • About Nooks.ai: Nooks is an applied AI lab building the Agent...  .... About the Role Sales software was built for humans. But we...  ...set of design partners. As a Senior Software Engineer, you'll build those agents and...  ...the user, and have clear evals that prove they work. Work... 
    Senior
    Work at office
    3 days per week

    Nooks

    San Francisco, CA
    2 days ago
  •  ...Commure, we're building the AI Operating System for healthcare...  ...from prompt design and evals through production infrastructure...  .... We're looking for a Senior Backend Engineer who takes ownership end-to-end...  ...decisions Work across the entire software stack Work with a stack... 
    Senior
    Full time
    Work at office
    Immediate start

    Monograph

    San Francisco, CA
    2 days ago
  • ~ Senior Software Engineer (Rust) at Symbolica – San Francisco, US Senior Software Engineer (Rust) at Symbolica – San Francisco, US About Us Symbolica is an AI research lab pioneering the application of category theory to enable logical reasoning in machines... 
    Senior
    Work at office
    Shift work

    Victrays

    San Francisco, CA
    1 day ago
  •  ...patients and providers connect, driven by AI, built to embrace the complexities of...  ...the Role We're on the hunt for driven Senior Engineers to join our team at an exciting stage of...  ...with healthcare professionals to tailor software solutions that meet real‑world needs.... 
    Senior
    Work at office

    Assort Health Inc.

    San Francisco, CA
    4 days ago
  •  ...About the Role: We’re looking for a software engineer to work with the founding team on building a category-defining product and scale it to...  ...enterprise customers. If you’re excited about the opportunity to build AI products, in close collaboration with large tech enterprises,... 
    Senior

    HRB

    San Francisco, CA
    4 days ago
  • $186.4k - $266.3k

     ...SF office two days/week. What You'll Do: We are looking for a software engineer who is proactive, collaborative, and pragmatic with a passion...  ...pragmatic solutions to complex problems Explore and integrate AI‑based development tools into daily workflows Work across service... 
    Senior
    Work at office
    2 days per week

    Jobr

    San Francisco, CA
    11 hours ago
  •  ...AI agents are changing how enterprises operate. Companies want to move fast with MCPs...  ...and Boost, and we are a small team, mostly engineers, shipping fast. We are building the...  ...product problem and turn it into working software. You write code that is clear, maintainable... 
    Senior

    CodeIntegrity, Inc.

    San Francisco, CA
    1 day ago
  •  ...reliability, and hardware constraints. Software sits at the center of everything we...  ...the role As a Backend Software Engineer at Droyd, you’ll own core parts of the...  ...You’ll work in person with a small, senior team across robotics, AI, and hardware. Your work will ship directly... 
    Senior

    Droyd

    San Francisco, CA
    11 hours ago
  •  ...operate more efficiently, eliminating complexity and friction with seamless automation. As a Senior Software Engineer at Capably, you’ll build the systems that make enterprise AI actually work in production. You’ll design, ship, and scale the core platform capabilities... 
    Senior
    Immediate start

    Capably

    San Francisco, CA
    11 hours ago
  •  ...This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers. She will pick the best candidates...  ...'s network The next step is to speak to Jack. Job Title: Senior Software Engineer Company Description: VC-backed B2B software platform Job... 
    Senior

    Jack & Jill/external Ats

    San Francisco, CA
    11 hours ago
  •  ...What you'll do: Contribute rapidly to our software platform that automates the lending...  ...architecture and design patterns Learn and drive engineering best practices Leverage automating...  ...full-stack development - machine learning, AI and natural language processing is a plus... 
    Senior

    F2 AI

    San Francisco, CA
    11 hours ago
  •  ...About Broccoli Broccoli is building the AI operating system for the $500B home services market. We deploy intelligent AI agents...  ...Looking For 4-8 years of experience in backend or full-stack software engineering Strong engineering fundamentals and product intuition... 
    Senior
    Work at office
    Immediate start

    Broccoli AI

    San Francisco, CA
    1 day ago
  •  ...About the Role We're seeking a Senior Software Engineer to join our in-person, fast-paced NYC startup. You'll be a full-stack developer with a frontend emphasis and a passion for AI and legal tech, building scalable, user-centric tools that help legal teams review contracts... 
    Senior
    Contract work
    Work at office

    Clera

    San Francisco, CA
    2 days ago
  •  ...more important — than ever, with AI enabling fraudsters to launch...  ...Compute team's mission: any engineer, using AI, should be able to stand...  ...-service by design. This is a software engineering role. You'll spend...  ...your design failure to fix. A Senior engineer will help define the... 
    Senior
    Full time
    For contractors
    Internship

    Persona

    San Francisco, CA
    11 hours ago
  • $200k - $220k

     ...Help lead a team of engineers to design, ship, and maintain user-facing and backend product...  ...based on your understanding of AI trends and software development. You are prepared to guide...  ...execute towards those goals. Role As a Senior Engineer at Pickaxe, you will design and... 
    Senior
    Full time
    Work at office
    Remote work

    Pickaxe

    San Francisco, CA
    11 hours ago
  •  ...harder — but more important — than ever, with AI enabling fraudsters to launch...  ...! About the role This isn't just another engineering role. This is a unique opportunity to be...  ...customers. What you'll bring to Persona A strong software engineering background, demonstrated by 1... 
    Senior
    Full time
    Temporary work
    For contractors
    Internship

    Persona

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer, AI Evals. Be the first to apply!