Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer, AI Evals

$240k - $280k

Sentry

Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster so we can get back to enjoying technology. With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney, Microsoft, and Atlassian spend less time fixing bugs and more time building products. Sentry embraces a hybrid work model across our global hubs, with Mondays, Tuesdays, and Thursdays set as in-office anchor days to encourage meaningful collaboration. If you like to selfishly build things that make your digital life better, come help us build the next generation of software monitoring tools. About the role As a Senior Software Engineer on Sentry’s AI/ML team, you’ll be responsible for building the evaluation infrastructure that measures the accuracy, reliability, and real‑world performance of our AI systems. This role is critical to ensuring that our debugging agents and AI‑powered features behave correctly, safely, and predictably as they scale. You’ll design datasets, benchmarks, and test harnesses that turn ambiguous AI behavior into measurable signals, helping the team ship AI with confidence. In this role you will Design and build robust evaluation frameworks to measure accuracy, reliability, regressions, and edge cases in AI systems. Create and curate high‑quality datasets, golden test cases, and benchmarks grounded in real production data. Build automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and agentic workflows. Partner closely with applied AI engineers and product leaders to define what “good” looks like and translate it into measurable criteria. Own the evaluation lifecycle for major AI initiatives, from early experimentation through production monitoring. You’ll love this job if you Care deeply about correctness, rigor, and measurement in AI systems. Enjoy turning fuzzy product goals and model behavior into concrete tests and metrics. Like building foundational infrastructure that unlocks faster iteration and higher confidence for the entire AI team. Thrive in cross‑functional environments and enjoy influencing model design through better evaluation. Qualifications Minimum 5+ years of professional experience with a Bachelor’s degree in computer science, machine learning, or a related field. Experience building testing, evaluation, or data infrastructure for complex systems (AI/ML experience strongly preferred). Comfort writing production‑quality code (we use Python and TypeScript). Experience working with structured and unstructured datasets, labeling workflows, or data quality pipelines. Familiarity with modern ML systems and evaluation techniques (e.g., offline metrics, online evaluation, regression testing for models or prompts). Bonus: experience evaluating LLMs, agentic systems, or AI‑assisted developer tools. The base salary range (or hourly wage range, if applicable) that Sentry reasonably expects to pay for this position is $240,000 to $280,000. A successful candidate’s actual base salary (or hourly wage) amount will be determined by a variety of relevant factors including, without limitation, the candidate’s work location, education, work and other relevant experience, skills, and job‑related knowledge. A successful candidate will be eligible to participate in Sentry’s employee benefit plans/programs applicable to the candidate’s position (including incentive compensation, equity grants, paid time off, and group health insurance coverage). See Sentry Benefits for more details about the Company’s benefit plans/programs. Equal Opportunity at Sentry Sentry is committed to providing equal employment opportunities to its employees and candidates for employment regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other legally‑protected characteristic. This commitment includes the provision of reasonable accommodations to employees and candidates for employment with physical or mental disabilities who require such accommodations in order to (a) perform the essential functions of their jobs, or (b) seek employment with Sentry. We strive to build a diverse team, with an inclusive culture where every teammate can thrive. Sentry is an open‑source company because we believe that everyone, everywhere, should have the ability and tools to make great software. Software should be accessible. That starts with making our industry accessible. #J-18808-Ljbffr Sentry

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer, AI Evals in San Francisco, CA vacancy
  •  ...industry. Our growing suite of AI solutions spans ambient AI...  ...looking for a talented backend engineer to help take Ambient AI to the...  ...experience ~3+ years of professional software development industry...  ...applications, including designing evals and improving performance... 
    Senior
    Work at office

    Commure

    San Francisco, CA
    3 days ago
  • $170k - $240k

     ...At Commure, we're building the AI Operating System for...  ...interactions by partnering with engineering and product teams on how models...  ...experience ~3+ years of professional software development industry...  ...applications, including designing evals and improving performance through... 
    Senior
    Work at office
    Immediate start

    Commure

    San Francisco, CA
    17 days ago
  • $175k - $225k

     ...Senior Backend Engineer In person 5 days/week in San Francisco, Boston, MA, New York. We are looking...  ...power LangChain's observability and evals platform. You will work on the core...  ...developers to monitor and evaluate their AI applications at scale. While the focus... 
    Senior
    Work at office
    Flexible hours

    LangChain

    San Francisco, CA
    1 day ago
  • $175k - $240k

     ...Senior Fullstack Engineer In person 5 days/week in San Francisco We're looking...  ..., an observability and evals platform. In this role, you'...  ...~5+ years of experience in software engineering working on complex...  ...~ Bonus: Experience with AI/LLM-powered applications, developer... 
    Senior
    Work at office
    Flexible hours

    LangChain

    San Francisco, CA
    1 day ago
  • $240k - $280k

     ...About Sentry Software runs the world and the pace is faster than ever. Sentry helps developers fix...  ...monitoring standard and our team is building its AI-native future. About the role As a Senior Software Engineer on Sentry's AI/ML team, you'll be responsible... 
    Senior
    Hourly pay

    Sentry

    San Francisco, CA
    3 days ago
  • $175k - $240k

     ...our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown... 
    Senior
    Work at office
    Flexible hours

    LangChain, Inc

    San Francisco, CA
    1 day ago
  •  ...hardware, by developing the first AI Hardware Engineer. Our goal is to democratize...  ...first AI Hardware Engineer, software that can design real,...  ...electronics from a prompt. As a Senior Software Engineer, Agentic...  .... Build and analyze evals; translate data into engineering... 
    Senior
    Remote work
    Shift work

    Flux Protocol

    San Francisco, CA
    22 hours ago
  • $230k

     ...Software Engineer This role blends traditional software engineering, agent management, and system...  ...be the expert on agentic harnesses, evals, and best practices. Your experience wielding...  ...what we're building more than ever. AI demand is driving dirty energy generation... 
    Senior
    Work at office
    Visa sponsorship

    Gravity

    San Francisco, CA
    2 days ago
  • $190k - $250k

     ...Job Description Filevine is a Legal AI company delivering Legal Operating Intelligence...  ...country. Role Summary:   As a Senior Software Engineer, you’ll own major parts of our AI stack...  ..., embeddings, experimentation, and evals   ~ Ability to design multi-step pipelines... 
    Senior
    Full time
    Contract work
    Temporary work
    Work experience placement

    Filevine

    San Francisco, CA
    9 days ago
  • $98.81k - $148.16k

     ...forBoltlineby Stoke Space, our commercial software platform that helps advanced hardware...  ..., build, and test efficiently. As a Senior AI Engineer on theBoltlineteam, you will design, build...  ...comprehensive evaluation frameworks (evals) to measure, monitor, and improve AI system... 
    Senior
    Permanent employment
    Full time

    GrabJobs

    San Francisco, CA
    22 hours ago
  •  ...Description Filevine is a Legal AI company delivering Legal...  ...As a Staff Full Stack Engineer on the contract review and...  ...embeddings, prompt libraries, and evals Collaborate directly...  ...Ship high-quality, reliable software quickly in a small, senior team Improve performance... 
    Senior
    Full time
    Contract work
    Temporary work
    Work at office

    Filevine

    San Francisco, CA
    9 days ago
  • $50 - $150 per hour

    A leading AI company is seeking a software engineer to review and evaluate model-generated code. This contract role requires several years of software engineering experience, particularly as a full-stack engineer at notable tech firms. You will assess code quality and provide... 
    Senior
    Hourly pay
    Contract work
    Flexible hours

    Turing

    San Francisco, CA
    2 days ago
  • A dynamic technology firm is seeking a Senior Software Program Engineer to lead software development efforts. You will work collaboratively across teams...  ...years of engineering experience and be eager to leverage AI tools. The role is fully remote, allowing for flexible... 
    Senior
    Remote work
    Flexible hours

    The10minutecareersolution

    San Francisco, CA
    22 hours ago
  • $226k - $306k

     ...Pinterest, LG, and Rakuten Viber, Mixpanel’s AI-first digital analytics help teams...  ...system for Mixpanel AI that uses evals and metrics from production to optimize...  ...looking for an experienced and driven Senior Software Engineer to join our AI Platform team. You will... 
    Senior
    Full time
    Contract work
    Remote work

    Mixpanel

    San Francisco, CA
    22 hours ago
  • A forward-thinking technology company in San Francisco is seeking a Senior Software Engineer to develop the next generation of AI systems. The ideal candidate has over 8 years of experience, strong skills in TypeScript and React, and a collaborative mindset. Responsibilities... 
    Senior
    Remote work

    Jobleads-US

    San Francisco, CA
    26 days ago
  • $50 - $150 per hour

    A leading AI company in San Francisco is seeking a Mid-Senior level contractor to improve large language model performance through software engineering expertise. The role involves leading projects, evaluating code quality, and collaborating with the team. Ideal candidates... 
    Senior
    Contract work
    For contractors
    Flexible hours

    Turing

    San Francisco, CA
    2 days ago
  • A leading AI research accelerator is looking for a skilled software engineer based in the US or Canada. This contractor role involves evaluating AI-generated code, collaborating with teams, and designing verification mechanisms. Candidates should have over 5 years of experience... 
    Senior
    For contractors
    Remote work
    10 hours per week

    Turing

    San Francisco, CA
    1 day ago
  • A leading AI research accelerator in San Francisco is hiring an entry-level contractor for software engineering tasks. Responsibilities include refining AI-generated code, enhancing coding solutions, and designing verification mechanisms. Ideal candidates will have over... 
    Senior
    Contract work
    For contractors
    Remote work
    10 hours per week
    Flexible hours

    Turing

    San Francisco, CA
    5 days ago
  •  ...Senior Software Engineer San Francisco | Fully onsite | Early-stage AI startup A YC-backed AI startup is hiring Senior Software Engineers to help build AI-powered voice and communication systems for major financial institutions. The role focuses heavily on backend... 
    Senior

    Acceler8 Talent

    San Francisco, CA
    22 hours ago
  •  ...continuous and deeply human. Heidi is building an AI Care Partner that works alongside...  ...possible. We’re a team of doctors, engineers, designers, researchers, and creatives building...  ...What we’re looking for ~5+ years of software engineering experience , with a track... 
    Senior
    Work at office
    Worldwide

    Heidi Health

    San Francisco, CA
    22 hours ago
  •  ...personalized smart mattress cover that tracks your sleep and uses AI to cool and heat your body to the perfect core body...  ...a full-time on-site role located in San Francisco, CA for a Senior Software Engineer. You will be responsible for building on our backend which serves... 
    Senior
    Full time
    Visa sponsorship

    Orion Sleep

    San Francisco, CA
    1 day ago
  •  ...Join Our Fast-Growing Startup 1/ Join a fast-growing startup before Series A, bringing AI to the $1T maps and geospatial industry. 2/ Work with technical founders who have led Eng, Product, Marketing teams at FAANG and Series C+ companies. 3/ Build systems that... 
    Senior
    Work at office

    Reprompt

    San Francisco, CA
    2 days ago
  •  ...healthcare technology company based in San Francisco is seeking a Senior Full-Stack Engineer to build and optimize core research infrastructure. This...  ..., enhancing efficiency, and empowering doctors and patients by leveraging cutting-edge AI technologies. #J-18808-Ljbffr
    Senior

    Sully

    San Francisco, CA
    22 hours ago
  • A leading AI research accelerator in San Francisco is looking for an experienced Software Engineer to evaluate AI-generated code for efficiency and reliability. You will work closely with cross-functional teams to develop innovative coding solutions and design verification... 
    Senior
    For contractors
    Remote work
    10 hours per week
    Flexible hours

    Turing

    San Francisco, CA
    22 hours ago
  • $320k - $405k

     ...create reliable, interpretable, and steerable AI systems. We want AI to be safe and...  ...growing group of committed researchers, engineers, policy experts, and business leaders working...  ...that shapes how engineers build and ship software. The team works across several areas... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    8 days ago
  •  ...us Alpha is a product studio focused on the intersection of AI and consumer social – backed by a16z and many of the top investors...  ...d love to talk. The Role We’re hiring an early to mid-level Senior Backend Engineer for our Core Team that focuses on the live Clubhouse app in... 
    Senior
    Remote work

    GrabJobs

    San Francisco, CA
    22 hours ago
  • $320k - $405k

     ...Senior Software Security Engineer San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    2 days ago
  •  ...Endeavor is real-world enterprise AI. Our software is the distribution layer for powerful AI models that will build the world....  ...will win. And we work really hard to win. The Role | Senior Software Engineer As a Senior Software Engineer at Endeavor, you will design... 
    Senior

    Endeavor AI, Inc

    San Francisco, CA
    4 days ago
  • $200k - $270k

     ...Senior Software Engineer Title of Role: Senior Software Engineer Location: New York, New York, Hybrid Company Stage of Funding: Venture...  ...major features from concept to launch. Familiarity with AI or data tools, particularly in building solutions for... 
    Senior
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    3 days ago
  • $200k - $240k

    Traba is the AI operating layer for the industrial supply...  ...seeking an entrepreneurial Senior Applied Agent Engineer to join as a founding member...  ...structured tools, written evals, and tuned prompts against...  ...5+ years of professional software engineering experience, with... 
    Senior
    Full time
    Temporary work
    Local area
    Flexible hours
    Shift work
    Day shift

    Traba

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer, AI Evals. Be the first to apply!