Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Engineer, AI Evals

Sema4.ai, Inc.

The Opportunity At Sema4.ai , we’re building an Enterprise AI Agent platform that fundamentally changes how knowledge work gets done—by enabling people and AI agents to collaborate in durable, trustworthy ways. As a Staff Engineer, AI Evals , you’ll design and own the evaluation systems that determine whether our agents are actually good: correct, reliable, efficient, and improving over time. You’ll build the measurement backbone that guides model choice, agent design, product decisions, and customer trust. This is an early, high-impact role. You’ll be defining how we measure success for AI agents in production, where ambiguity is real, and ground truth can be messy. We’re looking for an engineer who brings rigor, judgment, and strong opinions about what “good” looks like, and who know how to operationalize it. Who You Are AI Systems & Evaluation Expert You understand that AI systems are only as good as the way they’re measured. You’ve worked with LLMs and agentic systems in production and have seen how offline benchmarks, synthetic data, and human judgment can all fail in different ways. You know how to design evaluations that are meaningful, repeatable, and decision-useful, not just theoretically impressive. You’re familiar with the sharp edges: non-determinism, prompt drift, regression risk, overfitting, data leakage, and the tension between fast iteration and statistical rigor. In-Depth Technologist You stay close to research and industry practice in evaluation, alignment, and reliability. You understand where automated metrics work, where they break down, and how to combine them with human review, golden datasets, and production signals. You bring creativity to building evaluation sets and scenarios, and in sourcing (or synthesizing) the data you need. Builder With High Standards You care deeply about correctness, clarity, and operational behavior. You can move fast, but you don’t confuse speed and rigor. You design eval systems that engineers trust, product relies on, and leadership uses to make decisions. You know when to build custom infrastructure and when to leverage existing tools without outsourcing critical thinking. What You’ll Do Build and Own the Evaluation Platform Design, build, and operate Sema4.ai’s core evaluation infrastructure for LLMs and agents: offline benchmarks, regression tests, task-level metrics, and production feedback loops. These systems will directly inform product launches, model upgrades, and customer requirements. Define “Good” for Agents in Production Work closely with agent, product, and field engineering teams to translate fuzzy goals around correctness, reliability, usefulness into concrete, measurable signals. You’ll help define success criteria for new capabilities and ensure we can detect regressions before customers do. Tackle Ambiguous, High-Leverage Problems Solve hard problems where the answer isn’t obvious: How to evaluate long-running, multi-step agents How to balance automated scoring with human judgment How to measure improvement when tasks evolve How to compare models under cost and latency constraints Influence Technical and Product Direction Use evaluation results to guide architectural decisions, model selection, and roadmap tradeoffs. You’ll participate in design reviews, set technical standards for eval rigor, mentor other engineers, and help interview senior technical candidates. What You Bring 7+ years of software engineering experience, including 2+ years building AI/ML systems in production Deep experience with backend systems in Python, including data pipelines, observability, and reliability Hands‑on experience evaluating LLM-based systems (agents, retrieval, tool use, workflows, etc.) Strong intuition for metrics, experimentation, and failure analysis in non‑deterministic systems Strong communication skills: whether you’re talking to colleagues, customers, or machines, you communicate clearly, concisely, and collaboratively A high‑ownership mindset: you care deeply about the integrity of the systems you build and the decisions they inform #J-18808-Ljbffr Sema4.ai, Inc.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Staff Engineer, AI Evals in Atlanta, GA vacancy
  • A forward-thinking AI company is seeking a Staff Engineer for AI Evals in Atlanta, Georgia. In this high-impact role, you will design and own the evaluation platform for AI agents and LLMs, ensuring accuracy and reliability. Applicants should have over 7 years of software... 
    Suggested

    Sema4.ai, Inc.

    Atlanta, GA
    1 day ago
  •  ...A Little About Us EDB provides a data and AI platform that enables organizations to harness the full power of Postgres...  ...observability. For more information, visit Job Summary As a Staff Security Engineer at EDB, you will be a technical leader with a developer-... 
    Suggested
    Remote work

    EDB

    Atlanta, GA
    16 days ago
  • $184k - $241.5k

     ...POSITION Our roster has an opening with your name on it As a Staff Security Engineer on our Product Security team, you'll define and deliver...  ...our engineering strategy. ~ Set the direction for AI/LLM security architecture across FanDuel by defining the controls... 
    Suggested
    Temporary work
    Local area
    Worldwide
    Shift work

    FanDuel

    Atlanta, GA
    5 days ago
  • $218.03k - $256.5k

     ...within the IAM program, partnering with Engineering, IT, Platform, and business teams to architect...  ...middleware and machine learning or AI models to automate complex access lifecycles...  ..., or systems architecture, with a deep, Staff-level focus on Identity and Access Management... 
    Suggested
    For contractors
    Local area

    Coinbase

    Atlanta, GA
    8 days ago
  • $152.2k - $209.2k

     ...and real-time delivery platforms. As a Senior Operational Support Engineer, you provide technical and operational leadership for the most...  ...platform architecture with operability and resilience in mind  AI-Driven Operations, Automation & Tooling  Lead adoption of AI-... 
    Suggested
    Full time
    Local area
    Worldwide
    Flexible hours
    Shift work
    Night shift

    Dolby

    Atlanta, GA
    1 day ago
  • $218.03k - $256.5k

     ...infrastructure and platform services. This role partners closely with engineering teams to design, implement, and automate cutting-edge security...  ...are agreeing to arbitration of disputes as outlined here. AI Disclosure For select roles, Coinbase is piloting an AI tool... 
    Local area

    Coinbase

    Atlanta, GA
    9 days ago
  •  ...Staff Operational Support Engineer A leading video, audio, and voice technologies company Staff Operational Support Engineer to take ownership of...  ...DevOps teams. Enhance operational efficiency by leveraging AI-driven tools for incident triage, alert correlation,... 
    Shift work
    Night shift
    Rotating shift

    Avispa

    Atlanta, GA
    3 days ago
  • $184k - $241.5k

    THE POSITION As a Staff Security Engineer on our Product Security team, you'll define and deliver multi-year security initiatives and set the direction...  ...input into our engineering strategy. Set the direction for AI/LLM security architecture across FanDuel by defining the... 
    Temporary work
    Local area
    Shift work

    FanDuel

    Atlanta, GA
    4 days ago
  • $136.5k - $187.4k

     ...player, and real‑time delivery platforms. As an Operational Support Engineer (L2), you take end‑to‑end ownership of customer‑impacting...  ...or configuration changes through codified workflows. Leverage AI tools and automation to enhance operational efficiency and incident... 
    Full time
    Local area
    Immediate start
    Worldwide
    Flexible hours
    Shift work
    Night shift

    Dolby Laboratories

    Atlanta, GA
    9 hours ago
  • $140.6k - $175.8k

     ...of the outdoors and a desire to protect it for future generations. Role Summary As a Security Engineer at Rivian, you will spearhead the adversarial evaluation of our AI-enabled features and internal platforms. This role will operate across Offensive Security, Secure... 
    Full time
    Contract work
    Temporary work
    Part time
    Local area
    Shift work

    Rivian

    Atlanta, GA
    5 days ago
  • $159k - $208.95k

     ...FanDuel, data is the heartbeat of our organization. As a Staff Machine Learning Engineer at FanDuel, you will help us unlock the full potential of...  ...Scientists and Analysts to productionize, analyze, and validate AI powered insights. You will be asked to help organize,... 
    Temporary work
    Local area
    Worldwide

    Omaze

    Atlanta, GA
    3 days ago
  • $175k

    About The Role As a Staff Process Engineer at Veho, you will be a core practitioner within our Process Engineering team, sitting at the intersection...  ...observe, troubleshoot, and iterate until adoption is real. AI Tooling & Vibe Coding — Operator‑Facing Improvements You will... 
    Work at office
    Relocation
    Flexible hours
    Shift work

    Veho

    Atlanta, GA
    2 days ago
  • $160k - $210k

    Join to apply for the Full Stack Staff Engineer role at LAHZO Overview At Lahzo, we help companies with complex sales cycles grow revenue more efficiently by combining targeted marketing with AI-powered sales agents. Our technology guides buyers through the entire journey... 
    Immediate start
    Remote work
    Flexible hours

    LAHZO

    Atlanta, GA
    1 day ago
  • $175k

    A logistics technology company is seeking a Staff Process Engineer in Atlanta to oversee process engineering within the warehouse network. This role involves standardizing workflows, optimizing facility layouts, and acting as the liaison between Ground Operations and Technology... 

    Veho

    Atlanta, GA
    2 days ago
  • The Opportunity At Sema4.ai, we’re building an Enterprise AI Agent platform that fundamentally changes how knowledge work gets...  ...AI agents to collaborate in durable, trustworthy ways. As a Staff Backend Engineer on the Agent Platform , you will be the engineering... 

    Sema4.ai, Inc.

    Atlanta, GA
    1 day ago
  • $180.8k - $209.2k

     ...range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering...  ...otherwise unimaginable. We’re looking for a talented Staff Architect who is excited to advance the state of the art in... 
    Full time
    Temporary work
    Local area
    Worldwide
    Flexible hours

    Dolby

    Atlanta, GA
    3 days ago
  • $115.78k - $215.02k

     ...Role : We are seeking a Cloud Security Engineer with expertise in cloud security...  ...vulnerabilities and threats. Use advanced tools, AI, machine learning, or custom-built scripts...  ...forward without having direct control of staff. Then help us create the future with one... 
    Temporary work
    Work at office
    Local area
    3 days per week

    Warner Bros. Discovery

    Atlanta, GA
    5 days ago
  •  ...Staff Platform Engineer Roark is currently building out its data and analytics team, with the goal of better leveraging advanced analytic solutions...  ...Data and Analytics team in the buildout of analytics and AI products. An ideal candidate will have full stack experience... 

    Roark Capital

    Atlanta, GA
    4 days ago
  • $165k - $240k

     ...better, brighter future for the next generation depends on it. Staff Android Systems Engineer - Lead the development of Android-based products running on...  ...insights for engineering, QA, and leadership Apply AI tooling to real engineering problems in development, diagnostics... 
    Local area
    Remote work
    Flexible hours
    Day shift

    Capitolis

    Atlanta, GA
    4 days ago
  • $160k - $210k

    A leading AI startup in Atlanta is seeking a Full Stack Staff Engineer to design and implement robust systems across the tech stack, using TypeScript, SQL, MongoDB, and Redis. The ideal candidate has extensive experience in full-stack development and a passion for mentoring... 
    Remote job
    Flexible hours

    LAHZO

    Atlanta, GA
    9 hours ago
  •  ...Associate Staff Engineer - QA Automation Engineer We are a Digital Product Engineering company that is scaling in a big way! We build products...  ..., smoke tests, and performance test planning. Leverage AI-powered tools (GitHub Copilot, Copilot Chat, MCP-integrated agents... 

    Nagarro

    Atlanta, GA
    3 days ago
  • $180k - $220k

     ...Job Description Job Description Who We Are AI is changing how software gets built. Code production is becoming a commodity....  ...team grows. This is a hands-on role with broad influence across engineering, cloud platform, and customer-facing teams. The SRE team will... 
    Remote work
    Work from home
    Shift work

    Gradle Technologies

    Atlanta, GA
    a month ago
  •  ...built the most complete cloud analytics and data platform for AI. By delivering harmonized data, trusted AI, and faster innovation...  ...large-scale enterprise environments. As a member of our AI engineering team, you’ll play a critical role in designing and deploying advanced... 
    Permanent employment
    Flexible hours

    Teradata Corporation (SE)

    Atlanta, GA
    9 hours ago
  • $185.1k - $335.3k

     ...critical foundation for localization, perception, simulation, and autonomy at scale. The Role We are looking for a Staff Machine Learning Engineer to serve as a technical leader for automated map reconstruction within our Mapping Engineering team. In this role,... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Atlanta, GA
    5 days ago
  • ATX Venture Partners is looking for a Staff Backend Software Engineer in Atlanta, Georgia. In this role, you'll be responsible for architecting and...  ...developing scalable applications while actively integrating AI technologies. You will also mentor junior engineers and... 

    ATX Venture Partners

    Atlanta, GA
    2 days ago
  • $159.6k - $296.4k

     ...connected place. Your New Role … CNN is seeking a Sr. Staff Data Engineer to serve as the technical authority for CNN’s Data Platform —...  ...foundation that powers analytics, data science, machine learning, and AI across CNN’s digital products. You will define and execute... 
    Temporary work
    Local area

    Warner Bros. Discovery

    Atlanta, GA
    15 days ago
  • $159k - $208.95k

    FanDuel is seeking a Staff Machine Learning Engineer based in Atlanta, Georgia. In this role, you will oversee the development of intelligent search systems and scalable machine learning applications. With a focus on collaboration and engineering excellence, you will contribute... 

    Omaze

    Atlanta, GA
    3 days ago
  • $146.44k - $271.96k

     ...Senior Staff Data Engineer The Senior Staff Data Engineer is accountable for designing and delivering data pipelines that process billions of ad events daily across WBD's global ecosystem. This role sets technical standards for distributed data engineering, drives modernization... 
    Temporary work

    Warner Bros.

    Atlanta, GA
    5 days ago
  • A leading AI technology firm in Atlanta is seeking a Staff Backend Engineer to build production-grade agent infrastructure and ensure scalability and operability. The role requires 7+ years of backend experience, proficiency in Python, and familiarity with LLM or agent-... 

    Sema4.ai, Inc.

    Atlanta, GA
    1 day ago
  • $120k - $190k

    Home Depot is seeking a Staff Software Engineer based in Atlanta, Georgia. The role involves leading a team to build scalable machine learning solutions and providing technical direction on model development and lifecycle management. Candidates should have 3-6 years of... 

    Home Depot

    Atlanta, GA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Engineer, AI Evals. Be the first to apply!