Agentic AI Evaluation Engineer

$142.65k - $213.98k

Comcast Service Center

Job Summary

The Agent Evaluation team is responsible for testing whether AI agents return the correct and expected responses. We build the framework, metrics, and test cases that validate agent behavior, accuracy, and reliability before release. Our goal is to ensure agents perform consistently and meet product and user expectations.

Job Description

  • Design and develop agent evaluation pipelines across development, staging, and production environments
  • Define and standardize evaluation metrics and benchmarks for conversational AI quality (accuracy, relevance, CX, safety)
  • Build automated and human-in-the-loop evaluation systems to assess agent performance
  • Manage and curate evaluation datasets, test sets, and annotation workflows
  • Enable continuous evaluation and monitoring of agents in production
  • Integrate evaluation into CI/CD pipelines to support safe and efficient releases
  • Conduct experiments, A/B testing, and case studies to drive improvements in agent quality
  • Partner with engineering and product teams to deliver high-quality AI solutions
  • Create technical documentation and drive best practices across teams
  • Mentor junior engineers and contribute to team growth

Preferred Skills

  • Experience in customer support AI or chatbot platforms
  • Understanding of responsible AI (bias, fairness, hallucination mitigation)

Disclaimer

This information has been designed to indicate the general nature and level of work performed by employees in this role. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities and qualifications. Comcast is an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law.

Skills

CI/CD; Machine Learning (ML); Large Language Models (LLMs); Curious Mindset; AI Agents; Benchmarking; Evaluation Metrics

Salary

Primary Location Pay Range: $142,651.46 - $213,977.19

Benefits

Base pay is one part of the Total Rewards that Comcast provides to compensate and recognize employees for their work. Most non‑sales positions are eligible for a Bonus. Additionally, Comcast provides best‑in‑class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That’s why we provide an array of options, expert guidance and always‑on tools that are personalized to meet the needs of your reality, to help support you physically, financially and emotionally through the big milestones and in your everyday life. Please visit the compensation and benefits summary on our careers site for more details.

Education

Bachelor's Degree

While possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.

Relevant Work Experience

5-7 Years

Comcast

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the Agentic AI Evaluation Engineer in Washington DC vacancy
  • $142.65k - $213.98k

    Job Summary The Agent Evaluation team is responsible for testing whether AI agents return the correct and expected responses. We build the framework, metrics...  ...drive improvements in agent quality. Partner with engineering and product teams to deliver high-quality AI solutions... 
    Suggested

    Blueface Ltd

    Washington DC
    4 days ago
  • $99k - $225k

    Agentic AI Engineer Job Number: R0240706 The Opportunity: As an experienced engineer, you know how to design, develop, and deliver production...  ...-augmented generation (RAG), agentic orchestration, evaluation pipelines, and human-in-the-loop systems to deliver measurable... 
    Suggested
    Full time
    Contract work
    Part time
    Local area

    Phase2 Technology

    Arlington, VA
    1 day ago
  • $160k - $195k

     ...Space builds next-generation AI systems that help “supercharge...  ...platform. We are seeking an engineer who thrives in a dynamic, fast...  ...and traceability. Develop evaluation and regression testing for agent...  ...to identify and prioritize agentic automation opportunities tied... 
    Suggested
    Temporary work
    Work at office
    Flexible hours

    Valid8 Financial, Inc.

    Washington DC
    3 days ago
  • $176k - $265k

     ...Location Type Remote Department Engineering Compensation Zone 1 $195K -...  ...rebuilding biotech for the AI era. When a breakthrough is...  ...‑functional and company‑wide agentic AI applications that span departmental...  ...memory and state management, evaluation, and observability. Make... 
    Suggested
    Full time
    Local area
    Remote work

    Albert Invent

    Washington DC
    3 days ago
  • Agentic AI Engineer Location: Remote / Alexandria, VA Clearance: Eligibility to be cleared Are you ready to be part of a team that creates...  ...practices in developing agent-based AI solutions. Continuously evaluate, test, and enhance the capabilities and performance of... 
    Suggested
    Work at office
    Remote work

    GeoDelphi

    Alexandria, VA
    4 days ago
  • $99k - $225k

     ...to expect during your journey as a candidate with us. Agentic AI Machine Learning Engineer The Opportunity: As an experienced machine learning engineer...  ...to APIs, Cloud platforms, or databases Experience evaluating LLM performance and building observation layers for stakeholders... 
    Full time
    Contract work
    Part time
    Work at office
    Local area
    Remote work

    Booz Allen Hamilton

    Washington DC
    5 days ago
  • $95.5k - $181.7k

     ...For more than 70 years, scientists and engineers in a wide range of disciplines at RTX BBN...  ...You Will Do: Design, develop, and deploy agentic AI systems capable of autonomous reasoning,...  ...scientists and engineers to design experiments, evaluate system performance, and produce high‑... 
    Full time
    Temporary work
    Work experience placement
    Work at office
    Remote work
    Worldwide
    Flexible hours

    RTX

    Arlington, VA
    2 days ago
  •  ...results for the government. We are currently seeking an Agentic AI Systems Engineer to join our team and fully embrace our commitment to deliver...  ..., and alignment with applicable standards. Design evaluation frameworks and test harnesses to assess quality, factuality... 
    Full time
    Temporary work
    Work experience placement
    Afternoon shift

    Corner Alliance

    Washington DC
    3 days ago
  • $86.8k - $198k

    Backend Infrastructure & Agentic AI Platforms - Software Development Engineer, Senior The Opportunity: To achieve an organization’s mission, leaders need strong...  ...usage, and LLM quality metrics and implement LLM evaluation pipelines including safety checks, regression... 
    Contract work
    Local area

    Booz Allen Hamilton

    Washington DC
    4 days ago
  • We build AI agents that actually work in enterprise environments — not prototypes, not demos. We need engineers who can own the entire agent stack: a production frontend, a robust backend...  ...technical architect and builder of agentic systems running in AWS, OCI, and Azure.... 
    Temporary work

    Trilagen

    Bethesda, MD
    2 days ago
  • $199.75k - $270.25k

     ...Job Family: Data Science and Data Engineering Job Qualifications: Skills: AI Agents, AI Ops, AI Systems, CI/CD,...  ...DESCRIPTION Iron EagleX is seeking a Senior Agentic AI/ML Engineer to support our AI...  ..., agentic workflow design, evaluation, integration, deployment, and... 
    Full time
    Contract work
    Temporary work
    Immediate start
    Remote work
    Worldwide
    Flexible hours

    General Dynamics Information Technology

    Arlington, VA
    1 day ago
  • Blueface Ltd in Washington seeks an experienced AI Evaluator to design and develop evaluation pipelines for conversational AI. The role involves defining metrics, conducting experiments, and ensuring high-quality AI solutions. The ideal candidate will have 5-7 years of... 

    Blueface Ltd

    Washington DC
    4 days ago
  • Phase2 Technology is seeking an experienced Agentic AI Engineer to design and deliver production-grade AI systems that demonstrate the existing potential of generative AI. You will work alongside data engineers and product owners to create applications that leverage advanced... 

    Phase2 Technology

    Arlington, VA
    1 day ago
  • $150k - $170k

    Danaher Corporation is seeking an AI Evaluation Engineer to join the Device Intelligence team. This remote role involves defining and executing AI evaluation strategies for cutting-edge systems used across Life Sciences and Diagnostics. The ideal candidate will have a... 
    Remote job

    Payfuture Technologies

    Washington DC
    1 day ago
  • $40 per hour

    A leading cybersecurity firm is looking for experienced cybersecurity professionals to evaluate AI-generated content and solve technical problems. The role involves working with advanced AI models, providing feedback, and contributing to the cybersecurity industry's future... 
    Hourly pay
    Remote work
    Flexible hours

    DataAnnotation

    Washington DC
    1 day ago
  • Phase2 Technology is seeking an experienced Software Development Engineer to advance AI-enabled systems, supporting clients at the Advanced Research Projects Agency for Health (ARPA-H). You'll build agentic workflows, design LLM integrations, and develop AI-powered... 

    Phase2 Technology

    Washington DC
    5 days ago
  • $142.65k - $213.98k

     ...an experienced professional to design and manage agent evaluation pipelines ensuring AI solutions meet user expectations. Responsibilities include...  ...metrics, conducting A/B testing, and mentoring junior engineers. Candidates should possess a Bachelor's degree and 5-7 years... 

    Comcast

    Washington DC
    1 day ago
  • Whitespace, Ltd. is seeking a creative Agentic AI Engineer to join our dynamic team in Alexandria, VA. This role focuses on building intelligent AI agents that navigate complex environments and make autonomous decisions. Responsibilities include architecting AI systems... 
    Remote job

    Whitespace, Ltd.

    Alexandria, VA
    4 days ago
  • $40 per hour

    A leading technology firm is seeking experienced cybersecurity professionals to evaluate AI-generated security content and provide feedback for improving AI systems. This remote position offers flexibility in project selection, with an hourly pay starting at $40+. Candidates... 
    Hourly pay
    Remote work

    DataAnnotation

    Washington DC
    4 days ago
  • A leading technology firm is seeking a skilled Agentic AI Engineer to develop advanced AI agents for cybersecurity. This role requires creativity, expertise in Python and agent frameworks, and a commitment to enhancing national security through innovative AI solutions.... 
    Remote job
    Work at office

    GeoDelphi

    Alexandria, VA
    4 days ago
  • $168.33k - $252.49k

     ...are looking for a Senior Software Engineer to lead the technical direction of our AI Agent initiatives. This is a...  ...production‑grade backend services and agentic workflows that solve complex, non...  ...operable over time. Testing & Evaluation of Probabilistic Systems - Design... 
    Worldwide

    Blueface

    Washington DC
    4 days ago
  • $130k - $150k

    BLEN Corp is seeking an AI Engineer in Washington, DC to design and build AI systems for federal and commercial clients. The role involves developing agentic systems, creating LLM-powered applications, and working closely with stakeholders. Ideal candidates should have... 
    Work from home

    BLEN Corp

    Washington DC
    5 days ago
  • $193.4k - $220.7k

    Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking...  ...model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc... 
    Full time
    Part time
    Local area

    Capital One National Association

    McLean, VA
    5 days ago
  • $99k - $225k

    Booz Allen Hamilton is seeking an experienced Agentic AI Machine Learning Engineer to design and implement AI systems that transform client operations in the Defense sector. Candidates should have over 3 years of machine learning experience, particularly in production-... 
    Remote job

    Booz Allen Hamilton

    Washington DC
    1 day ago
  • Corner Alliance is seeking an Agentic AI Systems Engineer to design and develop AI-enabled interfaces in Washington, D.C. The role requires expertise in software engineering, cloud application development, and AI integration. Responsibilities include constructing a secure... 

    Corner Alliance

    Washington DC
    3 days ago
  • $197.3k - $225.1k

    Overview Lead AI Engineer (GenAI Platform, AI Foundations, LLM Core and Agentic AI) At Capital One, we are creating responsible and reliable AI systems, changing...  ...inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability,... 
    Full time
    Part time
    Local area

    Capital One

    McLean, VA
    1 day ago
  • AI Engineer (Onsite) Design, develop, and support cloud-native automation and AI agent workflows...  .... Build AI orchestration and agentic workflows using Semantic Kernel, LangChain...  ...Implement observability, monitoring, and evaluation frameworks using LangSmith and Azure... 
    Temporary work

    Cognizant

    Washington DC
    3 days ago
  • $150k - $160k

    Platinum Technologies is seeking an AI Engineer to join our company. We are looking for an...  ...end-to-end AI solutions — RAG pipelines, agentic workflows, multi-modal applications, that...  .... Lead LLM-agnostic design decisions: evaluate and integrate models from OpenAI, Anthropic... 
    Immediate start
    Shift work

    Platinum Technologies

    Washington DC
    1 day ago
  •  ...shape their hybrid cloud and AI journeys. With support from our...  ...responsibilities As an AI Forward Deployed Engineer, you will work with customers...  ...deployment and adoption. Evaluate Model Performance: Assess the...  ...into real‑world applications Agentic AI & workflows: Experience... 
    Worldwide

    IBM Computing

    Alexandria, VA
    4 days ago
  • $99k - $225k

    Job Summary As an experienced engineer, you know that machine learning is critical to understanding...  ...by developing, deploying, and managing AI agents. You’ll be part of a talented team...  ...) architectures Experience with model evaluation and AI quality metrics Secret clearance... 
    Full time
    Contract work
    Part time
    Local area

    Booz Allen Hamilton

    Arlington, VA
    4 days ago
