Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Engineer, Agentic Evaluation & V&V

$150k - $250k

Slingshot Aerospace

Senior AI Engineer, Agentic Evaluation & V&V

At Slingshot Aerospace, we're on a mission to make space safer and more secure for everyone. Our work directly impacts global security, disaster response, climate monitoring, and the critical infrastructure that connects our world. We're a team of builders, thinkers, and problem-solvers who believe that the next generation of space operations will be powered by better data and smarter software.

This role focuses on building and scaling evaluation frameworks, benchmarks, and simulation-backed validation systems for agentic AI systems, including multi-step, tool-using, and autonomous decision-making workflows powered by LLMs and reinforcement learning. Your work will directly support the development of reliable and trustworthy autonomous mission planning systems.

You will partner closely with AI researchers and domain experts to translate real-world mission concepts into structured, testable evaluation systems.

Your mission (should you choose to accept it)

  • Extend and maintain Slingshot's V&V SDK and evaluation framework for simulation-backed validation of agentic AI systems
  • Design and implement agent-level and end-to-end evaluations, including benchmark scenarios, scoring logic, and experiment harnesses
  • Build benchmark scenarios and tooling that measure planning, reasoning, and operational performance for autonomous mission planning systems
  • Translate astrodynamics and mission-domain concepts into executable evaluation scenarios and simulation configurations
  • Develop reusable SDK interfaces, adapters, and evaluation utilities that connect V&V systems, TALOS benchmarks, and agent workflows
  • Define and apply metrics for capability evaluation, failure analysis, regression detection, and comparative benchmarking
  • Partner with cross-functional teams to identify evaluation needs and contribute to improving coverage of critical capabilities
  • Contribute to best practices for evaluating complex, autonomous AI systems
  • Uphold strong engineering standards through testing, documentation, reproducibility, and maintainable system design

Pre-flight checklist

  • 6+ years of experience in software engineering, machine learning engineering, applied AI, or equivalent experience
  • Strong Python engineering skills with experience building SDKs, libraries, or evaluation tooling
  • Experience designing evaluation frameworks, benchmarks, metrics, or test harnesses for AI/ML systems
  • Ability to analyze system behavior, identify failure modes, and evaluate performance in complex autonomous or semi-autonomous systems
  • Familiarity with modern agent frameworks, orchestration patterns, or protocol-based integrations
  • Experience working in cross-functional, multidisciplinary teams
  • Strong written and verbal communication skills
  • Bachelor's degree in a relevant science or engineering field, or equivalent experience
  • Must be a U.S. citizen and eligible to obtain and maintain a government security clearance

Bonus cargo

  • Experience in autonomous systems such as self-driving or ADAS, including perception, planning, simulation, or safety validation
  • Experience developing or evaluating agentic AI systems, including multi-step, tool-using, or autonomous workflows (e.g., LLM-based agents, planning agents, or reinforcement learning approaches)
  • Experience with reinforcement learning systems and simulation-based evaluation
  • Familiarity with benchmark design, experiment tracking, and trace-based evaluation workflows
  • Experience with orchestration frameworks such as LangGraph or similar tools
  • Knowledge of astrodynamics, orbital mechanics, or spacecraft mission planning
  • Experience translating mission or operational concepts into measurable evaluation scenarios
  • Familiarity with physics-based simulation, trajectory analysis, or space-domain modeling
  • Experience with observability and experiment tooling such as MLflow, Opik, or similar platforms
  • Experience transitioning advanced research systems into production environments

We're building a constellation here, not looking for identical satellites. Every member of the team brings different capabilities to the same mission. If your orbit intersects with ours and you're mission-ready, send it.

Location: Remote, US

Salary: $150,000-$250,000

Classification: Full time Exempt (learned professional exemption)

Equity, Diversity & Inclusion are key to our success. We are an Equal Opportunity Employer and our employees are people with different strengths, experiences, and backgrounds, who share a passion for creating a safer, more connected world. Diversity not only includes race and gender identity, but also national origin, citizenship, sex, color, veteran status, disability, genetic information, or any other protected characteristic that is part of one's identity. All of our employees' points of view are key to our success, and we embrace individuality.

Vacancy posted 8 hours ago
Similar jobs that could be interesting for youBased on the Senior AI Engineer, Agentic Evaluation & V&V in United States vacancy
  • Slingshot Aerospace is looking for a Senior AI Engineer to focus on Agentic Evaluation and V&V. The role involves building evaluation frameworks and simulation systems for autonomous AI. Candidates must have 6+ years of experience in software or ML engineering, strong Python... 
    Senior
    Full time
    Remote work

    Slingshot Aerospace

    Phoenix, AZ
    4 days ago
  • $158.4k - $237.6k

     ...Qualcomm Technologies, Inc. Job Area Engineering Group, Machine Learning Engineering General Summary Qualcomm AI Research looking for talented ML/AI algorithm evaluation engineers with experience in...  ...to enable embedded GenAI and Agentic AI. Come join a high-caliber team... 
    Suggested
    Work experience placement
    Work from home

    Qualcomm

    San Diego, CA
    2 days ago
  •  ...Title: Senior AI Native Software Engineer (Agentic AI) Location: Remote with up to 20% travel required (travel will be reimbursed) Job Description...  ...invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability... 
    Senior
    Remote work

    RIT Solutions, Inc.

    United States
    5 days ago
  •  ...Senior Agentic AI Engineer Rayda is an asset lifecycle management company, enabling businesses to acquire, manage, track and dispose of their...  ...platforms, and third-party tools. Implement agent evaluation, monitoring, observability, and guardrail mechanisms to ensure... 
    Senior
    Full time
    Work experience placement
    Remote work

    Rayda

    United States
    2 days ago
  • $300k - $400k

     ...Senior Agentic AI Engineer $300,000 - $400,000 Remote / Hybrid, Palo Alto Full time / Permanent A well-known, frontier GenAI...  ...execution, scheduling, and goal-directed workflows Create evaluation frameworks and metrics that measure agent quality,... 
    Senior
    Permanent employment
    Full time
    Remote work
    Shift work

    DeepRec.ai

    Palo Alto, CA
    2 days ago
  •  ...applications and next steps. Our partner is looking for a Senior AI Engineer, Agentic Systems based in United States. This is a high-impact engineering...  ...such as planners, tool registries, and policy guards Evaluate and apply agentic frameworks such as LangGraph, LangChain... 
    Senior
    Remote job
    Full time
    Flexible hours

    jobgether

    United States
    3 days ago
  •  ...experts can help you find the best job for you. Role: Senior Agentic AI Engineer (Palo Alto Networks Ecosystem) Location: Palo Alto, CA...  ...a final response. Agentic Validation: Build "Evaluation LLM-as-a-Judge" frameworks to measure agent accuracy, latency... 
    Senior
    Permanent employment
    Contract work
    Remote work

    Tekfortune Inc

    Palo Alto, CA
    15 hours ago
  •  ...Revolutionize the Future of Logistics as Our Next Senior Agentic AI Engineer Are you ready to redefine how global freight moves by building...  ...embedding pipelines, custom chunkers, and advanced retrieval evaluators. Lead the integration of intelligent agent services... 
    Senior
    Local area
    Remote work
    Worldwide

    Trimble

    United States
    15 hours ago
  •  ...Senior Agentic AI Engineer Build and Scale the AI Layer of the Business Rebel Athletic is seeking a highly experienced Agentic AI Engineer...  ..., APIs, and event-driven systems Define and implement evaluation frameworks, guardrails, and monitoring for AI systems... 
    Senior

    Rebel Athletic

    Carrollton, TX
    2 days ago
  •  ...Senior AI Engineer LLM & Agentic Systems (Python) Remote Role Overview We are seeking a senior AI engineer to design and build production...  ..., backend services, and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy,... 
    Senior
    Remote work

    RIT Solutions Inc/ Tech Dev IT/ Texperts Inc/ConceptsIT, Inc...

    Atlanta, GA
    1 day ago
  •  ...Senior AI Engineer, Agentic Interactions Pittsburgh, Pennsylvania, United States Company Description Govini transforms Defense Acquisition...  ...OpenAI Agent SDK Experience building observability, evaluation, and feedback loops for agent behavior (telemetry, prompt... 
    Senior
    Full time
    Work at office

    Govini

    Pittsburgh, PA
    15 hours ago
  • $150k - $180k

     ...ARE Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that leverages...  ...Role Snapshot We’re hiring a hands-on Agentic AI engineer to build bidder-adjacent agents that improve...  ...-loop ACE (Observe → Reason → Act → Evaluate). This is not a classic “train a model... 
    Senior

    Zeta Global

    San Francisco, CA
    15 hours ago
  • Teradata Corporation (SE) is seeking an AI Engineer in Phoenix, Arizona, to build agentic systems for AI-driven analytics. Candidates should have 3-6 years...  ...designing agent skills, improving workflows, and evaluating agent performance. Proficiency in Python and understanding... 
    Senior
    Flexible hours

    Teradata Corporation (SE)

    Phoenix, AZ
    15 hours ago
  •  ...Worth AI is hiring a Senior Agentic AI Engineer to design and ship production agent systems that automate KYB, underwriting, and risk decisions on...  ...Real eval experience golden sets, offline and online evaluations, used to make ship/no-ship calls. ~ Production MLOps... 
    Senior
    Remote work
    Flexible hours

    Worth AI

    United States
    2 days ago
  •  ...Senior AI Agentic Engineer The Senior AI Agentic Engineer designs, builds, and operationalizes intelligent agent systems that automate complex...  ...to improve grounding and reduce hallucination. Evaluate and continuously tune RAG systems for accuracy, latency, factual... 
    Senior
    Work experience placement

    American Bureau of Shipping

    Houston, TX
    3 days ago
  •  ...California where you will develop AI-driven analytics systems as part of...  ...role requires 3-6 years of software engineering experience, specifically in building agentic AI systems in production. You will design, implement, and evaluate agent skills, while collaborating with... 
    Senior
    Flexible hours

    Teradata Group

    California, MO
    1 day ago
  • $180k - $240k

     ...About the role We are looking for a Senior Agentic AI Engineer to join our team. You will be at the forefront of designing, developing,...  ...platforms, internal APIs, and legal workflow systems. Evaluate third-party models, frameworks, and services to determine... 
    Senior
    Full time
    Contract work
    Work at office
    Remote work
    Worldwide
    Flexible hours

    LegalOn Technologies

    San Francisco, CA
    3 days ago
  • $123k - $215.25k

     ...Senior AI Engineer I - Agentic AI New York, NY, United States Charlotte, NC, United States Palo Alto, CA, United States Phoenix, AZ,...  ...including LLM services, agent orchestration frameworks, and evaluation and monitoring tooling, that scales agentic development... 
    Senior
    Full time
    Work at office
    Local area
    Remote work
    Visa sponsorship
    Flexible hours
    Shift work

    American Express

    Phoenix, AZ
    4 days ago
  • Teradata Corporation (SE) is looking for an experienced AI Engineer to build intelligent systems that enhance data analytics. You will design, implement, and evaluate agentic AI solutions aiming for superior performance. The ideal candidate will have robust software engineering... 
    Senior
    Flexible hours

    Teradata Corporation (SE)

    Des Moines, IA
    3 days ago
  •  ...Hartford, CT, is seeking a software engineer with 3-6 years of experience to develop AI-driven analytics systems. The...  ...designing agent skills, building evaluation frameworks, and ensuring the performance...  ...in Python and experience in agentic AI systems. Teradata promotes a... 
    Senior
    Flexible hours

    Teradata Corporation (SE)

    Hartford, CT
    3 days ago
  •  ...seeking an experienced candidate to build agentic systems that enhance AI-driven analytics. The role involves designing agent skills and evaluating system performance to solve customer...  ...have hands-on experience in software engineering with a strong foundation in Python, a... 
    Senior
    Flexible hours

    Teradata Corporation (SE)

    Annapolis, MD
    2 days ago
  •  ...Title: Senior AI Agentic Engineer Location: Spring, TX 77389 (hybrid: 3 days onsite / 2 days remote) Duration: Direct Hire Work Requirements...  ...to improve grounding and reduce hallucination. Evaluate and continuously tune RAG systems for accuracy, latency, factual... 
    Senior
    Work experience placement
    Local area
    Remote work
    Flexible hours

    INSPYR Solutions

    United States
    9 hours ago
  •  ...Senior AI Agentic Engineer ***This position is 5 days a week in the Charlotte Office***. The Senior AI Agentic Engineer is a senior production...  ...context management, retrieval-augmented generation (RAG), evaluation, observability, and governed deployment. The engineer... 
    Senior
    Permanent employment
    Full time
    Part time
    H1b
    Work at office
    Work visa

    SunTrust Investment Services, Inc.

    Charlotte, NC
    15 hours ago
  •  ...mighty team works with senior leaders and partners across...  ...strategic and rigorous engineering leader who is...  ...innovation and approaches AI agent design as an engineering...  ...for the Problem: Evaluate business problems and...  ...workflows, ensuring that agentic hallucinations are... 
    Senior
    Temporary work
    Local area

    Intuit

    San Diego, CA
    4 days ago
  •  ...their hybrid cloud and AI journeys. With support...  ...are seeking a skilled AI Engineer to join an active...  ...facing capacity. This is a senior individual contributor...  ...technical proficiency in Agentic AI — you will be expected...  ...Establish monitoring, evaluation, and feedback loops to... 
    Senior
    Worldwide

    IBM

    Chicago, IL
    15 hours ago
  • $229.9k - $262.4k

     ...Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services) Overview: At Capital One, we are creating responsible and reliable AI systems,...  ...model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage... 
    Senior
    Full time
    Part time
    Local area

    Capital One

    San Jose, CA
    11 days ago
  •  ...Role: Senior AI Engineer - Agentic Systems and LLM Client Location: Mason, OH 100% Remote Job Description: We are seeking a senior AI engineer...  ...APIs, backend services, and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy,... 
    Senior
    Remote work

    Vytwo

    Prosper, TX
    2 days ago
  •  ...their full potential. About the Role We are seeking a Senior Agentic AI Software Engineer to design, build, and operate production-grade AI agent...  ...and integration with existing services, to reliability, evaluation, and long-term operation. You’ll work closely with... 
    Senior
    Full time
    Work at office
    Work from home

    Siemens AG

    Charlotte, NC
    15 hours ago
  •  ...landscapes. You'll develop the models and agentic architectures that power demand...  ...largest retailers. This is applied AI at its most impactful. You'll work at...  ...: data preprocessing, feature engineering, model training, evaluation, and production deployment Architect... 
    Senior
    Immediate start
    Home office
    Visa sponsorship
    Flexible hours
    3 days per week

    Merciv

    New York, NY
    2 days ago
  • Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since: 04-Mar-2026 Organization: Siemens Industry Software Inc. Job type:...  ...and integration with existing services, to reliability, evaluation, and long-term operation. You’ll work closely with... 
    Senior
    Permanent employment
    Full time
    Work at office
    Local area
    Remote work
    Work from home

    Siemens Mobility

    Wilsonville, OR
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Engineer, Agentic Evaluation & V&V. Be the first to apply!