Senior AI Engineer, Agentic Evaluation & V&V

$150k - $250k

Slingshot Aerospace

Senior AI Engineer, Agentic Evaluation & V&V

At Slingshot Aerospace, we're on a mission to make space safer and more secure for everyone. Our work directly impacts global security, disaster response, climate monitoring, and the critical infrastructure that connects our world. We're a team of builders, thinkers, and problem-solvers who believe that the next generation of space operations will be powered by better data and smarter software.

This role focuses on building and scaling evaluation frameworks, benchmarks, and simulation-backed validation systems for agentic AI systems, including multi-step, tool-using, and autonomous decision-making workflows powered by LLMs and reinforcement learning. Your work will directly support the development of reliable and trustworthy autonomous mission planning systems.

You will partner closely with AI researchers and domain experts to translate real-world mission concepts into structured, testable evaluation systems.

Your mission (should you choose to accept it)

Extend and maintain Slingshot's V&V SDK and evaluation framework for simulation-backed validation of agentic AI systems
Design and implement agent-level and end-to-end evaluations, including benchmark scenarios, scoring logic, and experiment harnesses
Build benchmark scenarios and tooling that measure planning, reasoning, and operational performance for autonomous mission planning systems
Translate astrodynamics and mission-domain concepts into executable evaluation scenarios and simulation configurations
Develop reusable SDK interfaces, adapters, and evaluation utilities that connect V&V systems, TALOS benchmarks, and agent workflows
Define and apply metrics for capability evaluation, failure analysis, regression detection, and comparative benchmarking
Partner with cross-functional teams to identify evaluation needs and contribute to improving coverage of critical capabilities
Contribute to best practices for evaluating complex, autonomous AI systems
Uphold strong engineering standards through testing, documentation, reproducibility, and maintainable system design

Pre-flight checklist

6+ years of experience in software engineering, machine learning engineering, applied AI, or equivalent experience
Strong Python engineering skills with experience building SDKs, libraries, or evaluation tooling
Experience designing evaluation frameworks, benchmarks, metrics, or test harnesses for AI/ML systems
Ability to analyze system behavior, identify failure modes, and evaluate performance in complex autonomous or semi-autonomous systems
Familiarity with modern agent frameworks, orchestration patterns, or protocol-based integrations
Experience working in cross-functional, multidisciplinary teams
Strong written and verbal communication skills
Bachelor's degree in a relevant science or engineering field, or equivalent experience
Must be a U.S. citizen and eligible to obtain and maintain a government security clearance

Bonus cargo

Experience in autonomous systems such as self-driving or ADAS, including perception, planning, simulation, or safety validation
Experience developing or evaluating agentic AI systems, including multi-step, tool-using, or autonomous workflows (e.g., LLM-based agents, planning agents, or reinforcement learning approaches)
Experience with reinforcement learning systems and simulation-based evaluation
Familiarity with benchmark design, experiment tracking, and trace-based evaluation workflows
Experience with orchestration frameworks such as LangGraph or similar tools
Knowledge of astrodynamics, orbital mechanics, or spacecraft mission planning
Experience translating mission or operational concepts into measurable evaluation scenarios
Familiarity with physics-based simulation, trajectory analysis, or space-domain modeling
Experience with observability and experiment tooling such as MLflow, Opik, or similar platforms
Experience transitioning advanced research systems into production environments

We're building a constellation here, not looking for identical satellites. Every member of the team brings different capabilities to the same mission. If your orbit intersects with ours and you're mission-ready, send it.

Location: Remote, US

Salary: $150,000-$250,000

Classification: Full time Exempt (learned professional exemption)

Equity, Diversity & Inclusion are key to our success. We are an Equal Opportunity Employer and our employees are people with different strengths, experiences, and backgrounds, who share a passion for creating a safer, more connected world. Diversity not only includes race and gender identity, but also national origin, citizenship, sex, color, veteran status, disability, genetic information, or any other protected characteristic that is part of one's identity. All of our employees' points of view are key to our success, and we embrace individuality.

Apply

Vacancy posted 8 hours ago

Similar jobs that could be interesting for youBased on the Senior AI Engineer, Agentic Evaluation & V&V in United States vacancy

Senior AI Engineer: Agentic Evaluation & V&V for Autonomy
Slingshot Aerospace is looking for a Senior AI Engineer to focus on Agentic Evaluation and V&V. The role involves building evaluation frameworks and simulation systems for autonomous AI. Candidates must have 6+ years of experience in software or ML engineering, strong Python...
Senior
Full time
Remote work
Slingshot Aerospace
Phoenix, AZ
4 days ago
Agentic AI Benchmarking and Evaluation Engineer
$158.4k - $237.6k
...Qualcomm Technologies, Inc. Job Area Engineering Group, Machine Learning Engineering General Summary Qualcomm AI Research looking for talented ML/AI algorithm evaluation engineers with experience in... ...to enable embedded GenAI and Agentic AI. Come join a high-caliber team...
Suggested
Work experience placement
Work from home
Qualcomm
San Diego, CA
2 days ago
Senior AI Native Software Engineer (Agentic AI)
...Title: Senior AI Native Software Engineer (Agentic AI) Location: Remote with up to 20% travel required (travel will be reimbursed) Job Description... ...invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability...
Senior
Remote work
RIT Solutions, Inc.
United States
5 days ago
Senior Agentic AI Engineer
...Senior Agentic AI Engineer Rayda is an asset lifecycle management company, enabling businesses to acquire, manage, track and dispose of their... ...platforms, and third-party tools. Implement agent evaluation, monitoring, observability, and guardrail mechanisms to ensure...
Senior
Full time
Work experience placement
Remote work
Rayda
United States
2 days ago
Senior Agentic AI Engineer
$300k - $400k
...Senior Agentic AI Engineer $300,000 - $400,000 Remote / Hybrid, Palo Alto Full time / Permanent A well-known, frontier GenAI... ...execution, scheduling, and goal-directed workflows Create evaluation frameworks and metrics that measure agent quality,...
Senior
Permanent employment
Full time
Remote work
Shift work
DeepRec.ai
Palo Alto, CA
2 days ago
Senior AI Engineer, Agentic Systems
...applications and next steps. Our partner is looking for a Senior AI Engineer, Agentic Systems based in United States. This is a high-impact engineering... ...such as planners, tool registries, and policy guards Evaluate and apply agentic frameworks such as LangGraph, LangChain...
Senior
Remote job
Full time
Flexible hours
jobgether
United States
3 days ago
Senior Agentic AI Engineer
...experts can help you find the best job for you. Role: Senior Agentic AI Engineer (Palo Alto Networks Ecosystem) Location: Palo Alto, CA... ...a final response. Agentic Validation: Build "Evaluation LLM-as-a-Judge" frameworks to measure agent accuracy, latency...
Senior
Permanent employment
Contract work
Remote work
Tekfortune Inc
Palo Alto, CA
15 hours ago
Senior Agentic AI Engineer
...Revolutionize the Future of Logistics as Our Next Senior Agentic AI Engineer Are you ready to redefine how global freight moves by building... ...embedding pipelines, custom chunkers, and advanced retrieval evaluators. Lead the integration of intelligent agent services...
Senior
Local area
Remote work
Worldwide
Trimble
United States
15 hours ago
Senior Agentic Ai Engineer
...Senior Agentic AI Engineer Build and Scale the AI Layer of the Business Rebel Athletic is seeking a highly experienced Agentic AI Engineer... ..., APIs, and event-driven systems Define and implement evaluation frameworks, guardrails, and monitoring for AI systems...
Senior
Rebel Athletic
Carrollton, TX
2 days ago
Senior AI Engineer - LLM & Agentic Systems (Python)
...Senior AI Engineer LLM & Agentic Systems (Python) Remote Role Overview We are seeking a senior AI engineer to design and build production... ..., backend services, and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy,...
Senior
Remote work
RIT Solutions Inc/ Tech Dev IT/ Texperts Inc/ConceptsIT, Inc...
Atlanta, GA
1 day ago
Senior AI Engineer, Agentic Interactions
...Senior AI Engineer, Agentic Interactions Pittsburgh, Pennsylvania, United States Company Description Govini transforms Defense Acquisition... ...OpenAI Agent SDK Experience building observability, evaluation, and feedback loops for agent behavior (telemetry, prompt...
Senior
Full time
Work at office
Govini
Pittsburgh, PA
15 hours ago
Senior Agentic AI Engineer
$150k - $180k
...ARE Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that leverages... ...Role Snapshot We’re hiring a hands-on Agentic AI engineer to build bidder-adjacent agents that improve... ...-loop ACE (Observe → Reason → Act → Evaluate). This is not a classic “train a model...
Senior
Zeta Global
San Francisco, CA
15 hours ago
Senior AI Engineer: Agentic AI for Production
Teradata Corporation (SE) is seeking an AI Engineer in Phoenix, Arizona, to build agentic systems for AI-driven analytics. Candidates should have 3-6 years... ...designing agent skills, improving workflows, and evaluating agent performance. Proficiency in Python and understanding...
Senior
Flexible hours
Teradata Corporation (SE)
Phoenix, AZ
15 hours ago
Senior Agentic (AI) Engineer
...Worth AI is hiring a Senior Agentic AI Engineer to design and ship production agent systems that automate KYB, underwriting, and risk decisions on... ...Real eval experience golden sets, offline and online evaluations, used to make ship/no-ship calls. ~ Production MLOps...
Senior
Remote work
Flexible hours
Worth AI
United States
2 days ago
Senior, Agentic AI Engineer
...Senior AI Agentic Engineer The Senior AI Agentic Engineer designs, builds, and operationalizes intelligent agent systems that automate complex... ...to improve grounding and reduce hallucination. Evaluate and continuously tune RAG systems for accuracy, latency, factual...
Senior
Work experience placement
American Bureau of Shipping
Houston, TX
3 days ago
Senior AI Engineer — Build Production Agentic Analytics
...California where you will develop AI-driven analytics systems as part of... ...role requires 3-6 years of software engineering experience, specifically in building agentic AI systems in production. You will design, implement, and evaluate agent skills, while collaborating with...
Senior
Flexible hours
Teradata Group
California, MO
1 day ago
Senior Agentic AI Engineer
$180k - $240k
...About the role We are looking for a Senior Agentic AI Engineer to join our team. You will be at the forefront of designing, developing,... ...platforms, internal APIs, and legal workflow systems. Evaluate third-party models, frameworks, and services to determine...
Senior
Full time
Contract work
Work at office
Remote work
Worldwide
Flexible hours
LegalOn Technologies
San Francisco, CA
3 days ago
Senior AI Engineer I - Agentic AI
$123k - $215.25k
...Senior AI Engineer I - Agentic AI New York, NY, United States Charlotte, NC, United States Palo Alto, CA, United States Phoenix, AZ,... ...including LLM services, agent orchestration frameworks, and evaluation and monitoring tooling, that scales agentic development...
Senior
Full time
Work at office
Local area
Remote work
Visa sponsorship
Flexible hours
Shift work
American Express
Phoenix, AZ
4 days ago
Senior AI Engineer — Flexible Work & Agentic Analytics
Teradata Corporation (SE) is looking for an experienced AI Engineer to build intelligent systems that enhance data analytics. You will design, implement, and evaluate agentic AI solutions aiming for superior performance. The ideal candidate will have robust software engineering...
Senior
Flexible hours
Teradata Corporation (SE)
Des Moines, IA
3 days ago
Senior AI Engineer — Flexible Work & Agentic Analytics
...Hartford, CT, is seeking a software engineer with 3-6 years of experience to develop AI-driven analytics systems. The... ...designing agent skills, building evaluation frameworks, and ensuring the performance... ...in Python and experience in agentic AI systems. Teradata promotes a...
Senior
Flexible hours
Teradata Corporation (SE)
Hartford, CT
3 days ago
Senior AI Engineer — Flexible Work & Agentic Analytics
...seeking an experienced candidate to build agentic systems that enhance AI-driven analytics. The role involves designing agent skills and evaluating system performance to solve customer... ...have hands-on experience in software engineering with a strong foundation in Python, a...
Senior
Flexible hours
Teradata Corporation (SE)
Annapolis, MD
2 days ago
Senior AI Agentic Engineer
...Title: Senior AI Agentic Engineer Location: Spring, TX 77389 (hybrid: 3 days onsite / 2 days remote) Duration: Direct Hire Work Requirements... ...to improve grounding and reduce hallucination. Evaluate and continuously tune RAG systems for accuracy, latency, factual...
Senior
Work experience placement
Local area
Remote work
Flexible hours
INSPYR Solutions
United States
9 hours ago
Senior AI Agentic Engineer
...Senior AI Agentic Engineer ***This position is 5 days a week in the Charlotte Office***. The Senior AI Agentic Engineer is a senior production... ...context management, retrieval-augmented generation (RAG), evaluation, observability, and governed deployment. The engineer...
Senior
Permanent employment
Full time
Part time
H1b
Work at office
Work visa
SunTrust Investment Services, Inc.
Charlotte, NC
15 hours ago
Senior Staff Agentic AI Engineer
...mighty team works with senior leaders and partners across... ...strategic and rigorous engineering leader who is... ...innovation and approaches AI agent design as an engineering... ...for the Problem: Evaluate business problems and... ...workflows, ensuring that agentic hallucinations are...
Senior
Temporary work
Local area
Intuit
San Diego, CA
4 days ago
Senior AI Engineer - Agentic Systems
...their hybrid cloud and AI journeys. With support... ...are seeking a skilled AI Engineer to join an active... ...facing capacity. This is a senior individual contributor... ...technical proficiency in Agentic AI — you will be expected... ...Establish monitoring, evaluation, and feedback loops to...
Senior
Worldwide
IBM
Chicago, IL
15 hours ago
Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services)
$229.9k - $262.4k
...Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services) Overview: At Capital One, we are creating responsible and reliable AI systems,... ...model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage...
Senior
Full time
Part time
Local area
Capital One
San Jose, CA
11 days ago
Senior AI Engineer Agentic Systems and LLM
...Role: Senior AI Engineer - Agentic Systems and LLM Client Location: Mason, OH 100% Remote Job Description: We are seeking a senior AI engineer... ...APIs, backend services, and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy,...
Senior
Remote work
Vytwo
Prosper, TX
2 days ago
Senior Agentic AI Software Engineer - US Hybrid
...their full potential. About the Role We are seeking a Senior Agentic AI Software Engineer to design, build, and operate production-grade AI agent... ...and integration with existing services, to reliability, evaluation, and long-term operation. You’ll work closely with...
Senior
Full time
Work at office
Work from home
Siemens AG
Charlotte, NC
15 hours ago
Senior ML/AI Engineer, Agentic Intelligence
...landscapes. You'll develop the models and agentic architectures that power demand... ...largest retailers. This is applied AI at its most impactful. You'll work at... ...: data preprocessing, feature engineering, model training, evaluation, and production deployment Architect...
Senior
Immediate start
Home office
Visa sponsorship
Flexible hours
3 days per week
Merciv
New York, NY
2 days ago
Senior Agentic AI Software Engineer - Hybrid US
Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since: 04-Mar-2026 Organization: Siemens Industry Software Inc. Job type:... ...and integration with existing services, to reliability, evaluation, and long-term operation. You’ll work closely with...
Senior
Permanent employment
Full time
Work at office
Local area
Remote work
Work from home
Siemens Mobility
Wilsonville, OR
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Engineer, Agentic Evaluation & V&V. Be the first to apply!