Senior AI Engineer, Agentic Evaluation & V&V
$150k - $250kSlingshot Aerospace
Senior AI Engineer, Agentic Evaluation & V&V
At Slingshot Aerospace, we're on a mission to make space safer and more secure for everyone. Our work directly impacts global security, disaster response, climate monitoring, and the critical infrastructure that connects our world. We're a team of builders, thinkers, and problem-solvers who believe that the next generation of space operations will be powered by better data and smarter software.
This role focuses on building and scaling evaluation frameworks, benchmarks, and simulation-backed validation systems for agentic AI systems, including multi-step, tool-using, and autonomous decision-making workflows powered by LLMs and reinforcement learning. Your work will directly support the development of reliable and trustworthy autonomous mission planning systems.
You will partner closely with AI researchers and domain experts to translate real-world mission concepts into structured, testable evaluation systems.
Your mission (should you choose to accept it)
- Extend and maintain Slingshot's V&V SDK and evaluation framework for simulation-backed validation of agentic AI systems
- Design and implement agent-level and end-to-end evaluations, including benchmark scenarios, scoring logic, and experiment harnesses
- Build benchmark scenarios and tooling that measure planning, reasoning, and operational performance for autonomous mission planning systems
- Translate astrodynamics and mission-domain concepts into executable evaluation scenarios and simulation configurations
- Develop reusable SDK interfaces, adapters, and evaluation utilities that connect V&V systems, TALOS benchmarks, and agent workflows
- Define and apply metrics for capability evaluation, failure analysis, regression detection, and comparative benchmarking
- Partner with cross-functional teams to identify evaluation needs and contribute to improving coverage of critical capabilities
- Contribute to best practices for evaluating complex, autonomous AI systems
- Uphold strong engineering standards through testing, documentation, reproducibility, and maintainable system design
Pre-flight checklist
- 6+ years of experience in software engineering, machine learning engineering, applied AI, or equivalent experience
- Strong Python engineering skills with experience building SDKs, libraries, or evaluation tooling
- Experience designing evaluation frameworks, benchmarks, metrics, or test harnesses for AI/ML systems
- Ability to analyze system behavior, identify failure modes, and evaluate performance in complex autonomous or semi-autonomous systems
- Familiarity with modern agent frameworks, orchestration patterns, or protocol-based integrations
- Experience working in cross-functional, multidisciplinary teams
- Strong written and verbal communication skills
- Bachelor's degree in a relevant science or engineering field, or equivalent experience
- Must be a U.S. citizen and eligible to obtain and maintain a government security clearance
Bonus cargo
- Experience in autonomous systems such as self-driving or ADAS, including perception, planning, simulation, or safety validation
- Experience developing or evaluating agentic AI systems, including multi-step, tool-using, or autonomous workflows (e.g., LLM-based agents, planning agents, or reinforcement learning approaches)
- Experience with reinforcement learning systems and simulation-based evaluation
- Familiarity with benchmark design, experiment tracking, and trace-based evaluation workflows
- Experience with orchestration frameworks such as LangGraph or similar tools
- Knowledge of astrodynamics, orbital mechanics, or spacecraft mission planning
- Experience translating mission or operational concepts into measurable evaluation scenarios
- Familiarity with physics-based simulation, trajectory analysis, or space-domain modeling
- Experience with observability and experiment tooling such as MLflow, Opik, or similar platforms
- Experience transitioning advanced research systems into production environments
We're building a constellation here, not looking for identical satellites. Every member of the team brings different capabilities to the same mission. If your orbit intersects with ours and you're mission-ready, send it.
Location: Remote, US
Salary: $150,000-$250,000
Classification: Full time Exempt (learned professional exemption)
Equity, Diversity & Inclusion are key to our success. We are an Equal Opportunity Employer and our employees are people with different strengths, experiences, and backgrounds, who share a passion for creating a safer, more connected world. Diversity not only includes race and gender identity, but also national origin, citizenship, sex, color, veteran status, disability, genetic information, or any other protected characteristic that is part of one's identity. All of our employees' points of view are key to our success, and we embrace individuality.
- Slingshot Aerospace is looking for a Senior AI Engineer to focus on Agentic Evaluation and V&V. The role involves building evaluation frameworks and simulation systems for autonomous AI. Candidates must have 6+ years of experience in software or ML engineering, strong Python...SeniorFull timeRemote work
$158.4k - $237.6k
...Qualcomm Technologies, Inc. Job Area Engineering Group, Machine Learning Engineering General Summary Qualcomm AI Research looking for talented ML/AI algorithm evaluation engineers with experience in... ...to enable embedded GenAI and Agentic AI. Come join a high-caliber team...SuggestedWork experience placementWork from home- ...Title: Senior AI Native Software Engineer (Agentic AI) Location: Remote with up to 20% travel required (travel will be reimbursed) Job Description... ...invocation Policy-based routing Build evaluation frameworks for accuracy, latency, and reliability...SeniorRemote work
- ...Senior Agentic AI Engineer Rayda is an asset lifecycle management company, enabling businesses to acquire, manage, track and dispose of their... ...platforms, and third-party tools. Implement agent evaluation, monitoring, observability, and guardrail mechanisms to ensure...SeniorFull timeWork experience placementRemote work
$300k - $400k
...Senior Agentic AI Engineer $300,000 - $400,000 Remote / Hybrid, Palo Alto Full time / Permanent A well-known, frontier GenAI... ...execution, scheduling, and goal-directed workflows Create evaluation frameworks and metrics that measure agent quality,...SeniorPermanent employmentFull timeRemote workShift work- ...applications and next steps. Our partner is looking for a Senior AI Engineer, Agentic Systems based in United States. This is a high-impact engineering... ...such as planners, tool registries, and policy guards Evaluate and apply agentic frameworks such as LangGraph, LangChain...SeniorRemote jobFull timeFlexible hours
- ...experts can help you find the best job for you. Role: Senior Agentic AI Engineer (Palo Alto Networks Ecosystem) Location: Palo Alto, CA... ...a final response. Agentic Validation: Build "Evaluation LLM-as-a-Judge" frameworks to measure agent accuracy, latency...SeniorPermanent employmentContract workRemote work
- ...Revolutionize the Future of Logistics as Our Next Senior Agentic AI Engineer Are you ready to redefine how global freight moves by building... ...embedding pipelines, custom chunkers, and advanced retrieval evaluators. Lead the integration of intelligent agent services...SeniorLocal areaRemote workWorldwide
- ...Senior Agentic AI Engineer Build and Scale the AI Layer of the Business Rebel Athletic is seeking a highly experienced Agentic AI Engineer... ..., APIs, and event-driven systems Define and implement evaluation frameworks, guardrails, and monitoring for AI systems...Senior
- ...Senior AI Engineer LLM & Agentic Systems (Python) Remote Role Overview We are seeking a senior AI engineer to design and build production... ..., backend services, and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy,...SeniorRemote work
- ...Senior AI Engineer, Agentic Interactions Pittsburgh, Pennsylvania, United States Company Description Govini transforms Defense Acquisition... ...OpenAI Agent SDK Experience building observability, evaluation, and feedback loops for agent behavior (telemetry, prompt...SeniorFull timeWork at office
$150k - $180k
...ARE Zeta Global (NYSE: ZETA) is the AI-Powered Marketing Cloud that leverages... ...Role Snapshot We’re hiring a hands-on Agentic AI engineer to build bidder-adjacent agents that improve... ...-loop ACE (Observe → Reason → Act → Evaluate). This is not a classic “train a model...Senior- Teradata Corporation (SE) is seeking an AI Engineer in Phoenix, Arizona, to build agentic systems for AI-driven analytics. Candidates should have 3-6 years... ...designing agent skills, improving workflows, and evaluating agent performance. Proficiency in Python and understanding...SeniorFlexible hours
- ...Worth AI is hiring a Senior Agentic AI Engineer to design and ship production agent systems that automate KYB, underwriting, and risk decisions on... ...Real eval experience golden sets, offline and online evaluations, used to make ship/no-ship calls. ~ Production MLOps...SeniorRemote workFlexible hours
- ...Senior AI Agentic Engineer The Senior AI Agentic Engineer designs, builds, and operationalizes intelligent agent systems that automate complex... ...to improve grounding and reduce hallucination. Evaluate and continuously tune RAG systems for accuracy, latency, factual...SeniorWork experience placement
- ...California where you will develop AI-driven analytics systems as part of... ...role requires 3-6 years of software engineering experience, specifically in building agentic AI systems in production. You will design, implement, and evaluate agent skills, while collaborating with...SeniorFlexible hours
$180k - $240k
...About the role We are looking for a Senior Agentic AI Engineer to join our team. You will be at the forefront of designing, developing,... ...platforms, internal APIs, and legal workflow systems. Evaluate third-party models, frameworks, and services to determine...SeniorFull timeContract workWork at officeRemote workWorldwideFlexible hours$123k - $215.25k
...Senior AI Engineer I - Agentic AI New York, NY, United States Charlotte, NC, United States Palo Alto, CA, United States Phoenix, AZ,... ...including LLM services, agent orchestration frameworks, and evaluation and monitoring tooling, that scales agentic development...SeniorFull timeWork at officeLocal areaRemote workVisa sponsorshipFlexible hoursShift work- Teradata Corporation (SE) is looking for an experienced AI Engineer to build intelligent systems that enhance data analytics. You will design, implement, and evaluate agentic AI solutions aiming for superior performance. The ideal candidate will have robust software engineering...SeniorFlexible hours
- ...Hartford, CT, is seeking a software engineer with 3-6 years of experience to develop AI-driven analytics systems. The... ...designing agent skills, building evaluation frameworks, and ensuring the performance... ...in Python and experience in agentic AI systems. Teradata promotes a...SeniorFlexible hours
- ...seeking an experienced candidate to build agentic systems that enhance AI-driven analytics. The role involves designing agent skills and evaluating system performance to solve customer... ...have hands-on experience in software engineering with a strong foundation in Python, a...SeniorFlexible hours
- ...Title: Senior AI Agentic Engineer Location: Spring, TX 77389 (hybrid: 3 days onsite / 2 days remote) Duration: Direct Hire Work Requirements... ...to improve grounding and reduce hallucination. Evaluate and continuously tune RAG systems for accuracy, latency, factual...SeniorWork experience placementLocal areaRemote workFlexible hours
- ...Senior AI Agentic Engineer ***This position is 5 days a week in the Charlotte Office***. The Senior AI Agentic Engineer is a senior production... ...context management, retrieval-augmented generation (RAG), evaluation, observability, and governed deployment. The engineer...SeniorPermanent employmentFull timePart timeH1bWork at officeWork visa
- ...mighty team works with senior leaders and partners across... ...strategic and rigorous engineering leader who is... ...innovation and approaches AI agent design as an engineering... ...for the Problem: Evaluate business problems and... ...workflows, ensuring that agentic hallucinations are...SeniorTemporary workLocal area
- ...their hybrid cloud and AI journeys. With support... ...are seeking a skilled AI Engineer to join an active... ...facing capacity. This is a senior individual contributor... ...technical proficiency in Agentic AI — you will be expected... ...Establish monitoring, evaluation, and feedback loops to...SeniorWorldwide
$229.9k - $262.4k
...Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services) Overview: At Capital One, we are creating responsible and reliable AI systems,... ...model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc. Leverage...SeniorFull timePart timeLocal area- ...Role: Senior AI Engineer - Agentic Systems and LLM Client Location: Mason, OH 100% Remote Job Description: We are seeking a senior AI engineer... ...APIs, backend services, and cloud platforms Establish evaluation, reliability, and performance strategies (accuracy,...SeniorRemote work
- ...their full potential. About the Role We are seeking a Senior Agentic AI Software Engineer to design, build, and operate production-grade AI agent... ...and integration with existing services, to reliability, evaluation, and long-term operation. You’ll work closely with...SeniorFull timeWork at officeWork from home
- ...landscapes. You'll develop the models and agentic architectures that power demand... ...largest retailers. This is applied AI at its most impactful. You'll work at... ...: data preprocessing, feature engineering, model training, evaluation, and production deployment Architect...SeniorImmediate startHome officeVisa sponsorshipFlexible hours3 days per week
- Senior Agentic AI Software Engineer - Hybrid US Job ID: 497243 Posted since: 04-Mar-2026 Organization: Siemens Industry Software Inc. Job type:... ...and integration with existing services, to reliability, evaluation, and long-term operation. You’ll work closely with...SeniorPermanent employmentFull timeWork at officeLocal areaRemote workWork from home
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Engineer, Agentic Evaluation & V&V. Be the first to apply!
- senior ai engineer United States
- ai ml engineer United States
- ai engineer remote United States
- ai engineer United States
- ai prompt engineer United States
- ai developer United States
- ai research engineer United States
- machine learning ai engineer United States
- senior hr assistant United States
- senior cost analyst United States


