Senior AI Engineer Inference & Agent Systems

Arcana Analytics

Title: Applied AI Engineer — Inference & Agent Systems

Location: United States

Arcana is building AI agents that synthesize information across heterogeneous sources and deliver structured, reasoned answers in real time. The product only works if the agents are fast, reliable, and correct, not approximately correct.

Our stack: Go + Temporal for orchestration, a Plan-Execute-Synthesize agent architecture, and an evaluation harness we use to measure every regression. The problems are hard. The latency bar is aggressive. The accuracy requirements are unforgiving.

The Work

Inference Optimization

  • Drive TTFT below 400ms for multi-step agent pipelines
  • Streaming optimization: first token to user while sub-agents are still running
  • KV cache strategy, prompt compression, dynamic context window management
  • Multi-provider routing: model selection by latency, cost, and task type across OpenAI, Anthropic, Gemini, and open-weight models

Agent Architecture

  • Design and implement Plan-Execute-Synthesize pipelines that run sub-agents in parallel DAGs, not sequential chains
  • Build reliable orchestration on top of Temporal: retries, timeouts, partial failure recovery, idempotency
  • Structured output enforcement: JSON schema validation, retry loops on malformed LLM output, graceful degradation
  • Tool call design: schema design that LLMs actually follow reliably across providers

Evaluation & Harness

  • Own the eval framework end to end: ground truth datasets, automated scoring pipelines, regression detection on every PR
  • LLM-as-judge pipelines for qualitative output assessment
  • Latency regression testing: p50/p95/p99 tracked across every deployment
  • Adversarial test case design: ambiguous queries, missing data, conflicting sources, malformed tool responses

Infrastructure

  • Model serving and cold start optimization
  • Async worker architecture for parallel sub-agent execution
  • Observability: trace every token, every tool call, every synthesis step

What We're Looking For

You've built something that runs in production at a meaningful scale and you understand why it's fast (or why it isn't).

Strong signal:

  • You've worked on inference pipelines where TTFT was the primary metric and you moved it meaningfully
  • You've built multi-step agent systems and you know where they break, not from reading papers but from watching them fail in production
  • You've written eval harnesses from scratch and you have opinions about what makes a ground truth dataset actually useful
  • You've debugged LLM non-determinism in production and built systems resilient to it
  • You've worked with streaming LLM responses and built infrastructure around partial output handling

Weaker signal (but not disqualifying):

  • You've fine-tuned models but haven't shipped inference systems
  • You've used LangChain/LlamaIndex but haven't built the layer underneath
  • Strong ML research background without systems exposure

Stack familiarity (we care more about depth than match): Go, Python, Temporal, Kafka, PostgreSQL, Docker

Why This Role

The problems here don't have blog posts about them yet. Parallel agent DAG execution under hard latency budgets, streaming synthesis across partial sub-agent results, eval harnesses for non-deterministic multi-step systems: these are genuinely unsolved at production quality. Small team. High ownership. Every engineer's decisions ship to production.

Who We Want to Hear From

  • You've shipped inference systems at:
    • A real-time AI product (search, coding assistant, chat at scale)
    • A model serving infrastructure company
    • An agent platform (any domain)
  • Or you've built eval/harness infrastructure that a team of 10+ engineers actually trusted to catch regressions.

Apply

Send to: View email address on click.appcast.io

Include:

  1. One system you built where latency was the primary constraint: what you measured, what you changed, what moved
  2. Link to anything public (code, writing, talks)
  3. No cover letter required

We respond to every application.

Vacancy posted 1 day ago