Senior AI Engineer Inference & Agent Systems

Arcana Analytics

Title: Applied AI Engineer — Inference & Agent Systems

Location: United States

Arcana is building AI agents that synthesize information across heterogeneous sources and deliver structured, reasoned answers in real time. The product only works if the agents are fast, reliable, and correct, not approximately correct.

Our stack: Go + Temporal for orchestration, a Plan-Execute-Synthesize agent architecture, and an evaluation harness we use to measure every regression. The problems are hard. The latency bar is aggressive. The accuracy requirements are unforgiving.

The Work

Inference Optimization

  • Drive time to first token (TTFT) below 400 ms for multi-step agent pipelines
  • Streaming optimization: first token to user while sub-agents are still running
  • KV cache strategy, prompt compression, dynamic context window management
  • Multi-provider routing: model selection by latency, cost, and task type across OpenAI, Anthropic, Gemini, and open-weight models

Agent Architecture

  • Design and implement Plan-Execute-Synthesize pipelines that run sub-agents in parallel DAGs, not sequential chains
  • Build reliable orchestration on top of Temporal: retries, timeouts, partial failure recovery, idempotency
  • Structured output enforcement: JSON schema validation, retry loops on malformed LLM output, graceful degradation
  • Tool call design: schema design that LLMs actually follow reliably across providers
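
To make the structured output bullet concrete, here is a minimal sketch of a validate-and-retry loop with graceful degradation. It is an illustration under stated assumptions, not Arcana's implementation: the `REQUIRED_FIELDS` schema, the `call_model` callable, and the shape of the degraded fallback are all hypothetical.

```python
import json
from typing import Callable, Optional

# Hypothetical schema for illustration: field name -> expected Python type.
REQUIRED_FIELDS = {"answer": str, "sources": list}


def validate(raw: str) -> Optional[dict]:
    """Return the parsed object if it matches the expected shape, else None."""
    try:
        obj = json.loads(raw)
    except json.JSONDecodeError:
        return None
    if not isinstance(obj, dict):
        return None
    for name, kind in REQUIRED_FIELDS.items():
        if not isinstance(obj.get(name), kind):
            return None
    return obj


def call_with_retries(
    call_model: Callable[[str], str], prompt: str, max_attempts: int = 3
) -> dict:
    """Call the model until its output validates or attempts run out.

    `call_model` is an assumed stand-in for any provider client that maps
    a prompt string to a raw completion string.
    """
    for _ in range(max_attempts):
        parsed = validate(call_model(prompt))
        if parsed is not None:
            return parsed
        # Nudge the model toward the schema before the next attempt.
        prompt += "\nReturn only valid JSON with keys: answer, sources."
    # Graceful degradation: an explicit, well-formed "no answer" result
    # that downstream steps can detect instead of crashing.
    return {"answer": None, "sources": [], "degraded": True}
```

In a Temporal-based system the retry policy would more likely live on the activity itself; the loop above only shows the validate, re-prompt, degrade sequence.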

Evaluation & Harness

  • Own the eval framework end to end: ground truth datasets, automated scoring pipelines, regression detection on every PR
  • LLM-as-judge pipelines for qualitative output assessment
  • Latency regression testing: p50/p95/p99 tracked across every deployment
  • Adversarial test case design: ambiguous queries, missing data, conflicting sources, malformed tool responses
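
As a rough illustration of the latency regression bullet, the sketch below compares p50/p95/p99 between a baseline run and a candidate run. The nearest-rank percentile definition, the 10% tolerance, and the function names are assumptions made for the example, not details from this posting.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def latency_regressions(baseline_ms, candidate_ms, tolerance=0.10):
    """Return the percentiles where the candidate is slower than the
    baseline by more than `tolerance` (a fraction, assumed 10% here)."""
    regressions = {}
    for p in (50, 95, 99):
        base = percentile(baseline_ms, p)
        cand = percentile(candidate_ms, p)
        if cand > base * (1 + tolerance):
            regressions[f"p{p}"] = (base, cand)
    return regressions
```

A CI job could fail the build whenever `latency_regressions` returns a non-empty dict. Checking the tail percentiles separately matters because a change can leave p50 untouched while p99 degrades badly.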

Infrastructure

  • Model serving and cold start optimization
  • Async worker architecture for parallel sub-agent execution
  • Observability: trace every token, every tool call, every synthesis step

What We're Looking For

You've built something that runs in production at meaningful scale, and you understand why it's fast (or why it isn't).

Strong signal:

  • You've worked on inference pipelines where TTFT was the primary metric and you moved it meaningfully
  • You've built multi-step agent systems and you know where they break, not from reading papers but from watching them fail in production
  • You've written eval harnesses from scratch and you have opinions about what makes a ground truth dataset actually useful
  • You've debugged LLM non-determinism in production and built systems resilient to it
  • You've worked with streaming LLM responses and built infrastructure around partial output handling

Weaker signal (but not disqualifying):

  • You've fine-tuned models but haven't shipped inference systems
  • You've used LangChain/LlamaIndex but haven't built the layer underneath
  • Strong ML research background without systems exposure

Stack familiarity (we care more about depth than match): Go, Python, Temporal, Kafka, PostgreSQL, Docker

Why This Role

The problems here don't have blog posts about them yet. Parallel agent DAG execution under hard latency budgets, streaming synthesis across partial sub-agent results, eval harnesses for non-deterministic multi-step systems: these are genuinely unsolved at production quality. Small team. High ownership. Every engineer's decisions ship to production.

Who We Want to Hear From

  • You've shipped inference systems at:
    • A real-time AI product (search, coding assistant, chat at scale)
    • A model serving infrastructure company
    • An agent platform (any domain)
  • Or you've built eval/harness infrastructure that a team of 10+ engineers actually trusted to catch regressions.

Apply

Send to: View email address on click.appcast.io

Include:

  1. One system you built where latency was the primary constraint: what you measured, what you changed, what moved
  2. Link to anything public (code, writing, talks)
  3. No cover letter required

We respond to every application.

Vacancy posted 1 day ago