Senior AI Engineer Inference & Agent Systems
Arcana Analytics
Senior AI Engineer — Inference & Agent Systems
Title: Applied AI Engineer — Inference & Agent Systems
Location: United States
Arcana is building AI agents that synthesize information across heterogeneous sources and deliver structured, reasoned answers in real time. The product only works if the agents are fast, reliable, and correct, not approximately correct.
Our stack: Go + Temporal for orchestration, a Plan-Execute-Synthesize agent architecture, and an evaluation harness we use to measure every regression. The problems are hard. The latency bar is aggressive. The accuracy requirements are unforgiving.
The Work
Inference Optimization
- Drive TTFT below 400ms for multi-step agent pipelines
- Streaming optimization: first token to user while sub-agents are still running
- KV cache strategy, prompt compression, dynamic context window management
- Multi-provider routing: model selection by latency, cost, and task type across OpenAI, Anthropic, Gemini, and open-weight models
Agent Architecture
- Design and implement Plan-Execute-Synthesize pipelines that run sub-agents in parallel DAGs, not sequential chains
- Build reliable orchestration on top of Temporal: retries, timeouts, partial failure recovery, idempotency
- Structured output enforcement: JSON schema validation, retry loops on malformed LLM output, graceful degradation
- Tool call design: schema design that LLMs actually follow reliably across providers
Evaluation & Harness
- Own the eval framework end to end: ground truth datasets, automated scoring pipelines, regression detection on every PR
- LLM-as-judge pipelines for qualitative output assessment
- Latency regression testing - p50/p95/p99 tracked across every deployment
- Adversarial test case design: ambiguous queries, missing data, conflicting sources, malformed tool responses
Infrastructure
- Model serving and cold start optimization
- Async worker architecture for parallel sub-agent execution
- Observability: trace every token, every tool call, every synthesis step
What We're Looking For
You've built something that runs in production at a meaningful scale and you understand why it's fast (or why it isn't).
Strong signal:
- You've worked on inference pipelines where TTFT was the primary metric and you moved it meaningfully
- You've built multi-step agent systems and you know where they break not from reading papers but from watching them fail in production
- You've written eval harnesses from scratch and you have opinions about what makes a ground truth dataset actually useful
- You've debugged LLM non-determinism in production and built systems resilient to it
- You've worked with streaming LLM responses and built infrastructure around partial output handling
Weaker signal (but not disqualifying):
- You've fine-tuned models but haven't shipped inference systems
- You've used LangChain/LlamaIndex but haven't built the layer underneath
- Strong ML research background without systems exposure
Stack familiarity (we care more about depth than match): Go, Python, Temporal, Kafka, PostgreSQL, Docker
Why This Role
The problems here don't have blog posts about them yet. Parallel agent DAG execution under hard latency budgets, streaming synthesis across partial sub-agent results, eval harnesses for non-deterministic multi-step systems: these are genuinely unsolved at production quality. Small team. High ownership. Every engineer's decisions ship to production.
Who We Want to Hear From
- You've shipped inference systems at:
- A real-time AI product (search, coding assistant, chat at scale)
- A model serving infrastructure company
- An agent platform (any domain)
- Or you've built eval/harness infrastructure that a team of 10+ engineers actually trusted to catch regressions.
Apply
Send to: View email address on click.appcast.io
Include:
- One system you built where latency was the primary constraint what you measured, what you changed, what moved
- Link to anything public (code, writing, talks)
- No cover letter required
We respond to every application.
$96.8k - $251.6k
...Job Description The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical... ...and operating next-generation AI systems on Oracle Cloud Infrastructure (... ..., autonomous workflows, scalable inference infrastructure, and enterprise AI...SeniorTemporary workFlexible hours$129k - $193.5k
Recorded Future in Washington, DC is looking for a Senior Software Engineer to drive the development of scalable software systems focused on AI and Large Language Models. The ideal candidate will have expertise in Python, distributed systems, and modern cloud platforms...Senior- ...technology company in Washington, D.C. is seeking a senior software engineer. This role focuses on utilizing AI-assisted coding practices to deliver features at... ...have strong experience with LLMs and distributed systems development in Scala, as well as the ability to architect...Senior
$99k - $225k
Booz Allen Hamilton is seeking an Agentic AI Engineer to join their team in Washington, DC. The role focuses on building autonomous AI systems that actively engage in multi-agent orchestrations, utilizing advanced technologies such as agent orchestration frameworks and...Suggested$99k - $225k
A leading AI-focused technology firm in Maryland is seeking an experienced Agentic AI Engineer. The role involves designing intelligent agent architectures, developing multi-agent systems, and enhancing reasoning accuracy through advanced RAG pipelines. Candidates should...Suggested- Ocrolus is seeking a hands-on Applied AI Specialist to drive AI solutions that enhance internal processes. The ideal candidate will... ...Applied AI and ML and a proven track record of building scalable AI systems. Responsibilities include identifying AI opportunities,...Senior
- BLACKROCK STRATEGY INC is seeking a Senior Systems Engineer to integrate AI-enabled capabilities into defense and national security operations. This role requires a Bachelor’s degree in Engineering or related field and at least 5 years of experience in systems engineering...Senior
$130k - $150k
BLEN Corp is seeking an AI Engineer in Washington, DC to design and build AI systems for federal and commercial clients. The role involves developing agentic systems, creating LLM-powered applications, and working closely with stakeholders. Ideal candidates should have...SeniorWork from home- ...Senior AI Systems Quality Engineer This role sits at the intersection of AI engineering, platform quality, and production reliability for advanced... ...automated testing and evaluation frameworks for LLM-driven and agent-based architectures. You will work closely with AI...SeniorRemote workHome office
$130k - $150k
BLEN is looking for an AI Engineer in Washington, DC. The role involves designing and building agentic systems for federal and commercial clients, focusing on large language model applications. Candidates should have 5+ years of software engineering experience, hands-on...Senior- Radley James is seeking an AI Systems Engineer in Washington, DC to work on advanced AI systems for high-security environments. The role entails designing multi-agent architectures and refining reasoning and decision-making systems. Candidates should have 1-6 years of relevant...Relocation package
$130k - $260k
Geico is seeking a Senior Staff Engineer in Bethesda, Maryland. This role focuses on creating AI applications that enhance customer self-service across various communication... ...a strong background in developing scalable systems. An attractive salary range of $130,000 to $26...Senior$99.8k - $135.3k
...Electrical/Critical Systems Commissioning Agent WSP USA is initiating a search for a Full-time Electrical... ...Virginia. Job Description Our engineers work on high quality, high-profile,... ...growth opportunities: Many of our senior leaders started out as young engineers...SeniorFull timeFor subcontractorWork at office$99.8k - $135.3k
...for a Full-time Electrical/ Critical Systems’ Commissioning Agent for our New York office. Other... ...Virginia. Job Description Our engineers work on high quality, high-profile, national... ...growth opportunities: Many of our senior leaders started out as young engineers...SeniorFull timeFor subcontractorWork at officeLocal areaFlexible hours- ...Job Description Job Description Senior AI Engineer Hybrid - Washington D.C. (preferred)... ...analyses, including back-testing, rejection inference, and performance analyses using... ...operationalize models into production systems, ensuring scalability and resilience at...SeniorFlexible hours
- Capital One National Association is looking for a Lead AI Engineer to develop and deploy innovative AI solutions, enhancing customer interactions... ...business operations. The role emphasizes building scalable AI systems and requires solid programming skills, preferably in Python....Senior
$99k - $225k
Phase2 Technology in McLean, Virginia, is searching for an Agentic AI Engineer to develop proactive AI systems. You will utilize expertise in agent orchestration frameworks and LLM fine-tuning. The role requires 5+ years of software development experience and deep knowledge...$103.2k - $203.4k
...government forward! Build AI that matters . We ship... ...workflows and RAG systems tailored to mission data... ...predictable cost. Agent frameworks & orchestration... ...AI services or on prem inference stacks Background in... ...; mentorship of engineers. Clear communication...Live inWork at officeLocal area$107.9k - $195.05k
...is seeking an experienced Senior AI/ML Engineer to support the delivery, enhancement... ...training, validation, and inference. Train and tune... ...and integration. Analyze system performance metrics and recommend... ...–reflection loops, multi-agent collaboration and coordination...SeniorLocal areaImmediate start- ...Title: Senior Embodied AI Engineer About Us: UnitX builds the world's leading physical AI systems to automate repetitive visual tasks in factories. UnitX is a fast-moving... ...and optimizing ML models for real-time inference on robotic hardware (e.g., NVIDIA Jetson...SeniorFull time
- ...harnessing the latest AI innovations. Building... ...collaborations with health systems across the U.S., we... ...We’re hiring a Staff AI Engineer to lead the design and... ...evolution of Sage Care’s AI agents, with a focus on voice... ...model providers or inference stacks What Success...Full time
$77k - $202k
...clients through innovative, AI-driven solutions. As a Senior Associate, you will... ..., test, and deploy AI/ML systems and applications for cybersecurity... ...for model training and inference on cloud platforms like... ...development or AI/ML engineering What Sets You Apart...SeniorFull timeH1b- A technology solutions provider is seeking a Software Engineer Level 4 to develop and maintain diverse software systems. This role will focus on backend development using Python and FastAPI, along with managing PostgreSQL databases and high-performance API integrations....Senior
$140k - $190k
Steampunk is looking for a highly skilled AI Developer to design, build, and optimize advanced AI solutions. The role covers the... ...ingestion to application integration. The ideal candidate has a strong engineering background, proficiency in Python, and experience with LLM...Senior- ...the Federal government, from senior level policy makers to... ...Position Overview The Senior AI Architect/Engineer will lead the design,... ...access, RAG-enabled knowledge systems, and governed AI APIs available... ...controlled environments for coding agents, containers, APIs, web...Senior
- ...Senior AI Security Software Engineer The CERT Division of the Software Engineering Institute (SEI) is seeking applicants for the role of Senior AI... ...cybersecurity research, advancing the resilience of software systems and responding to sophisticated cyber threats. As AI...SeniorFull timePart timeRelocation packageFlexible hours
- ...A leading technology consulting firm in Virginia is seeking an experienced AI Native Engineer to drive the design and deployment of cloud-native systems and agentic workflows. You will work directly with enterprise clients to create robust solutions that scale across various...Senior
$77k - $202k
...Competency: Data, Analytics & AI Industry/Sector: Not... ...people in data and analytics engineering focus on leveraging advanced technologies... ...algorithms, models, and systems to enable intelligent decision... ...that meet business needs. As a Senior Associate, you analyze complex...SeniorFull timeH1b$160k - $205k
A leading AI solutions company in Washington, DC, is seeking an experienced engineer to design and build multi-agent systems for complex workflows. The ideal candidate will have strong skills in Python, AWS services, and experience in deploying LLM technologies. This role...Flexible hours$148.9k - $223.4k
KBR is looking for a Principal System Engineer/Enterprise Architect in Chevy Chase, MD. This role involves designing IT enterprise architecture, working closely with cross-functional teams, and contributing to national security solutions. The ideal candidate will have extensive...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Engineer Inference & Agent Systems. Be the first to apply!
- machine learning ai engineer Washington DC
- senior ai engineer Washington DC
- ai engineer remote Washington DC
- ai ml engineer Washington DC
- ai engineer Washington DC
- ai developer Washington DC
- ai research engineer Washington DC
- ai prompt engineer Washington DC
- special agent Washington DC
- transfer agent Washington DC



