Senior AI Engineer Inference & Agent Systems
Arcana Analytics
Senior AI Engineer — Inference & Agent Systems
Title: Applied AI Engineer — Inference & Agent Systems
Location: United States
Arcana is building AI agents that synthesize information across heterogeneous sources and deliver structured, reasoned answers in real time. The product only works if the agents are fast, reliable, and correct, not approximately correct.
Our stack: Go + Temporal for orchestration, a Plan-Execute-Synthesize agent architecture, and an evaluation harness we use to measure every regression. The problems are hard. The latency bar is aggressive. The accuracy requirements are unforgiving.
The Work
Inference Optimization
- Drive TTFT below 400ms for multi-step agent pipelines
- Streaming optimization: first token to user while sub-agents are still running
- KV cache strategy, prompt compression, dynamic context window management
- Multi-provider routing: model selection by latency, cost, and task type across OpenAI, Anthropic, Gemini, and open-weight models
Agent Architecture
- Design and implement Plan-Execute-Synthesize pipelines that run sub-agents in parallel DAGs, not sequential chains
- Build reliable orchestration on top of Temporal: retries, timeouts, partial failure recovery, idempotency
- Structured output enforcement: JSON schema validation, retry loops on malformed LLM output, graceful degradation
- Tool call design: schema design that LLMs actually follow reliably across providers
Evaluation & Harness
- Own the eval framework end to end: ground truth datasets, automated scoring pipelines, regression detection on every PR
- LLM-as-judge pipelines for qualitative output assessment
- Latency regression testing - p50/p95/p99 tracked across every deployment
- Adversarial test case design: ambiguous queries, missing data, conflicting sources, malformed tool responses
Infrastructure
- Model serving and cold start optimization
- Async worker architecture for parallel sub-agent execution
- Observability: trace every token, every tool call, every synthesis step
What We're Looking For
You've built something that runs in production at a meaningful scale and you understand why it's fast (or why it isn't).
Strong signal:
- You've worked on inference pipelines where TTFT was the primary metric and you moved it meaningfully
- You've built multi-step agent systems and you know where they break not from reading papers but from watching them fail in production
- You've written eval harnesses from scratch and you have opinions about what makes a ground truth dataset actually useful
- You've debugged LLM non-determinism in production and built systems resilient to it
- You've worked with streaming LLM responses and built infrastructure around partial output handling
Weaker signal (but not disqualifying):
- You've fine-tuned models but haven't shipped inference systems
- You've used LangChain/LlamaIndex but haven't built the layer underneath
- Strong ML research background without systems exposure
Stack familiarity (we care more about depth than match): Go, Python, Temporal, Kafka, PostgreSQL, Docker
Why This Role
The problems here don't have blog posts about them yet. Parallel agent DAG execution under hard latency budgets, streaming synthesis across partial sub-agent results, eval harnesses for non-deterministic multi-step systems: these are genuinely unsolved at production quality. Small team. High ownership. Every engineer's decisions ship to production.
Who We Want to Hear From
- You've shipped inference systems at:
- A real-time AI product (search, coding assistant, chat at scale)
- A model serving infrastructure company
- An agent platform (any domain)
- Or you've built eval/harness infrastructure that a team of 10+ engineers actually trusted to catch regressions.
Apply
Send to: View email address on click.appcast.io
Include:
- One system you built where latency was the primary constraint what you measured, what you changed, what moved
- Link to anything public (code, writing, talks)
- No cover letter required
We respond to every application.
$96.8k - $306.4k
...Job Description The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical... ...and operating next-generation AI systems on Oracle Cloud Infrastructure (OCI... ..., autonomous workflows, scalable inference infrastructure, and enterprise AI...SeniorTemporary workFlexible hours- ...technology company in Washington, D.C. is seeking a senior software engineer. This role focuses on utilizing AI-assisted coding practices to deliver features at... ...have strong experience with LLMs and distributed systems development in Scala, as well as the ability to architect...Senior
$99.6k - $223.4k
...Job Description Oracle Health is seeking a Senior AI Agent Engineer to build production AI agents and workflow automation capabilities that... ...implement AI-driven workflows that integrate with enterprise systems and support use cases such as SQL generation, pipeline...SeniorTemporary workFlexible hours$99k - $225k
Booz Allen Hamilton is seeking an Agentic AI Engineer to join their team in Washington, DC. The role focuses on building autonomous AI systems that actively engage in multi-agent orchestrations, utilizing advanced technologies such as agent orchestration frameworks and...Suggested- Ocrolus is seeking a hands-on Applied AI Specialist to drive AI solutions that enhance internal processes. The ideal candidate will... ...Applied AI and ML and a proven track record of building scalable AI systems. Responsibilities include identifying AI opportunities,...Senior
- ...seeking a Sr. Security Software Engineer for the Starshield project in Washington... ...This role focuses on utilizing AI for security automation and integration within systems. Candidates should have... ...systems, build security-critical agents, and assist in fixing security vulnerabilities...Senior
$130k - $150k
BLEN Corp is seeking an AI Engineer in Washington, DC to design and build AI systems for federal and commercial clients. The role involves developing agentic systems, creating LLM-powered applications, and working closely with stakeholders. Ideal candidates should have...SeniorWork from home- A technology solutions provider in Virginia is seeking a Senior AI Integration Engineer to manage the integration of AI solutions and oversee system engineering activities. The role requires a top-secret security clearance and a Bachelor's degree in a related field, alongside...SeniorFull time
$86.8k - $198k
Booz Allen Hamilton is seeking a Senior Software Development Engineer to design and build agentic workflows and LLM integrations for an innovative... ...platforms. You will collaborate with various teams, support AI systems, and validate LLM quality under production conditions,...Senior$124k - $280k
...Competency: Data, Analytics & AI Industry/Sector: Health... ...people in data and analytics engineering focus on leveraging advanced technologies... ...algorithms, models, and systems to enable intelligent decision... ...for health systems. As a Senior Manager, you will serve as a strategic...SeniorFull timeH1b$124k - $280k
...Competency: Data, Analytics & AI Industry/Sector: Health... ...people in data and analytics engineering focus on leveraging advanced technologies... ...algorithms, models, and systems to enable intelligent decision... ...system and health plans. As a Senior Manager, you will drive use...SeniorFull timeH1b- ...Job Description Job Description Senior AI Engineer Hybrid - Washington D.C. (preferred)... ...analyses, including back-testing, rejection inference, and performance analyses using... ...operationalize models into production systems, ensuring scalability and resilience at...SeniorFlexible hours
$99k - $225k
Phase2 Technology in McLean, Virginia, is searching for an Agentic AI Engineer to develop proactive AI systems. You will utilize expertise in agent orchestration frameworks and LLM fine-tuning. The role requires 5+ years of software development experience and deep knowledge...- ...Senior AI/ML Engineer Elevate your career with MANTECH International Corporation! Join a dynamic... ...model serving platforms for efficient inference Implement automated machine... ...Troubleshoot complex issues in AI/ML systems Minimum Qualifications: ~ Bachelor...SeniorTemporary workLocal area
$335k
...changing how military systems are designed, built and... ...powered by Lattice OS, an AI-powered operating... ...seeking a Director of AI Engineering & Research to build and... ...quantization, on‑device inference, and runtime safety. Set... ...training, and deployment of agents, VLA models, and...Full timeWork experience placement- A leading insurance provider is seeking a Senior Staff Engineer in Chevy Chase, MD to drive AI-powered capabilities and lead engineering teams. The ideal candidate will have over 8 years of experience, particularly in Generative AI, and a strong programming background...Senior
- ...A technology solutions provider is seeking a Software Engineer Level 4 to develop and maintain diverse software systems. This role will focus on backend development using Python and FastAPI, along with managing PostgreSQL databases and high-performance API integrations...Senior
$220k - $350k
...SPACE EXPLORATION TECHNOLOGIES CORP is seeking a Sr. AI Engineer for special programs. This role focuses on engineering AI capabilities... ...include optimizing integrations between AI models and government systems, collaborating on developer tools, and shipping production-...Senior- ...Quality Support, Inc. is looking for a Senior Software Engineer in Alexandria, Virginia. Candidates must have 25-30 years of experience with strong... ...involves developing software solutions, conducting systems analysis, and supporting a team of engineers. Quality Support...Senior
- ...Comcast is hiring a Senior Software Engineer to lead technical direction in AI Agent initiatives. You will focus on building scalable backend systems that leverage modern AI techniques. Responsibilities include architectural leadership, mentorship of developers, and ensuring...Senior
$148.9k - $223.4k
...KBR is looking for a Principal System Engineer/Enterprise Architect in Chevy Chase, MD. This role involves designing IT enterprise architecture, working closely with cross-functional teams, and contributing to national security solutions. The ideal candidate will have...Senior$100k - $160k
...Majus Consulting is looking for a highly skilled Senior Software Engineer to join our Washington, DC team. In this full-time position, you'll... ...solutions, lead technical initiatives, and contribute to scalable systems supporting diverse mission needs. The role demands strong...SeniorFull time- ...Senior AI Engineer As a Senior AI Engineer you build and improve the AI agent systems that power personalized genetics based health guidance. You work side by side with our AI Backend Architect to move from simple large language model calls to a true multi agent brain...SeniorRemote work
- ...Quality Support, Inc. has openings for Senior Software/Computer/AI Engineers with 25 – 30 years’ experience with a strong background of submarine construction... ...information needs, conferring with users, studying systems flow, data usage, and work processes; investigating...Senior
- ...A global consulting firm is seeking a Senior AI Native Engineer to deliver innovative AI solutions. The role requires a Bachelor's degree and 3... ...You will enhance data pipelines and implement scalable AI systems that meet diverse business requirements, collaborating within...SeniorFlexible hours
$137k - $200.2k
MAXAR TECHNOLOGIES, INC. is looking for an AI/ML Engineer in McLean, Virginia. The role involves developing AI applications, maintaining RAG pipelines, and integrating LLMs. Candidates must have at least 8 years of experience, a bachelor's degree in computer science, and...Senior- ...Senior AI Agentic Engineer The Senior AI Agentic Engineer designs, builds, and operationalizes intelligent agent systems that automate complex enterprise business processes end-to-end. This role works at the intersection of LLMs, systems engineering, and applied machine...SeniorWork experience placement
$160k - $205k
A leading AI solutions company in Washington, DC, is seeking an experienced engineer to design and build multi-agent systems for complex workflows. The ideal candidate will have strong skills in Python, AWS services, and experience in deploying LLM technologies. This role...Flexible hours$140k - $190k
Steampunk is looking for a highly skilled AI Developer to design, build, and optimize advanced AI solutions. The role covers the... ...ingestion to application integration. The ideal candidate has a strong engineering background, proficiency in Python, and experience with LLM...Senior- OATS is seeking an Engineer I, AI Agents to support team activities related to design and implementation. The role involves collaborating on technical solutions and maintaining systems while using modern programming techniques. The ideal candidate should have a degree in...Remote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Engineer Inference & Agent Systems. Be the first to apply!
- ai engineer remote Washington DC
- ai prompt engineer Washington DC
- senior ai engineer Washington DC
- machine learning ai engineer Washington DC
- ai engineer Washington DC
- ai developer Washington DC
- ai ml engineer Washington DC
- signing agent Washington DC
- freight agent no experience Washington DC
- state farm agent Washington DC


