Senior ML Observability Engineer
ECS Limited
Senior ML Observability Engineer
Everforth ECS is seeking a Senior ML Observability Engineer to work in the National Capital Region covering the Pentagon, Falls Church, and Fairfax. Please Note: This position is contingent upon contract award.
The War Data Platform (WDP) is a key initiative within the U.S. Department of War's (DoW) AI-First strategy introduced in early 2026. The WDP focuses on operational warfighting data and aims to accelerate the deployment of artificial intelligence (AI) on the battlefield. The WDP extends to Unclassified, Secret, and Top Secret environments, and supports collaboration between Combatant Commands, Joint Staff directorates, Senior Executive Service leaders, and operational analysts.
The Senior ML Observability Engineer architects and governs the instrumentation and telemetry infrastructure needed to ensure production AI and machine learning models deployed across WDP's multi-enclave environment perform reliably and securely at mission scale. This role is essential to maintaining real-time visibility into model behavior, pipeline execution, and cross-domain access interactions in direct support of Combatant Command and Joint Staff decision-making needs.
• Designs, implements, and governs observability and instrumentation architectures supporting AI and machine learning model-serving operations across Unclassified, Secret, and Top Secret enclaves within the War Data Platform (WDP) Core Integration enterprise.
• Develops semantic conventions, runtime instrumentation patterns, and telemetry pipelines that generate latency metrics, error signatures, throughput indicators, model-specific performance signals, and operational readiness measurements for deployed models and serving surfaces.
• Integrates observability capabilities into existing data pipelines, model-deployment workflows, API access patterns, and serving runtime frameworks to provide mission-relevant monitoring aligned with Combatant Command and Joint Staff decision-support needs.
• Configures and validates instrumentation using platforms such as OpenTelemetry, Prometheus, Grafana, Elastic, Splunk, Amazon CloudWatch, and service mesh telemetry components to deliver real-time visibility into model behavior, cross-domain access interactions, and pipeline execution characteristics.
• Conducts observability readiness reviews, supports test and evaluation gates, and collaborates with cybersecurity personnel to embed anomaly-detection signals aligned with Zero Trust and DoW cyber standards.
• Works with serving engineers, pipeline engineers, platform teams, and external provider integration engineers to maintain observability consistency across enclaves and resolve domain-specific telemetry constraints.
• Produces observability standards, instrumentation specifications, dashboards, alerting configurations, and performance analysis reports that strengthen reliability, accelerate incident response, and reinforce mission assurance for production model access across all security networks.
• Performs other duties as assigned.
• Current Secret security clearance with the ability to obtain and maintain a Top Secret (TS) security clearance with Sensitive Compartmented Information (SCI).
• 10 or more years of progressive experience in systems engineering, platform operations, or ML/AI infrastructure roles, with a demonstrated focus on observability, telemetry, and monitoring in classified or federal government cloud environments.
• Hands-on experience designing and implementing observability pipelines using industry-standard tooling such as OpenTelemetry, Prometheus, Grafana, Elastic, Splunk, or Amazon CloudWatch, including instrumentation of AI/ML model-serving runtimes and data pipelines.
• Experience operating across multi-enclave environments, including NIPRNet, SIPRNet, and JWICS, with demonstrated ability to adapt telemetry and observability architectures to cross-domain constraints and multi-level security requirements.
• CompTIA Cloud+ certification or equivalent, demonstrating foundational knowledge of cloud infrastructure, security, and operational monitoring standards.
• Strong problem-solving and decision-making capabilities, with a proven ability to weigh the relative costs and benefits of potential actions and identify the most appropriate solution.
• Highly developed interpersonal and oral/written communication skills, with the ability to effectively and professionally interact with a diverse set of stakeholders (from peers to end-users to executive management).
- ...Senior Ml Serving Engineer Everforth ECS is seeking a Senior ML Serving Engineer to work in the National Capital Region covering the Pentagon, Falls Church, and Fairfax. This position is contingent upon contract award. The War Data Platform (WDP) is a key initiative...SeniorContract work
- ...becoming core to how our platform works. We're looking for a senior machine learning engineer to take the lead on this effort. You'll be the architect... ...parsing What we're looking for 5+ years in applied ML, including experience with retrieval, embeddings, and...SeniorLocal areaRemote work
$146k - $234k
...Senior AI/ML Software Engineer Job Locations US-MD-Annapolis Junction | US-VA-Herndon | US-VA-Reston Requisition ID 2... ...with OpenTelemetry tracing, provenance audit trails, and observability tooling for debugging and performance evaluation Write...SeniorContract workShift work$145k - $210k
...Senior AI/ML Engineer Cooley is seeking a Senior AI/ML Engineer to join the Practice Engineering team. Position summary: As a leading... ..., LangSmith, Pydantic AI, or similar orchestration and observability tooling ~ Knowledge of cloud security, governance, and data...SeniorFull timeTemporary workWork at officeFlexible hoursWeekend work$107.9k - $195.05k
...Leidos Digital Modernization sector is seeking an experienced Senior AI/ML Engineer to support the delivery, enhancement, and adoption of... ...-making environments ~ Familiarity with evaluation and observability tools for AI agents, such as LangSmith, OpenAI Evals, or custom...SeniorLocal areaImmediate start- ...most. Job Description: Quartermaster AI is seeking a Senior AI/ML Engineer with an emphasis in RF analysis to develop and deploy... ...provide contextual understanding of vessel activity based on observed RF signatures. Key Responsibilities: Design, train...Senior
- ...Virginia is seeking an experienced Sr Manager - Software Engineer to lead and develop a senior engineering team. The ideal candidate should have over... ...engineering experience with strong expertise in observability tools like Splunk and DataDog. This role emphasizes strategic...Senior
$140k - $190k
...Overview We are looking for a seasoned Senior Machine Learning Engineer to work with our existing team of Data Scientists and Engineers to apply AI/ML technologies in support of Federal use cases, with a focus on solutions built within the Databricks (DBX) platform...Senior- ...Senior Machine Learning Engineer At the SEI AI Division, we conduct research in applied artificial intelligence... ...into the vulnerabilities of AI and ML algorithms and securing against those... ...mitigations and defenses for observed attacks affecting AI and ML algorithms...SeniorFull timePart time
$95.3k - $158.8k
...you a collaborative Machine Learning Ops Engineer looking to work for a mission driven global... ...landscapes. About the role, as a Senior Machine Learning Engineer you'll work on... ...confidentiality. Key Responsibilities ML & LLM Engineering, Search and Recommendation...SeniorLocal area- ...Senior Machine Learning Engineer McLean, Virginia Senior Machine Learning Engineer Location: McLean, VA (hybrid); occasional travel to Durham... ...and customer sites About CoVar CoVar is a small AI/ML R&D software company with offices in Durham, NC and McLean...SeniorTemporary workWork at officeLocal areaFlexible hoursShift work
- ...The Mission Starts Here TheIncLab engineers and delivers intelligent digital applications... ...purpose as well. We are looking for a Senior Machine Learning Engineer to that will focus... ...and solve optimization problems using ML techniques Pathfinding and routing...SeniorFlexible hours
- ...Machine Learning Engineer At the SEI AI Division, we conduct research in applied artificial... ...) and eager to explore novel AI and ML theory while delivering mission-scale capabilities... ...breaks with ample paid time off and observed holidays, and rest easy with life and...SeniorFull timePart timeWork experience placementWork at office
- A professional services organization in Virginia seeks a Senior Machine Learning Engineer to design and deploy machine learning systems central to its... ...TensorFlow. Responsibilities include developing scalable ML models and collaborating with teams to translate business...Senior
- ...Senior Data Scientist/ML Engineer Team CATHEXIS elevates the government contracting experience through rapid response, deep skill, and thoughtful problem-solving and communication. Our core capabilities are our top-tier program and project management, data analytics...Senior
- ...Job Title: Senior Big Data / Machine Learning Engineer Location: McLean, VA (Hybrid - 3 days onsite) Duration: 12 Months Job Description: We are looking for a Senior Big Data / ML Engineer to support a banking client's AML and Risk Technology team....Senior
- A leading tech firm in McLean, Virginia is seeking a Senior Machine Learning Engineer to design, train, and evaluate machine learning models for real... ...world applications. This role involves research in various ML approaches and implementing models using frameworks like...Senior
$229.9k - $262.4k
Senior Manager, Machine Learning Engineering As a Capital One Machine Learning Engineer (MLE), you'll be part of an Agile team dedicated to productionizing... .... In this role, you'll be expected to perform many ML engineering activities, including one or more of the following...SeniorFull timePart timeInternshipLocal area$286.2k - $326.7k
.... Director, Machine Learning Engineering (Remote-Eligible) Overview At... ...time, our applications of AI & ML bring humanity and simplicity... ...governance, and scalable observability. Guide the adoption of state‑... ...mentoring managers, tech leads, and senior engineers. Make high‑judgment...SeniorFull timePart timeLocal areaRemote workVisa sponsorship- ICA.ai is looking for a Senior Data Science/Machine Learning Engineer to solve complex problems and enhance AI capabilities. You will implement machine learning models using Python, collaborate with teams, and optimize algorithms on cloud platforms. Ideal candidates have...SeniorRemote job
$140k - $190k
Steampunk is seeking a Senior Machine Learning Engineer in McLean, Virginia, to collaborate with Data Scientists and Engineers on AI/ML technologies for Federal applications. The role demands a strong background in developing machine learning models, experience with frameworks...Senior- Cynnovative is seeking a Senior ML Engineer in Arlington, Virginia to develop and manage tools for LLM experimentation and deployment. You will design scalable LLM systems and ensure the reliability of these systems in production environments. The ideal candidate must...Senior
- ...Business consulting services. We are in search of a highly motivated candidate to join our talented Team. Job Title: Senior Data Engineer / AI ML Engineer with Python, AI/ML & LLMs. Location: Reston, VA ( Hybrid ) Job Summary: We are seeking a highly...SeniorRemote work
- ...Senior Ai/Ml Software Developer Team CATHEXIS elevates the government contracting experience through rapid response, deep skill, and... ...Software Developer will work alongside a team of highly skilled engineers in the development of AI applications. A motivated and...Senior
$286.2k - $326.7k
...description": "Sr. Distinguished Machine Learning Engineer (Remote-Eligible) Overview:... ...in real time, our applications of AI & ML are bringing humanity and simplicity to... ...systems, and scalable monitoring & observability. Invent and introduce state-of-the-...SeniorFull timePart timeLocal areaRemote workFlexible hours$161.8k - $184.6k
Senior Machine Learning Engineer As a Capital One Machine Learning Engineer (MLE), you'll be part of an Agile team dedicated to productionizing... .... In this role, you'll be expected to perform many ML engineering activities, including one or more of the following...SeniorFull timePart timeInternshipH1bLocal area- ...Description Senior AI/ML Operational Engineer Joint Base Anacostia, Arlington, VA, or Reston, VA Active TS/SCI with Poly Clearance required @Orchard is retained by a top geospatial technology company supporting important operations missions to U....Senior
- Capital One National Association is seeking a Senior Lead Machine Learning Engineer in McLean, Virginia. The ideal candidate will design and deliver machine... ..., Scala, or Java and a solid background in building ML systems. The position offers comprehensive benefits and opportunities...Senior
$229.9k - $262.4k
...Senior Lead Machine Learning Engineer (Intelligent Foundations and Experiences) As a Capital One Machine Learning Engineer (MLE) , you'll be part of... ...Engineering. In this role, you'll be expected to perform many ML engineering activities, including one or more of the...SeniorFull timePart timeInternshipLocal area$113k - $188k
...data analysis, preprocessing, and feature engineering to extract valuable insights from large... ...or PhD with 8+ years of experience in AI/ML. Expert in the field in designing novel... ...implementation and represents capabilities to senior stakeholders. Active Top Secret...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior ML Observability Engineer. Be the first to apply!
- senior manager customer operations Fairfax, VA
- senior data engineer Fairfax, VA
- senior manager clinical operations Fairfax, VA
- senior vmware engineer Fairfax, VA
- senior engineering technician Fairfax, VA
- sr project manager Fairfax, VA
- senior performance engineer Fairfax, VA
- senior software design engineer Fairfax, VA
- senior application security engineer Fairfax, VA
- senior technical designer Fairfax, VA


