Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

CDAO Advana - Site Reliability Engineering Lead - Model Serving

Gdit

Public Trust: None
Requisition Type: Pipeline
Your Impact

Own your opportunity to support our nation's defense. Make an impact by connecting and securing critical operations across the globe, keeping our country safe and secure.

Job Description

Join GDIT and be a part of the team of men and women that solve some of the world's most complex technical challenges. The CDAO Advana team is seeking an Site Reliability Engineer - Model Serving, to join their efforts in the DC area.

Advana is the Chief Digital and Artificial Intelligence Office’s (CDAO) enterprise-wide, multi-domain data, analytics, and artificial intelligence (AI) platform that provides all DoW military and civilian decision makers, analysts, and builders with unprecedented access to enterprise data, tools, and capabilities.

This is a proposal with award expected June 2026. If interested, please apply as we are interviewing and making contingent offers now.

Duties include:

Site Reliability Engineering Lead - Model Serving SME - Owns production reliability strategy for artificial intelligence and machine learning model serving across Advana enclaves supporting Department of Defense missions, Joint Staff analysts, Combatant Command elements, and Senior Executive Service leadership.

  • Defines service‑level objectives, alerting philosophy, operational runbooks, and release safety patterns governing production deployment of model artifacts across multiple security domains.
  • Establishes reliability governance across serving surfaces by developing operational standards, on‑call expectations, escalation pathways, and incident response patterns aligned with enterprise DevSecOps practices.
  • Implements reliability engineering methodologies using Kubernetes, Prometheus, Grafana, Elastic Stack, GitLab Continuous Integration, VMware environments, and hardened deployment pipelines to maintain operational stability, mission assurance posture, and cross‑domain readiness.
  • Develops automated reliability checks integrated into deployment workflows to validate performance, latency, availability, and operational suitability of production‑ready models.
  • Leads coordination with Platform One, Cloud One, multi‑national engineering teams, and cross‑service mission partners to align reliability strategy with evolving architectures, security requirements, and mission priorities.
  • Produces mission‑critical deliverables including service‑level objective documentation, alerting configurations, operational runbooks, reliability scorecards, incident post‑action reports, and release safety assessments.
  • Strengthens program value by advancing operational readiness, reducing mission risk, and reinforcing deployment consistency across all enclaves. Supports Tier‑4 incident response actions by maintaining authoritative reliability artifacts required for rapid triage, operational continuity, and sustained mission performance.

Basic Qualifications:

  • BS degree; additional years of experience may be considered in lieu of degree
  • 8+ years of experience with production reliability strategy for artificial intelligence and machine learning model serving
  • IAT II - Security+
  • TS with SCI eligibility

WHAT CAN GDIT OFFER YOU?

  • Excellent customizable health benefits (Medical, Dental and Vision)
  • 401K with company match
  • Educational Assistance and eLearning
  • Flexible work week
  • Internal mobility team dedicated to employee advancement
  • Rewards and Recognition programs
  • Innovative and collaborative environment encouraging of highly motivated critical thinking

Work Requirements

Years of Experience

8 + years of related experience

* may vary based on technical training, certification(s), or degree

Certification

CompTIA Security+ CE | CompTIA - CompTIA

Travel Required

None

Citizenship

U.S. Citizenship Required

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the CDAO Advana - Site Reliability Engineering Lead - Model Serving in Washington DC vacancy
  • $169.6k - $229.46k

     ...Operations Job Qualifications: Skills: Model Serving, Reliability Management, Reliability...  ...technical challenges. The CDAO Advana team is seeking an Site Reliability Engineer - Model Serving, to join their...  ...Site Reliability Engineering Lead - Model Serving SME - Owns production... 
    Suggested
    Full time
    Temporary work
    Work at office
    Immediate start
    Remote work
    Worldwide
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    1 day ago
  • $139.4k - $191.9k

    CDAO - Enterprise - Cloud Engineering Lead Application Deadline: 3 June 2026 Department: Enterprise - Frontier AI Employment Type: Full...  ...infrastructure required to train, fine‑tune, and serve large‑scale Generative AI models. You will lead a team of expert engineers in... 
    Suggested
    Full time
    Work at office
    Trial period
    Relocation package
    Afternoon shift

    Office of the Under Secretary of War for Research and Engine...

    Arlington, VA
    4 days ago
  •  ...of the world’s most complex technical challenges. The CDAO Advana team is seeking an DevSecOps Engineer to join their efforts in the DC area. Advana is the...  ...integration, staging, and production environments. Leads continuous pipeline development, automation scripting... 
    Suggested
    Work at office
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    2 days ago
  •  ...most complex technical challenges. The CDAO Advana team is seeking an API Specialist to join...  .... Coordinates with program leadership, engineers, and mission stakeholders to validate...  ...execution, data integrity, and enterprise reliability. Basic Qualifications: BS degree;... 
    Suggested
    Work at office
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    1 day ago
  •  ...complex technical challenges. The CDAO Advana team is seeking an AWS/...  ...DevSecOps factory components. Leads cloud transformation initiatives...  ...role-based access control models, encryption key management, SIEM...  .... Delivers expert training, engineering playbooks, and modernization... 
    Suggested
    Work at office
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    2 days ago
  • $111.16k - $150.39k

     ...Infrastructure and Operations Skills: Engineering Systems,Systems Design,...  ...technical challenges. The CDAO Advana team is seeking a Systems...  ...offers now. Duties include: Leads complex systems engineering...  ...-analysis evaluations using modeling tools, interface control documentation... 
    Temporary work
    Work at office
    Immediate start
    Worldwide
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    2 days ago
  • $128.04k - $173.23k

     ...Requirements Development, Requirements Management, Software Systems Engineering, Systems Design Experience: 8 + years of related experience...  ...some of the world's most complex technical challenges. The CDAO Advana team is seeking a Systems Engineer to join their efforts in... 
    Temporary work
    Work at office
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    1 day ago
  •  ...are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward...  ...—bridging the gap between model development and production-...  ...AI Infrastructure Model Serving Reliability: Ensure the high...  ...responder in on-call rotations, leading the technical resolution of... 
    Local area

    Tiger Analytics Inc.

    Washington DC
    5 days ago
  • General Dynamics Information Technology is seeking a Systems Engineer for the CDAO Advana team in Washington, DC. The ideal candidate will have over 8 years of experience in systems engineering and a Top Secret clearance. This role includes executing engineering activities... 
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    1 day ago
  • Senior Site Reliability Engineer Job Description Overview CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real estate information, analytics...  ...challenges, optimizing systems that serve billions of transactions, and shaping infrastructure... 
    Full time
    Work at office
    Work from home
    Monday to Thursday

    Visual Lease

    Arlington, VA
    2 days ago
  •  ...Senior Site Reliability Engineer - Network Operations (Remote) Fastly 15 August 2025 SRE DevOps...  ...performance, and security of our network, which serves a significant portion of the content...  ...’s global network. * Respond to and lead network incidents, resolving complex... 
    Local area
    Remote work
    Flexible hours
    Night shift

    Fastly

    Bethesda, MD
    3 days ago
  •  ...The Role We're seeking a Fintech Engineering Lead who has directly leverable B2C banking...  ...practices and building a highly performant, reliable, and secure app from the ground up....  ...broken out for performance. Backend serves a REST API that powers our iOS, Android... 
    Remote work
    Flexible hours

    AHU Technologies Inc

    Washington DC
    22 days ago
  •  ...company, is seeking a Systems Engineering Lead to support KTS and our...  ...Systems Engineering Lead will serve as the principal systems engineering...  ...system lifecycle using V-Model and Agile methodologies...  ...engineering Knowledge of reliability, availability, and maintainability... 
    Local area
    Flexible hours

    Koniag

    Washington DC
    5 days ago
  • $111.16k - $150.39k

     ...Qualifications: Skills: Engineering Systems, Systems Design,...  ...technical challenges. The CDAO Advana team is seeking a Systems...  ...now. Duties include: Leads complex systems engineering...  ...-analysis evaluations using modeling tools, interface control documentation... 
    Full time
    Temporary work
    Part time
    Work at office
    Immediate start
    Remote work
    Worldwide
    Flexible hours

    GDIT

    Washington DC
    27 days ago
  • $174k - $236k

     ...role We're hiring a hands-on engineering leader to own both the people...  ...culture that is organized, reliable, and focused on impact...  ...formal people manager or tech lead with direct reports — including...  ...workloads (compute, storage, serving patterns) in AWS Location:... 
    Work at office
    Remote work
    Flexible hours

    Koalafi

    Arlington, VA
    7 days ago
  • $145k - $155k

     ...year Work Location: Hybrid. 4 days/week on site in Washington, DC Lead detection engineering activities supporting cybersecurity monitoring and...  ...candidates at this time. In 1994 Gunnison began serving the greater Washington, D.C. metro area, focused on... 
    Full time
    Contract work
    Flexible hours

    Gunnison, CO

    Washington DC
    4 days ago
  • General Dynamics Information Technology is seeking a DevSecOps Engineer to join the CDAO Advana team in Washington, DC. The role involves executing...  ...modernization, optimizing software-factory pipelines, and leading incident response activities. Ideal candidates will have... 
    Flexible hours

    General Dynamics Information Technology

    Washington DC
    1 day ago
  • $145.2k - $252.48k

     ...Responsibilities We are seeking an expert Senior Model-Based Systems Engineer (MBSE) to lead the modeling and digital transformation of a critical Navy...  ...architectures and comprehensive requirements baselines. Serve as a subject matter expert in MBSE methodologies,... 
    Hourly pay
    Contract work
    Temporary work
    For contractors
    Work experience placement
    Remote work

    Arcfield

    Washington DC
    3 days ago
  • $155k - $235k

     ...Senior Model Based Systems Engineer Arlington, VA, Mountain View, CA, San Diego, CA We're a combat...  ...you develop. What You'll Do Serve as system-level SE: derive and allocate...  ...in Cameo and Jama, or like-systems. Lead functional hazard analyses (FHA), system... 
    Full time
    Work experience placement
    Local area
    Relocation package

    Atropos Inc

    Arlington, VA
    1 day ago
  • Mid-Atlantic Transit Engineering Lead - ( 180793 ) At HDR, our employee-owners are fully engaged in creating a welcoming environment where...  ...trust and connects us closer to the clients and communities we serve. Our Commitment As employee owners, we all have a role in... 
    Full time
    Contract work
    Local area

    Fashion Institute of Design & Merchandising

    Washington DC
    3 days ago
  •  ...Pentagon, DC. Oversees the Systems Engineering staff and activities of an...  ...and software performance and reliability specifications and...  ...supervisor or team or project lead, and a minimum of two (2) years...  ...Eight (8) years' experience serving as a key advisor to senior leadership... 
    Contract work
    Flexible hours

    Joint Research and Development

    Washington DC
    4 hours ago
  • $120k

     ...digital transformation and IT programs, allowing us to better serve our customers through scale and repeatability. Your Next...  ...Awaits! Leidos is looking for a Unified Endpoint Management - Engineering Lead to support a large program within the Department of Justice.... 
    Local area

    Via Logic LLC

    Washington DC
    4 days ago
  •  ...Engineering Lead, Security Operations At Anchorage Digital, we are building the world's most advanced digital asset platform for institutions...  ...chartered crypto bank in the U.S., Anchorage Digital also serves institutions through Anchorage Digital Singapore, Porto by... 
    Full time

    Anchorage Digital

    Washington DC
    2 days ago
  • $145k - $200k

     ...Palantir builds the world’s leading software for data-...  ...Role We are a software engineering team with expertise in enabling ML models in production. We deploy...  ..., Python and Go Model serving engines for GPU-...  ...enabling fast, secure, and reliable model rollouts across on... 
    Work experience placement
    Work at office
    Remote work
    Work from home
    Relocation package

    Palantir

    Washington DC
    1 day ago
  • $140k - $220k

     ...Intelligence Community through advanced engineering, digital transformation, and...  ...AI/ML Engineer to serve as a technical leader, architect...  ...is equally comfortable writing model code, standing up ML pipelines...  ...simulation, or operational systems. Lead the full AI/ML lifecycle —... 

    Frontier Technology Inc.

    Washington DC
    2 days ago
  • $130k - $200k

     ...over 50,000 planners, designers, engineers, scientists, digital...  ...Civil Engineering Discipline Lead / Project Manager position. In...  ...be limited to, the following: Serves as lead engineer on mid- to large...  ...Transportation Work Location Model: Hybrid Operating Group: Americas... 
    Full time
    Work at office
    Local area
    Worldwide
    Flexible hours

    AECOM

    Arlington, VA
    1 day ago
  • $130k - $200k

     ...over 50,000 planners, designers, engineers, scientists, digital...  ...Civil Engineering Discipline Lead / Project Manager position. In...  ...limited to, the following: Serves as lead engineer on mid- to large...  ...Transportation Work Location Model: Hybrid Operating Group: Americas... 
    Full time
    Work at office
    Local area
    Worldwide
    Relocation package
    Flexible hours

    AECOM

    Arlington, VA
    2 days ago
  • $85k - $200k

     ...founded on innovation and growth. The Model Validation Director is expected to lead and execute model validation...  ...AML and Sanctions models, on and off site, issue written reports and...  ...to‑day compliance requirements; Serve as subject matter expert to customers... 
    Remote work

    Ankura

    Washington DC
    3 days ago
  • $71.73 - $85.73 per hour

     ...incidents and service requests per defined SLAs. Serve as liaison with client stakeholders,...  ...system capabilities with business needs. Lead and coordinate with offshore support teams...  ...Epic Clarity Certification (Clinical Data Model). Experience with ITSM tools such as... 
    Hourly pay

    Accenture

    Arlington, VA
    4 days ago
  • $65k - $179.4k

     ...greatest differentiator and competitive advantage in the markets we serve. We are all united in delivering the best experience for our...  ...contribute to the company's success. As a Quantitative Analytics and Model Consultant within PNC's Market Risk Oversight organization, you... 
    Temporary work
    Work experience placement
    Work at office

    Fairygodboss

    Washington DC
    14 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to CDAO Advana - Site Reliability Engineering Lead - Model Serving. Be the first to apply!