Principal AI Agent / ML Software Engineer (OCI)

Oracle

Principal AI Agent / ML Software Engineer

The Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical leadership role responsible for defining, building, and operating next-generation AI systems on Oracle Cloud Infrastructure (OCI). This person will set architecture and engineering direction for production-grade agentic AI platforms, autonomous workflows, scalable inference infrastructure, and enterprise AI applications used in large-scale, business-critical environments.

This role requires a proven engineer who can translate ambiguous product and platform goals into durable technical strategy, lead multi-team execution without direct authority, and remain deeply hands-on in design, code, reviews, operations, and incident follow-up. The ideal candidate combines deep distributed systems experience with practical AI-native engineering, including orchestration of LLMs, tools, APIs, memory, retrieval, evaluation, guardrails, and cloud services. The expectation is to ship, scale, and operate reliable, secure, observable, and cost-aware AI platform systems while raising the technical bar for engineers across the organization.

Responsibilities
  • Serve as a senior technical owner for OCI AI platform capabilities, including agent execution, inference systems, model serving, AI workflow orchestration, evaluation, and observability.
  • Design, architect, and deliver scalable agentic AI systems capable of reasoning, planning, tool use, workflow execution, multi-step task orchestration, and safe human-in-the-loop escalation.
  • Build production-grade services for tool calling, agent memory, context management, Model Context Protocol (MCP) integration, vector retrieval, multi-agent coordination, policy enforcement, and evaluation.
  • Lead architecture across distributed services optimized for low latency, high throughput, GPU efficiency, reliability, cost, operability, and secure multi-tenant operation.
  • Define service boundaries, APIs, data models, state management, consistency tradeoffs, failure modes, SLIs/SLOs, rollout strategies, and operational readiness criteria for AI platform services.
  • Drive technical strategy across infrastructure, platform, security, data, and application engineering teams, converting broad goals into executable multi-quarter plans and measurable milestones.
  • Integrate AI agents securely and reliably with enterprise APIs, cloud services, databases, identity systems, secrets management, and external systems.
  • Establish AgentOps and LLMOps practices for tracing, monitoring, eval suites, regression testing, experimentation, safety guardrails, prompt/tool versioning, and production reliability.
  • Evaluate and operationalize emerging technologies in generative AI, agentic workflows, inference optimization, long-context systems, reasoning models, AI developer tooling, and agentic-first development.
  • Drive engineering excellence through code reviews, design reviews, test strategy, deployment automation, incident analysis, documentation, and AI-assisted development practices using tools such as Codex, Claude Code, Cursor, Copilot, or similar systems.
  • Mentor Staff and senior engineers, raise architectural standards, and influence engineering practices across OCI without requiring direct management authority.
  • Own critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability for the systems delivered.

Required Qualifications
  • Bachelor's, Master's, or Ph.D. in Computer Science, AI/ML, Engineering, or a related field, or equivalent practical experience.
  • 6-10+ years of professional software engineering experience, including significant ownership of production systems; or equivalent experience demonstrating Senior Staff / Principal-level impact.
  • Proven track record as a Staff, Senior Staff, Principal, or equivalent technical leader influencing architecture and execution across multiple teams.
  • Deep experience designing, building, and operating high-scale distributed systems, cloud services, infrastructure platforms, or AI/ML platform services.
  • Hands-on experience with production AI systems, agentic AI applications, autonomous workflows, tool-using agents, multi-step orchestration, or multi-agent systems.
  • Practical experience with orchestration frameworks such as LangGraph, LangChain, CrewAI, AutoGen, LlamaIndex, or similar ecosystems.
  • Deep understanding of LLM application patterns, including prompt design, structured outputs, function/tool calling, context management, RAG, memory, tool safety, and evaluation.
  • Strong programming skills in Python and ability to contribute high-quality production code, reviews, tests, and debugging in complex distributed environments.
  • Strong expertise with Kubernetes, Docker, cloud-native infrastructure, service-to-service communication, scalability, fault tolerance, observability, and performance analysis.
  • Experience defining SLIs/SLOs, production readiness criteria, incident response practices, monitoring, tracing, experiments, and reliability programs for AI or distributed systems.
  • Strong understanding of AI safety, governance, security, and operational risks for autonomous or semi-autonomous systems, including data handling, access control, auditability, and human accountability.
  • Excellent written and verbal communication, with demonstrated ability to lead technical direction, resolve ambiguity, and influence senior stakeholders.

Preferred Qualifications
  • Experience optimizing large-scale GPU inference or training workloads for latency, throughput, utilization, availability, and cost.
  • Experience building or operating model serving, inference gateways, agent runtimes, workflow engines, developer platforms, or internal AI productivity platforms.
  • Experience integrating AI systems with enterprise APIs, databases, cloud services, vector databases, embeddings, retrieval systems, identity systems, and policy enforcement layers.
  • Experience with LLM fine-tuning, long-context systems, reasoning models, model routing, caching, batching, quantization, or emerging generative AI research.
  • Experience building evaluation frameworks for agentic systems, including offline evals, online experiments, golden tasks, adversarial testing, regression gates, and observability dashboards.
  • Experience using AI-assisted software development tools such as Codex, Claude Code, Cursor, Copilot, or similar systems in large-scale engineering environments.
  • Track record of defining architectural standards, platform capabilities, or engineering practices adopted across multiple teams or organizations.
  • Experience in enterprise, cloud infrastructure, regulated, security-sensitive, or mission-critical environments.
Vacancy posted 2 days ago