Principal AI Agent / ML Software Engineer (OCI)
Oracle
The Principal AI Agent / ML Software Engineer is a Senior Staff-level, hands-on technical leadership role responsible for defining, building, and operating next-generation AI systems on Oracle Cloud Infrastructure (OCI). This person will set architecture and engineering direction for production-grade agentic AI platforms, autonomous workflows, scalable inference infrastructure, and enterprise AI applications used in large-scale, business-critical environments.
This role requires a proven engineer who can translate ambiguous product and platform goals into durable technical strategy, lead multi-team execution without direct authority, and remain deeply hands-on in design, code, reviews, operations, and incident follow-up. The ideal candidate combines deep distributed systems experience with practical AI-native engineering, including orchestration of LLMs, tools, APIs, memory, retrieval, evaluation, guardrails, and cloud services. The expectation is to ship, scale, and operate reliable, secure, observable, and cost-aware AI platform systems while raising the technical bar for engineers across the organization.
Responsibilities
- Serve as a senior technical owner for OCI AI platform capabilities, including agent execution, inference systems, model serving, AI workflow orchestration, evaluation, and observability.
- Design, architect, and deliver scalable agentic AI systems capable of reasoning, planning, tool use, workflow execution, multi-step task orchestration, and safe human-in-the-loop escalation.
- Build production-grade services for tool calling, agent memory, context management, Model Context Protocol (MCP) integration, vector retrieval, multi-agent coordination, policy enforcement, and evaluation.
- Lead architecture across distributed services optimized for low latency, high throughput, GPU efficiency, reliability, cost, operability, and secure multi-tenant operation.
- Define service boundaries, APIs, data models, state management, consistency tradeoffs, failure modes, SLIs/SLOs, rollout strategies, and operational readiness criteria for AI platform services.
- Drive technical strategy across infrastructure, platform, security, data, and application engineering teams, converting broad goals into executable multi-quarter plans and measurable milestones.
- Integrate AI agents securely and reliably with enterprise APIs, cloud services, databases, identity systems, secrets management, and external systems.
- Establish AgentOps and LLMOps practices for tracing, monitoring, eval suites, regression testing, experimentation, safety guardrails, prompt/tool versioning, and production reliability.
- Evaluate and operationalize emerging technologies in generative AI, agentic workflows, inference optimization, long-context systems, reasoning models, AI developer tooling, and agentic-first development.
- Drive engineering excellence through code reviews, design reviews, test strategy, deployment automation, incident analysis, documentation, and AI-assisted development practices using tools such as Codex, Claude Code, Cursor, Copilot, or similar systems.
- Mentor Staff and senior engineers, raise architectural standards, and influence engineering practices across OCI without requiring direct management authority.
- Own critical production outcomes, including reliability, performance, security posture, cost efficiency, and supportability for the systems delivered.
Required Qualifications
- Bachelor's, Master's, or Ph.D. in Computer Science, AI/ML, Engineering, or a related field, or equivalent practical experience.
- 6-10+ years of professional software engineering experience, including significant ownership of production systems; or equivalent experience demonstrating Senior Staff / Principal-level impact.
- Proven track record as a Staff, Senior Staff, Principal, or equivalent technical leader influencing architecture and execution across multiple teams.
- Deep experience designing, building, and operating high-scale distributed systems, cloud services, infrastructure platforms, or AI/ML platform services.
- Hands-on experience with production AI systems, agentic AI applications, autonomous workflows, tool-using agents, multi-step orchestration, or multi-agent systems.
- Practical experience with orchestration frameworks such as LangGraph, LangChain, CrewAI, AutoGen, LlamaIndex, or similar ecosystems.
- Deep understanding of LLM application patterns, including prompt design, structured outputs, function/tool calling, context management, RAG, memory, tool safety, and evaluation.
- Strong programming skills in Python and ability to contribute high-quality production code, reviews, tests, and debugging in complex distributed environments.
- Strong expertise with Kubernetes, Docker, cloud-native infrastructure, service-to-service communication, scalability, fault tolerance, observability, and performance analysis.
- Experience defining SLIs/SLOs, production readiness criteria, incident response practices, monitoring, tracing, experiments, and reliability programs for AI or distributed systems.
- Strong understanding of AI safety, governance, security, and operational risks for autonomous or semi-autonomous systems, including data handling, access control, auditability, and human accountability.
- Excellent written and verbal communication, with demonstrated ability to lead technical direction, resolve ambiguity, and influence senior stakeholders.
Preferred Qualifications
- Experience optimizing large-scale GPU inference or training workloads for latency, throughput, utilization, availability, and cost.
- Experience building or operating model serving, inference gateways, agent runtimes, workflow engines, developer platforms, or internal AI productivity platforms.
- Experience integrating AI systems with enterprise APIs, databases, cloud services, vector databases, embeddings, retrieval systems, identity systems, and policy enforcement layers.
- Experience with LLM fine-tuning, long-context systems, reasoning models, model routing, caching, batching, quantization, or emerging generative AI research.
- Experience building evaluation frameworks for agentic systems, including offline evals, online experiments, golden tasks, adversarial testing, regression gates, and observability dashboards.
- Experience using AI-assisted software development tools such as Codex, Claude Code, Cursor, Copilot, or similar systems in large-scale engineering environments.
- Track record of defining architectural standards, platform capabilities, or engineering practices adopted across multiple teams or organizations.
- Experience in enterprise, cloud infrastructure, regulated, security-sensitive, or mission-critical environments.
$99.6k - $234.6k

