AI Platform Reliability Engineer
$79.2k - $209.5kOracle
Job Description
Oracle Health is seeking an AI Platform Reliability Engineer to ensure our AI agent platform and AI-enabled analytics workflows are reliable, observable, measurable, and safe in production.
This role will focus on the operational foundation for production AI systems, including monitoring, tracing, evaluation in production, rollback controls, alerting, versioning, runtime diagnostics, and quality safeguards. The engineer will also support data reliability use cases such as detection of stopped processing, data gaps, freshness issues, schema drift, and anomaly conditions that affect downstream analytics and reporting.
The ideal candidate brings strong engineering discipline in observability, release safety, and operational tooling, with the ability to apply those skills to modern AI and agent-based systems. This role is critical to maintaining trust in AI outputs and ensuring new capabilities can scale safely across Oracle Health.
Responsibilities
Build and maintain observability, logging, tracing, and monitoring for AI agents, agent tools, and AI-enabled analytics workflows.
Implement release, rollout, rollback, and versioning controls for prompts, models, tools, and configurations.
Design and support production evaluation practices to detect regressions, silent failures, quality drift, and performance issues.
Contribute to data monitoring and reliability workflows, including detection of stopped processing, data gaps, freshness issues, schema drift, and anomalies.
Support incident response, triage, root-cause analysis, and operational reporting for AI and data reliability issues.
Partner with architects and AI engineers to ensure systems are production-ready, measurable, and maintainable.
Implement latency, throughput, and cost monitoring controls for AI-enabled systems.
Help enforce operational safeguards, auditability, and controlled deployment practices for enterprise AI platforms.
Disclaimer:
Certain U.S. based or U.S. customer or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates, and/or drug testing requirements.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $79,200 to $209,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
Medical, dental, and vision insurance, including expert medical opinion
Short term disability and long term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre-tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
11 paid holidays
Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal
Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing View email address on click.appcast.io or by calling View phone number on click.appcast.io in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$79.2k - $209.5k
...Job Description As a Senior Site Reliability Engineer, you will play a pivotal role in building... ...advancing automation, observability, and AI-assisted reliability practices. You... ...Technologies • Proficiency in Data Warehousing platforms (e.g., Vertica, Snowflake) •...SuggestedTemporary workFlexible hours$86.4k - $199.5k
...and system tuning. Responsibilities Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a... ...everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers...SuggestedTemporary workFlexible hours- ...Teradata Autonomous Knowledge Platform activates enterprise... ...delivers real business value with AI. What You’ll Do # Working... ...software solutions to ensure system reliability and availability, mitigate... ...# You will help lead chaos engineering efforts in a production-alike...SuggestedPermanent employmentFlexible hours
$218.03k - $256.5k
...us, every day, as we build the emerging onchain platform — and with it, the future global financial... ...What you’ll be doing (ie. job duties): ** ~*AI-Driven Innovation: *Join a high-performing team of skilled engineers driving AI transformation at Coinbase. This role...SuggestedLocal area$109.2k - $223.4k
...complexity increase, OCI depends on hardware platforms that are both innovative and deployable... ...Leadership Collaborate across OHD, engineering, operations, supply chain, and software... ...to life-saving care. And with AI embedded across our products and services...SuggestedTemporary workFlexible hours$186.07k - $225k
...every day, as we build the emerging onchain platform — and with it, the future global... ...for a Senior Machine Learning Platform Engineer to join our Machine Learning Platform team... ...the ability to responsibly use generative AI tools and copilots (e.g., LibreChat, Gemini...Local area$186.07k - $218.9k
...we build the emerging onchain platform — and with it, the future... ...the design, development, and reliability of core platform services that... ...management) Championing engineering standards, code and design review... ...responsibly use generative AI tools and copilots (e.g., LibreChat...Local area- ...better information. Teradata Autonomous Knowledge Platform activates enterprise intelligence by unifying... ...approach. Teradata delivers real business value with AI. What you will do We are looking for a mid-level engineer who will be responsible for delivering robust,...Permanent employmentFlexible hours
$97.5k - $199.5k
...Job Description Role Overview As a Reliability Engineer, you will apply data-driven analysis and engineering problem-solving to improve availability... ...from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers...Temporary workFlexible hours$120.1k - $251.6k
...Job Description Lead reliability assessments and improvement initiatives across electrical... ...Mentor less experienced reliability engineers and help raise the technical bar across... ...innovations to life-saving care. And with AI embedded across our products and services...Temporary workFlexible hours$97.5k - $199.5k
...suitability for data center builds. Acts as the engineering representative on a wide range of... ...team. Responsibilities Support reliability activities for critical electrical, mechanical... ...to life-saving care. And with AI embedded across our products and services...Temporary workFlexible hours$200k - $250k
...Datavant is the data collaboration platform trusted for healthcare.... ...medical records to powering the AI revolution in healthcare,... ...hands-on, deeply experienced engineering leader who can operate across... ...maintaining high standards for reliability, security, and scalability in...- ...Teradata Autonomous Knowledge Platform activates enterprise... ...delivers real business value with AI. What You'll Do At Teradata... .... As a member of our AI engineering team, you'll play a critical... ...evaluation layers that ensure reliability, accountability, and performance...Permanent employmentFlexible hours
$186.07k - $218.9k
...we build the emerging onchain platform — and with it, the future global... ...looking for a Senior Software Engineer to join the Payment Rails team... ...is fast, secure, and reliable, directly enabling millions of... ...to responsibly use generative AI tools and copilots (e.g., LibreChat...Local area$186.07k - $218.9k
...we build the emerging onchain platform — and with it, the future... ...core platform: the detection engine, invariant framework, and tooling... ...them in production. Build AI guardrails that close the... ...You’ve built financial, high reliability or security systems. Crypto...Local area$186.07k - $218.9k
...every day, as we build the emerging onchain platform — and with it, the future global... ...and powering end-user experiences. As an engineer on the team you will contribute to the full... ...the ability to responsibly use generative AI tools and copilots (e.g., LibreChat, Gemini...Local area$24 per hour
...DevOps Engineer Intern Company: Norstella Location: Remote, United States Date Posted... ...critical global life sciences data and AI solutions provider dedicated to improving... ...concepts Interest in DevOps, SRE, or platform engineering as a career path Salary: $...Hourly payInternshipLocal areaRemote work$120.1k - $251.6k
...delivery of our datacenters, Oracle is recruiting a Senior Mechanical Engineer. The role is a senior multi-disciplinary datacenter design lead... ...from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn...Contract workTemporary workFlexible hours$186.07k - $218.9k
...A leading cryptocurrency platform is seeking an experienced software engineer in Pierre, South Dakota. The role involves designing and developing core platform services for account and identity management across various products. Candidates should have a deep passion...$116.4k - $204.1k
...As a Lead DevOps Engineer, you will be responsible for leading the... ...maintaining and improving the reliability of our existing systems. You... ...deployment. Experience building platforms for Observability and... ...interviews without the assistance of AI tools or external prompts....Work at office$94.1k - $150k
...The Platform Engineer (Ops Technology Lead) is responsible for designing, implementing, and maintaining IT infrastructure platforms within the CASTLE-NET program, ensuring reliability, scalability, and security. This role supports application deployment and management,...Contract workWork at office$132.23k - $176.31k
...Lumen is the trusted network for AI. We're transforming how businesses connect, secure, and scale in an AI-driven world. By connecting... ...the future. The Role SAIC seeks a Lumen Network Design Engineer V (WAN / Work Package Engineer) to support the Department of the...Contract workTemporary workFor contractorsRemote work- ...Lumen is seeking a Senior Director of Architecture, Engineering & Visibility to lead the strategy and delivery of threat intelligence and security platforms. This position focuses on cloud transformation and oversees platform engineering operations to enhance threat detection...
$79.2k - $178.1k
...initiatives such as building new innovative platforms, high performance primitives, frameworks... ...data-planes. We are hoping to enhance engineering efficiency by concentrating our... ...innovations to life-saving care. And with AI embedded across our products and services...Temporary workLocal areaRemote workWorldwideFlexible hours$139.4k - $291.8k
...requires strong technical depth, sound engineering judgment, and the ability to lead teams... ...business objectives for capacity growth, reliability, speed to market, and operational excellence... ...to life-saving care. And with AI embedded across our products and services...Temporary workFlexible hours$200k
...Maximus is currently seeking an exciting opportunity for a Senior Director, AI Systems Engineering to join the Maximus AI Accelerator supporting the enterprise at large. We are looking for an accomplished hands-on technical leader and team player to be a part of the AI...Immediate startRemote workFlexible hours$116.4k - $204.1k
...Wolters Kluwer in Pierre, South Dakota is seeking a Lead Software Engineer to develop AI-powered cloud solutions for audit purposes. The successful candidate will have strong experience in Python, Java or C++, and AI systems. Responsibilities include building AI agent...$160.2k - $290.7k
...Automotive Electrical Architecture System Engineer to lead the end-to-end software... ...solutions that span across different vehicle platforms and product lines. Facilitate reuse... ...functional requirements such as performance, reliability, and scalability. Support testing...Local areaRemote workWork from homeRelocationRelocation packageFlexible hours$116.4k - $204.1k
...for a Lead Product Software Engineer - Cloud Operations to join I... ...the-enterprise that co-designs AI solutions with customers... ...clear visibility into system reliability, performance, and usage. Define... ...to leverage patterns and platforms built by InnovateHub. Requirements...Work at office$157.1k - $258.5k
...portfolios that push the boundaries of safe, reliable autonomous and assisted driving experiences.... ...We are seeking an experienced Staff Systems Engineer to lead systems engineering efforts for our embedded AV/ADAS platform. You will be responsible for defining and managing...Local areaWork from homeRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Platform Reliability Engineer. Be the first to apply!

