Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal Member of Technical Staff, AI Infrastructure

$96.8k - $223.4k

Oracle

Job Description

Here at OCI we're building the world's largest AI clusters and we're the fastest at bringing them to the market. The AI Infrastructure organization at OCI is leading this effort by creating a GPU focused cloud with the latest hardware providing the best performance, efficiency, reliability, and scalability. This is your chance to be part of the AI revolution by creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. You will have the opportunity to work with cutting-edge technologies and make a significant impact on our organization's success.

We are looking for a highly skilled distributed systems engineer to scale and optimize AI infrastructure components like GPU control plane and GPU data plane that provide computing resources to customer AI workloads. You will provide technical leadership to the team and bring clarity to ambiguous problems and come up with innovative solutions. You will collaborate with cross-functional teams to enhance our AI infrastructure to deliver exceptional customer experience and peak performance.

Responsibilities

Responsibilities
  • Design and develop solutions to scale and optimize AI compute infrastructure components like GPU control plane and GPU data plane with the goal to optimize customer experience and customer workload performance on our AI infrastructure.
  • Develop "best-in-class" AI compute infrastructure for our customers by ensuring that the services and the components are well-defined and modularized, secure, reliable, diagnosable, actively monitored, compliant and reusable.
  • Collaborate with cross-functional teams, including development, operations, and product management, to understand their requirements and design innovative orchestration solutions.
  • Mentor junior developers and drive modern software engineering practices like leveraging data/telemetry to make decisions, well-defined interfaces across components, design reviews, coding standards, code reviews, and comprehensive coverage from unit test, integration test and active production monitoring.
  • Develop benchmark metrics and automation to drive and track performance and reliability across customer workload and lower infrastructure stack.
Qualifications & Skills
  • BS (or equivalent experience) in Computer Science, Engineering, or related field.
  • 6 years of experience in software development with programming languages including, but not limited to, C, C++, C#, Java, Go, Rust.
  • 3 years of experience designing and developing large-scale infrastructure, distributed systems, and services.
  • 1 year of experience providing technical leadership and clarity to cross-functional teams and projects while collaborating across stake holders.
  • Systematic problem-solving approach, strong communication skills, a sense of ownership, and drive.
  • Ability to adapt to a fast-paced, dynamic environment and manage multiple tasks and priorities effectively.
Preferred Qualifications
  • Experience in managing cloud infrastructure with hundreds of thousands of servers.
  • Experience in containerization technologies such as Docker and Kubernetes.
  • Experience in scheduling high-performance workloads on Kubernetes or Slurm.

Qualifications

Disclaimer:

Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

Range and benefit information provided in this posting are specific to the stated locations only

US: Hiring Range in USD from: $96,800 to $223,400 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4

About Us

Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.

True innovation starts when everyone is empowered to contribute. That's why we're committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing View email address on click.appcast.io or by calling View phone number on click.appcast.io in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Principal Member of Technical Staff, AI Infrastructure in Austin, TX vacancy
  • $96.8k - $223.4k

     ...systems, virtualized infrastructure, and highly available...  ...can have a significant technical and business impact. As a member of the software...  ...Responsibilities As a Principal Member of Technical Staff, you will own the software...  ...care. And with AI embedded across our products... 
    Principal
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    4 days ago
  •  ...puts the welfare of team members at the forefront." Maryna...  ...Join Our Team Agentic AI Engineering Intern Engineering...  ...required) Manager, Water Infrastructure Program - Data Centers...  ...Operations Burlington, TX Principal Member of Technical Staff, Agent Workflow Systems and... 
    Principal
    Internship
    Remote work
    Night shift

    SB Energy

    Austin, TX
    2 days ago
  •  ...Job Description: Job Title: Principal Member of Technical Staff, AI Operating Company : Environmental Solutions Group Location: Austin,TX Reports to: Senior Director, Business Process & Digital Systems Department: Information Technology POSITION... 
    Principal
    Permanent employment
    Local area
    Worldwide

    Terex

    Austin, TX
    1 day ago
  • $79.2k - $178.1k

     ...Engineer As a Senior Software Engineer within Oracle Cloud Infrastructure (OCI), you'll have the opportunity to solve large-scale, mission-critical engineering challenges with broad technical impact. Our AI Infrastructure Engineering team is building the services... 
    Suggested
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    2 days ago
  • At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We...  ...and build an imaging service for Large Scale Compute/HPC/AI/ML Customer Workloads and performance while providing strong guarantees... 
    Principal
    Full time
    Flexible hours

    Oracle

    Austin, TX
    4 days ago
  • $177k - $387k

     ...roadmap, and DRAM solutions for OEM customers. A Senior Member of the Technical Staff is an integral member of the CDBU. In this role you...  ...level performance across diverse workloads, including AI/ML infrastructure Ability to articulate strategy, engage customers,... 
    Full time
    Local area
    Immediate start

    Micron Technology

    Austin, TX
    22 hours ago
  • $79.2k - $178.1k

     ...Senior Engineer, Oracle Cloud Infrastructure's Compute Team Join Oracle Cloud Infrastructure...  ...of OCI's rapidly expanding AI infrastructure. The Compute Bare Metal...  .... Responsibilities As a Senior Member of Technical Staff, you will own the software design and... 
    Temporary work
    Worldwide
    Flexible hours

    Oracle

    Austin, TX
    2 days ago
  •  ...building the world’s largest AI clusters and we’re the fastest...  ...a quick learning ability. Technical Excellence: Rock‑solid developers...  ...and solutions to manage AI infrastructure of OCI. Write high quality...  ...Collaborate with other team members working on the same project to... 
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    16 hours ago
  • $79.2k - $178.1k

    Senior Member Technical Staff (JoinOCI-SDE) Job Description Oracle Cloud Infrastructure: Senior Software Engineer (IC3)Oracle’s Cloud Infrastructure (OCI) team is building the...  ...innovations to life‑saving care. And with AI embedded across our products and services, we... 
    Temporary work
    Flexible hours

    Ll Oefentherapie

    Austin, TX
    4 days ago
  • $96.8k - $251.6k

    Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting‑edge, ultra‑high‑performance...  ...technologies like RoCE and Infiniband. As a Consulting Member of Technical Staff, you will own the software design and development for... 
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    2 days ago
  • $100k - $115k

     ...00 - $115,000 USD + equity Associate Members of Technical Staff are generalist engineers who work across...  ...flow, the next it might be building infrastructure to scale partner deployments, the...  ...performs. How We Build We don't just build AI, we build with AI. Every engineer... 
    Full time
    Relocation

    Tecla

    Austin, TX
    16 hours ago
  •  ...Member of Technical Staff – Infrastructure & Engineering Location: Austin, TX Wind River is a global leader in delivering software for mission-critical intelligent systems. For more than four decades, the company has been an innovator and pioneer, powering billions... 
    Permanent employment
    Temporary work
    Local area
    Visa sponsorship
    Flexible hours

    Aptiv

    Austin, TX
    22 hours ago
  • $115.4k - $251.6k

     ...Health Applications & Infrastructure Oracle Health Applications...  ...Role As an Senior Principal Product Manager, you...  ...analytical thinking, technical fluency, and customer...  ..., data, AI, or analytics products...  ...Responsibilities As a member of the product development... 
    Principal
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    3 days ago
  • $145.6k - $211.87k

     ...CDW. Job Summary The Principal Solution Architect – Hybrid Infrastructure is a field-based technical pre-sales role that focuses on...  ...mentorship to junior team members. ~ A PSA must have a customer...  ...CDW is committed to being an AI-fluent organization We’re... 
    Principal
    Local area

    CDW

    Austin, TX
    3 days ago
  •  ...a highly experienced and technically proficient Principal Electrical Engineer to engage...  ...from generator systems to AI rack integration zones,...  ...distribution, phased power infrastructure, and controls architecture...  ...across the industry. Team members function as deeply technical... 
    Principal
    Contract work
    Remote work

    Plasticos Castella SA

    Austin, TX
    3 days ago
  •  ...DeepSpeed with deep expertise in distributed systems and large‑scale model training infrastructure Varun is an INFORMS Wagner Prize Finalist for his research in large‑scale driver navigation AI models and one of the top chess players in the US. Our team has built mission‑... 
    Immediate start

    Pear VC

    Austin, TX
    2 days ago
  •  ...Location: Remote About Us micro1 is the end-to-end human data infrastructure behind AGI. Our AI recruitment model is used by frontier AI labs and...  ...with a strong research or applied project portfolio. Solid technical foundation in Python and RL/simulation tools (e.g.,... 
    Full time
    Remote work

    micro1

    Austin, TX
    3 days ago
  •  ...DeepSpeed with deep expertise in distributed systems and large‑scale model training infrastructure Varun is an INFORMS Wagner Prize Finalist for his research in large‑scale driver navigation AI models and one of the top chess players in the US. Our team has built mission‑... 
    Worldwide

    Pear VC

    Austin, TX
    16 hours ago
  •  ...DeepSpeed with deep expertise in distributed systems and large‑scale model training infrastructure Varun is an INFORMS Wagner Prize Finalist for his research in large‑scale driver navigation AI models and one of the top chess players in the US. Our team has built mission‑... 

    Pear VC

    Austin, TX
    1 day ago
  • $99.6k - $223.4k

     ...AI Infrastructure Engineer OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront...  ...with a quick learning ability. Technical Excellence: Rock-solid developers and...  ...Memcache etc) Responsibilities As a member of the software engineering division,... 
    Principal
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    1 day ago
  • $180k

    Member of Technical Staff - Model Training ABOUT xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization... 
    Temporary work

    xAI

    Austin, TX
    2 days ago
  • $147k - $237.5k

     ...Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do and...  ...Job Summary We're looking for an Infrastructure Engineer to build developer tooling that...  ...feature velocity with long-term technical debt. Act as the "glue" across teams, consulting... 
    Principal
    Local area
    Remote work

    Palo Alto Networks

    Austin, TX
    3 days ago
  • $120.1k - $251.6k

     ...Job Description As Principal Program Manager, Network Infrastructure Deployment & Delivery, you will own the end-to-end execution of OCI's data center network...  ...industry innovations to life-saving care. And with AI embedded across our products and services, we help customers... 
    Principal
    Temporary work
    Flexible hours
    Night shift

    Oracle

    Austin, TX
    3 days ago
  •  ...Senior Principal Test Framework Software Engineer Austin...  ..., software and systems infrastructure that will unlock the next generation of AI breakthroughs and power...  ...Group, Graphcore is a member of an elite family of...  ...methodologies. Communicate technical status, risks, and... 
    Principal

    Graphcore

    Austin, TX
    5 days ago
  •  ...Senior Principal Network Engineer Austin, Texas, United...  ..., software and systems infrastructure that will unlock the next generation of AI breakthroughs and power...  ...Group, Graphcore is a member of an elite family of...  ...infrastructure. Provide technical leadership and mentorship... 
    Principal

    Graphcore

    Austin, TX
    4 days ago
  •  ...Architecture Title: IT Principal Architect, Enterprise...  ...architecture for an end-to-end member/patient servicing...  ...with platform, data, AI, and operations teams....  ...software engineering, infrastructure, or architecture ~...  ...to executive and technical audiences Enable adoption... 
    Principal
    Local area
    Work from home

    CarepathRx

    Austin, TX
    4 days ago
  • Summary Principal Structural Design Engineer is a hands-on individual...  ...Organization : Data Center Infrastructure Team Location : Austin, TX -...  ...Approval Support Act as the technical authority for regulatory,...  ...providers across the industry. Team members function as deeply technical... 
    Principal
    Remote work

    Plasticos Castella SA

    Austin, TX
    2 days ago
  •  ...Senior Principal Solutions Architect Shape the future...  ..., solution and/or technical knowledge, Dun & Bradstreet...  ...data, analytics, AI, and cloud technologies...  ...architecture, including cloud infrastructure and applications...  ...yourself or family members. Education assistance... 
    Principal
    Worldwide

    Dun & Bradstreet

    Austin, TX
    1 day ago
  • $196k - $364k

     ...DFT) IP business unit. As a member of the DFT R&D team, you will...  ...junior engineers and provide technical leadership across teams...  ...with automotive, industrial, AI, or hyperscaler-class IP development...  ...metrics, and scalable infrastructure The annual salary range... 
    Principal

    Cadence Inc

    Austin, TX
    4 days ago
  •  ...hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread...  ...SoftBank Group, Graphcore is a member of an elite family of companies...  .... This role combines technical depth in networking and optical... 
    Principal
    Flexible hours

    Graphcore

    Austin, TX
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Member of Technical Staff, AI Infrastructure. Be the first to apply!