Principal Engineer - AI Networking
$99.6k - $234.6k
Oracle
Job Description
The ideal candidate is an experienced RDMA software engineer with a strong background in high-performance networking, distributed communication systems, and systems programming. You will work closely with senior technical leaders to design, implement, optimize, and operate critical networking infrastructure used by large-scale AI training and inference workloads.
This is a hands-on engineering role requiring deep technical expertise, strong software development skills, and a passion for solving complex performance and scalability challenges.
What You'll Bring
Strong software engineering fundamentals and systems programming expertise.
Deep interest in RDMA, high-performance networking, and distributed communication systems.
Ability to diagnose and solve complex performance and scalability problems.
Strong collaboration and communication skills in cross-functional engineering environments.
Ownership mindset with the ability to independently drive technical initiatives from design through production deployment.
Passion for building infrastructure that enables next-generation AI systems.
Key Responsibilities
Design, develop, and optimize RDMA-based software components and services for large-scale AI infrastructure.
Build and enhance collective communication frameworks, transport layers, and communication libraries used by distributed AI workloads.
Develop congestion management, load balancing, resiliency, and failover capabilities for RDMA-based networks.
Analyze and improve communication performance across networking, GPU, and software stacks.
Design and implement scalable distributed systems supporting AI training and inference environments.
Collaborate with networking, AI infrastructure, hardware, and cloud platform teams to deliver high-performance solutions.
Investigate and resolve complex networking, performance, and reliability issues in production environments.
Develop observability, telemetry, debugging, and performance analysis tools for distributed communication systems.
Contribute to architectural design discussions and technical direction for networking platforms.
Participate in code reviews and help maintain engineering excellence across the team.
Minimum Qualifications
Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related field; advanced degree preferred.
7+ years of software engineering experience in systems software, networking, distributed systems, or infrastructure platforms.
Strong hands-on expertise with RDMA technologies, including RoCEv2 and/or InfiniBand.
Experience developing RDMA-enabled software, communication libraries, networking services, or distributed infrastructure.
Strong understanding of RDMA programming concepts, including queue pairs, completion queues, memory registration, verbs, and transport semantics.
Proficiency in C/C++ and Linux systems programming.
Experience debugging and optimizing performance-critical software systems.
Solid understanding of networking fundamentals, operating systems, and distributed systems concepts.
Preferred Qualifications
Experience with collective communication frameworks and libraries such as NCCL, RCCL, MPI, UCX, UCC, XCCL, or similar technologies.
Experience supporting AI/ML infrastructure and distributed training environments.
Knowledge of GPUDirect RDMA and GPU-aware communication technologies.
Experience developing congestion management, traffic engineering, or network resiliency solutions.
Familiarity with large-scale GPU clusters and high-performance computing environments.
Experience building services and infrastructure operating directly over RDMA transports.
Familiarity with distributed training frameworks such as PyTorch, DeepSpeed, Megatron-LM, TensorFlow, or JAX.
Experience with Kubernetes, containers, and cloud infrastructure platforms.
Understanding of performance profiling and benchmarking tools for networking and distributed systems.
Disclaimer:
Certain U.S. based or U.S. customer or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates, and/or drug testing requirements.
Range and benefit information provided in this posting is specific to the stated locations only.
US: Hiring Range in USD from: $99,600 to $234,600 per annum. May be eligible for bonus, equity, and compensation deferral.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
Medical, dental, and vision insurance, including expert medical opinion
Short term disability and long term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre-tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
11 paid holidays
Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal
Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by email or by phone in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.


