AI Systems Performance Engineer
$141.3k - $226k
Broadcom Corporation
Job Description:
We are seeking a highly talented and experienced Senior AI Fabric Performance Engineer to take on a critical role within our Performance Lab. In this high-impact position, you will drive the performance benchmarking of AI inference, training, and storage workloads, with a focus on our network infrastructure. You will be responsible for generating reports that help customers with deployment and help the marketing team position the product.
While the AI workloads (inference and training) run on our servers, your primary focus will be optimizing the Ethernet fabric that connects them. You will be responsible for executing rigorous performance benchmarks, isolating complex system bottlenecks, and tuning parameters to achieve maximum throughput and minimum latency. If you possess a deep understanding of Ethernet fabric, machine learning system demands, and Linux environments, and you thrive on solving complex performance puzzles, we want you on our team.
Key Responsibilities
Benchmarking & Execution: Install, configure, and run industry-standard AI performance benchmarks, with a strong emphasis on MLPerf (Training and Inference) and NCCL tests.
Fabric Optimization: Tune and optimize network parameters, focusing heavily on Ethernet fabric performance, to ensure seamless data flow for distributed AI workloads running on server clusters.
Deep Debugging: Identify, isolate, and troubleshoot complex system performance bottlenecks spanning the Linux OS, server hardware, and Ethernet switches.
Automation Development: Design, develop, and implement robust performance testing frameworks and automation tools to streamline continuous benchmarking.
Cross-Functional Collaboration: Document test methodologies, communicate performance findings, and provide actionable improvement recommendations to hardware, software, and networking stakeholders.
Required Qualifications
Education: Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or a related technical field plus 12+ years of related industry experience, or a Master's degree in one of these fields plus 10+ years of related industry experience.
OS Expertise: Deep familiarity and hands-on experience with Linux operating systems, including system-level performance tuning and troubleshooting.
Programming Skills: Strong proficiency in programming and scripting languages, specifically Python and C++.
AI/ML Knowledge: Familiarity with modern machine learning frameworks, particularly PyTorch, and a solid understanding of how AI models consume compute and network resources.
Networking & Fabric: Proven experience in performance testing and validating Ethernet switch systems.
Analytical Capabilities: Extensive experience with performance metrics, profiling, and benchmarking tools. Strong problem-solving skills with a proven ability to diagnose root causes in complex, distributed systems.
Preferred Qualifications (Optional but recommended for a critical role)
Experience with RDMA (Remote Direct Memory Access) and RoCEv2 (RDMA over Converged Ethernet).
Prior experience building CI/CD pipelines for automated hardware or software performance regression testing.
Familiarity with containerization and orchestration tools (Docker, Kubernetes) used in AI deployments.
Additional Job Description:
Compensation and Benefits
The annual base salary range for this position is $141,300 - $226,000.
As a valued member of our team, you'll be eligible for a discretionary annual bonus and the opportunity to receive not only a competitive new hire equity grant, but also annual equity awards, connecting your success directly to the company's growth. All subject to relevant plan documents and award agreements.
Broadcom offers a competitive and comprehensive benefits package: Medical, dental and vision plans, 401(K) participation including company matching, Employee Stock Purchase Program (ESPP), Employee Assistance Program (EAP), company paid holidays, paid sick leave and vacation time. The company follows all applicable laws for Paid Family Leave and other leaves of absence.
Broadcom is proud to be an equal opportunity employer. We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law. We will also consider qualified applicants with arrest and conviction records consistent with local law.
If you are located outside the USA, please be sure to fill out a home address, as it will be used for future correspondence.
Welcome! Thank you for your interest in Broadcom!
We are a global technology leader that designs, develops and supplies a broad range of semiconductor and infrastructure software solutions.
For more information, please visit our video library and check out our Connected by Broadcom series.
Follow us on LinkedIn: Broadcom Inc.

