Engineering Manager, Inference Benchmarking — AI Perf
$224k - $356.5k
NVIDIA
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s an outstanding legacy of innovation that’s fueled by great technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing: an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

NVIDIA’s open-source benchmarking platform, AIPerf, is the growing standard for assessing LLM serving performance across inference frameworks. Hyperscalers, cloud providers, and enterprises use AIPerf to inform decisions on production inference, including choosing GPUs, optimizing costs, reducing latency, improving efficiency, and scaling. As Technical Lead Manager, you will lead the engineering team within NVIDIA’s Dynamo organization. Your responsibility is to build and advance the platform so that AIPerf becomes the leading benchmarking tool for datacenter, local, and edge use cases. This spans LLM, multimodal, diffusion, and computer vision inference. The position combines hands-on leadership with expertise in systems engineering, inference infrastructure, and open-source communities. It has a direct effect on how AI performance is measured and pushed forward.

What you'll be doing:
- Driving the technical roadmap for AIPerf's core infrastructure: load generation, ZMQ-based microservices, GPU telemetry (DCGM/PyNVML), Prometheus metrics, statistical confidence intervals, and Kubernetes-native deployment.
- Taking ownership of the accuracy and statistical soundness of benchmark results that engineering groups throughout the industry depend on to inform production infrastructure decisions.
- Advising on upstream engine integrations involving vLLM, TRT-LLM, and SGLang in partnership with NVIDIA's Dynamo and NIM teams to maintain AIPerf's relevance across emerging hardware, workload categories, and inference configurations.
- Hiring, mentoring, and growing a team of senior engineers operating in a high-velocity open-source environment with active external contributors worldwide.

What we need to see:
- Bachelor's degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
- 8+ years overall of software engineering experience building performance-critical infrastructure, ML tooling, or distributed systems.
- 3+ years of engineering leadership experience as a tech lead, TLM, or engineering manager.
- Deep understanding of LLM inference mechanics (TTFT, ITL, KV caching, prefill/decode, speculative decoding) and the ability to reason about measurement correctness and reproducibility.
- Proven track record of collaborating across multi-functional groups and delivering production-quality output in high-velocity, high-external-visibility environments.

Ways to stand out from the crowd:
- Extensive experience with vLLM, TRT-LLM, or SGLang internals, along with contributions to their upstream projects.
- Experience building Kubernetes-native infrastructure, including operators, Helm charts, and GPU observability tooling (DCGM, dcgm-exporter, PyNVML).
- Background in competitive benchmarking frameworks such as MLPerf or equivalent industry-standard evaluation systems.
- History of leading or making meaningful contributions to active open-source projects with external communities.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 1, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.