Engineering Manager, Inference Benchmarking — AI Perf

$224k - $356.5k

Full-time

NVIDIA

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s an outstanding legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. NVIDIA’s open-source benchmarking platform, AIPerf, is the growing standard for assessing LLM serving performance across various inference frameworks. Hyperscalers, cloud providers, and enterprises use AIPerf to inform decisions on production inference. This includes choosing GPUs, optimizing costs, reducing latency, improving efficiency, and scaling. As Technical Lead Manager, you will lead the engineering team within NVIDIA’s Dynamo organization. Your responsibility is to build and advance the platform so AIPerf becomes the leading benchmarking tool for datacenter, local, and edge use cases. This span LLM, multimodal, diffusion, and computer vision inference. This position combines hands-on leadership with expertise in systems engineering, inference infrastructure, and open-source communities. It has a direct effect on how AI performance is measured and pushed forward. What you'll be doing: Driving the technical roadmap for AIPerf's core infrastructure: load generation, ZMQ-based microservices, GPU telemetry (DCGM/PyNVML, Prometheus metrics, statistical confidence intervals, and Kubernetes-native deployment. Taking ownership for the accuracy and statistical soundness of benchmark results that engineering groups throughout the industry depend on to inform production infrastructure decisions. Advising upstream engine integrations involving vLLM, TRT-LLM, and SGLang in partnership with NVIDIA's Dynamo and NIM teams to maintain AIPerf's relevance across emerging hardware, workload categories, and inference configurations. Hiring, mentoring, and growing a team of senior engineers operating in a high-velocity open-source environment with active external contributors worldwide. What we need to see: Bachelor's degree in Computer Science, Electrical Engineering, or related field, or equivalent experience. 8+ overall years of software engineering experience building performance-critical infrastructure, ML tooling, or distributed systems. 3+ years of engineering leadership experience as a tech lead, TLM, or engineering manager. Deep understanding of LLM inference mechanics — TTFT, ITL, KV caching, Prefill/Decode, speculative decoding — and the ability to reason about measurement correctness and reproducibility. Proven track record of collaborating across multi-functional groups and delivering production-quality output in high-velocity, high-external-visibility environments. Ways to stand out from the crowd: Extensive experience with vLLM, TRT-LLM or SGLang internals along with contributions to their upstream projects. Experience building Kubernetes-native infrastructure including operators, Helm charts, and GPU observability tooling (DCGM, dcgm-exporter, PyNVML). Background in competitive benchmarking frameworks such as MLPerf or equivalent industry-standard evaluation systems. History leading or making meaningful contributions to active open-source projects with external communities. Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until June 1, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry. Learn more about NVIDIA.

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Engineering Manager, Inference Benchmarking — AI Perf in Washington State vacancy

Senior Engineering Manager, AI
$228.4k - $289.2k
...advance the state of AI for highvolume, realtime... ...and Cisco's global engineering capabilities. Our work... ...a Senior Engineering Manager, AI , you will lead a... ...for experimentation, benchmarking, model evaluation, observability... ...distributed training, inference, tool integration, and...
Suggested
Full time
Temporary work
Local area
Flexible hours
Cisco
Seattle, WA
2 days ago
Principal Rust Engineer - ML Infrastructure
...Principal Rust Engineer - ML Infrastructure (AI Training) About the Role What if your deep expertise... ...~ Deep understanding of memory management and zero-copy deserialization with... ..., model training pipelines, or benchmarking infrastructure Experience building...
Suggested
Hourly pay
Ongoing contract
Contract work
Freelance
Remote work
Flexible hours
Alignerr
Seattle, WA
4 days ago
Principal Machine Learning Engineer
$291.5k - $369.1k
...hybrid, multi‑cloud environments. Join the AI Models team at Splunk, where we advance... ...excellence of Splunk and Cisco’s global engineering capabilities. Our work spans networking,... ..., distributed training pipelines, and inference efficiency to minimize cost and latency...
Suggested
Full time
Temporary work
Local area
Flexible hours
Cisco
Bellevue, WA
4 days ago
Senior Manager, Machine Learning Engineering
$200k - $250k
...Metropolis is seeking a Senior Manager of Machine Learning Engineering within the Advanced Technologies Group... ...systems that power our next generation of AI. You will oversee 4 critical pillars... ..., model registries, and low-latency inference services. Ensure high availability...
Suggested
Temporary work
Work at office
Local area
Metropolis Corp
Seattle, WA
4 days ago
Principal Data Engineer
$185k - $220k
...About Curative AI, Inc. Curative AI, Inc. is an ambitious... ...our customers in Revenue Cycle Management (RCM) and Clinical Operations... ...experienced Principal Data Engineer for our rapidly growing company... ...data available for training, inference, and real-time AI agents....
Suggested
Full time
H1b
Curative AI, Inc.
Bellevue, WA
1 day ago
Principal Data Engineer
$197.3k - $313.7k
...Salesforce Salesforce is the #1 AI CRM, where humans with agents... ...data modeler to build and manage the data model(s) for our... ...workloads, including feature engineering for ML models and real-time... ...training pipelines, and real-time inference. ~ A proven track record...
Work at office
Salesforce.Com Inc
Seattle, WA
1 day ago
Principal Solutions Engineer (US West Remote)
$60k
...Principal Solutions Engineer (US West Remote) Join to apply for... ...West Remote) role at Jobright.ai Principal Solutions Engineer... ...feedback into Product Management based on field engagements to... ...interviewing at Jobright.ai by 2x Inferred from the description for this...
Full time
Remote work
jobright.com
Seattle, WA
6 days ago
Remote Senior AI Engineering Manager
$240k - $280k
...A leading cybersecurity firm is looking for a Senior Manager, AI Engineering, to lead AI engineering teams focused on reducing cyber risk. The role involves guiding Engineering Managers and senior engineers while collaborating cross-functionally. Candidates should have...
Remote work
Flexible hours
HackerOne
Seattle, WA
1 day ago
Security Engineering Manager
$188.2k - $325.5k
...Security Engineering Manager We are the Apple Services Engineering (ASE) Security team. We build the secure systems and infrastructure that... ...organizational boundaries without direct authority Familiarity with AI-assisted development practices and how they're changing...
Relocation
Apple
Seattle, WA
3 days ago
Principal AI Engineer
...****10+ years of experience in software engineering, with significant experience in distributed... ....***** ****Experience with Vertex AI, Gemini APIs, OpenAI APIs, or similar enterprise... ...**** ****Knowledge of cost modeling and inference optimization techniques.***** ****...
Work at office
Local area
Remote work
Work from home
F5 Networks
Seattle, WA
1 day ago
Principal Engineer, AI Factory
$206.5k
...worldwide, helping them modernize through shared AI expertise and operational discipline. The... ...experienced and hands-on Principal Engineer to drive the engineering and execution... ...) using Terraform for provisioning and managing large-scale cloud environments. Deep expertise...
Permanent employment
Worldwide
Banyan Software
Seattle, WA
3 days ago
Principal Engineer, Inference
$206k - $303k
...CoreWeave is the AI Hyperscaler™, delivering a cloud platform... ...We’re seeking a Principal Engineer to serve as the hands-on technical... ...for our next-generation Inference Platform . As a senior individual... ..., and performance benchmarks across gRPC/ CUDA Graphs, and...
Permanent employment
Temporary work
Casual work
Work at office
Remote work
Flexible hours
Shift work
CoreWeave
Bellevue, WA
more than 2 months ago
Founding Head of Engineering, Agentic AI
...Founding Head of Engineering, Agentic AI About the Company Reputable legal technology (LegalTech) company using AI to provide affordable... ...Head of Engineering will also lead a team, including hiring, managing, and scaling the AI engineering team, and will be instrumental...
Confidential
Seattle, WA
5 days ago
Principal, Senior Principal and Distinguished Engineer, AWS Agentic AI
$200.1k - $270.6k
...Agentic AI drives innovation at the forefront of artificial intelligence, enabling customers... ..., applied scientists, software engineers, and solution architects who work backwards... ...solving intrinsically hard problems in context management, state coordination, and autonomous...
Worldwide
Flexible hours
Shift work
Amazon
Seattle, WA
4 days ago
Machine Safety Automation Technical Program Manager, Global Safety Engineering
$141.3k - $191.1k
...solutions, leveraging automation and AI-driven predictive modeling to... ...Automation Technical Program Manager (TPM) to lead the development... ...This role shapes how data, engineering, and operations work together... ...the industry- leading benchmark for workplace health and...
Flexible hours
Shift work
Amazon
Bellevue, WA
2 days ago
Manager, Software Engineering
...of the hardest challenges in AI infrastructure today: enabling... ..., model training, and scaled inference spanning thousands of GPUs,... ...leader building a high-performing engineering team. In this role, you... ...experience leading, managing, growing and coaching a team...
Temporary work
H1b
Work at office
Flexible hours
Union
Bellevue, WA
2 days ago
Security Engineering Manager, Security Analytics and AI Research
$175.1k - $236.9k
...Amazon Web Services is looking for an experienced Security Engineering Manager to join the Security Analytics and AI Research group within AWS Security Services. This group is entrusted with researching and developing core detection and machine learning algorithms for...
Flexible hours
Amazon
Seattle, WA
2 days ago
Tech Lead / Engineering Manager
...About Aurelian Aurelian builds AI tools that help 911 centers handle more with... ...and CORA makes every emergency call more manageable for the people handling it. Aurelian... ...Join Us We're hiring a Tech Lead / Engineering Manager to lead a small pod and own a key...
Full time
Live in
Work at office
Relocation package
Aurelian
Seattle, WA
5 days ago
AI-Driven Backend Engineering Manager
...A global consulting firm is seeking a Manager in Application Design and Development to lead projects ensuring quality and risk management. The role requires deep technical skills in software engineering, with qualifications including a Bachelor's degree and 4-6 years...
Flexible hours
Ernst & Young Oman
Olympia, WA
2 days ago
Principal Security Engineering Manager
$142.8k - $274.8k
...Overview The Cloud & AI organization accelerates Microsoft's mission and bold ambitions to ensure that our company and... ...impact on service continuity and trust. The Principal Security Engineering Manager role leads a team responsible for improving the security...
Ongoing contract
Local area
Microsoft Corporation
Redmond, WA
9 days ago
Director, Principal Software Engineering, AI capabilities
$134.2k - $258.3k
...peers across Development & Engineering and Architecture teams, collaborating... ...CI/CD delivery using code management, configuration management... ...systems that combine AI models with external tools or... .../B testing, and performance benchmarking Ideally, you’ll also Microsoft...
Summer holiday
Local area
Flexible hours
EY
Seattle, WA
6 days ago
Principal Systems Engineer (C++) - AI Infrastructure
...Principal Systems Engineer (C++) - AI Infrastructure About the Role What if your C++... ...build system architecture and package management strategies ~ Experienced designing... ...AI/ML workflows, model training, or benchmarking infrastructure Experience with distributed...
Hourly pay
Ongoing contract
Contract work
Freelance
Remote work
Flexible hours
Alignerr
Seattle, WA
4 days ago
Senior Principal Engineer - AI/ML Architect
...Senior Director, Design Engineering Req ID: 134544 Hiring Manager: Randy Clark Band: 14 Remote Position: Yes Region: Americas Country: USA Summary... ...This position is for a Senior Principal Engineer, AI/ML System Architect. As system architect,one will define...
Local area
Remote work
Celestica
Seattle, WA
5 days ago
Senior Principal Engineer, Infrastructure
...Docker is seeking a Senior Principal Engineer to serve as the technical visionary and... ...usage-based pricing, our expansion into AI and security products, and our growth from... ...account ownership, organization lifecycle management, and namespace separation Define the Centralized...
Contract work
Immediate start
Remote work
Home office
Docker
Seattle, WA
6 days ago
Principal Video Infrastructure Engineer - Oracle Video Edge
$99.6k - $223.4k
...Oracle Video @ Edge Engineer Oracle Cloud Infrastructure (OCI) is building Oracle Video @ Edge (OVE), a next-generation video delivery... ...everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers...
Temporary work
Flexible hours
Oracle
Seattle, WA
5 days ago
Principal Video Infrastructure Engineer - Oracle Video Edge
$99.6k - $223.4k
...delivery Work with a highly technical, distributed systems-focused engineering team Responsibilities Responsibilities Design and... ...from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn...
Temporary work
Flexible hours
Oracle
Olympia, WA
4 days ago
Security Engineering Manager, Network Security
$165k - $242k
...Security Engineering Manager, Network Security Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators...
Temporary work
Flexible hours
CoreWeave
Bellevue, WA
3 days ago
Analytics Data Engineering Manager, Product
$370k
...Analytics Data Engineering Manager, Product San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society...
Work at office
Visa sponsorship
Flexible hours
Anthropic
Seattle, WA
9 days ago
Senior Principal Machine Learning Engineer, Ad Platforms
$228.7k - $306.7k
...Senior Principal Machine Learning Engineer, Ad Platforms Technology... ...Machine Learning and AI patterns, platforms and infrastructure... ...~ Model optimization and inference (TensorRT, ONNX, DeepSpeed)... ...augmented generation (RAG), context management, and multi-agent systems and...
Work experience placement
Disney
Seattle, WA
1 day ago
Senior Principal Machine Learning Engineer, Ad Platforms
$228.7k - $306.7k
...Sr Principal Machine Learning Engineer Technology is at the heart... ...Machine Learning and AI patterns, platforms and infrastructure... ...~ Model optimization and inference (TensorRT, ONNX, DeepSpeed)... ...augmented generation (RAG), context management, and multi-agent systems and...
Work experience placement
Local area
The Walt Disney Studios
Seattle, WA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Engineering Manager, Inference Benchmarking — AI Perf. Be the first to apply!