Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Engineering Manager, Inference Benchmarking — AI Perf

$224k - $356.5k
Full-time

NVIDIA

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s an outstanding legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. NVIDIA’s open-source benchmarking platform, AIPerf, is the growing standard for assessing LLM serving performance across various inference frameworks. Hyperscalers, cloud providers, and enterprises use AIPerf to inform decisions on production inference. This includes choosing GPUs, optimizing costs, reducing latency, improving efficiency, and scaling. As Technical Lead Manager, you will lead the engineering team within NVIDIA’s Dynamo organization. Your responsibility is to build and advance the platform so AIPerf becomes the leading benchmarking tool for datacenter, local, and edge use cases. This span LLM, multimodal, diffusion, and computer vision inference. This position combines hands-on leadership with expertise in systems engineering, inference infrastructure, and open-source communities. It has a direct effect on how AI performance is measured and pushed forward. What you'll be doing: Driving the technical roadmap for AIPerf's core infrastructure: load generation, ZMQ-based microservices, GPU telemetry (DCGM/PyNVML, Prometheus metrics, statistical confidence intervals, and Kubernetes-native deployment. Taking ownership for the accuracy and statistical soundness of benchmark results that engineering groups throughout the industry depend on to inform production infrastructure decisions. Advising upstream engine integrations involving vLLM, TRT-LLM, and SGLang in partnership with NVIDIA's Dynamo and NIM teams to maintain AIPerf's relevance across emerging hardware, workload categories, and inference configurations. Hiring, mentoring, and growing a team of senior engineers operating in a high-velocity open-source environment with active external contributors worldwide. What we need to see: Bachelor's degree in Computer Science, Electrical Engineering, or related field, or equivalent experience. 8+ overall years of software engineering experience building performance-critical infrastructure, ML tooling, or distributed systems. 3+ years of engineering leadership experience as a tech lead, TLM, or engineering manager. Deep understanding of LLM inference mechanics — TTFT, ITL, KV caching, Prefill/Decode, speculative decoding — and the ability to reason about measurement correctness and reproducibility. Proven track record of collaborating across multi-functional groups and delivering production-quality output in high-velocity, high-external-visibility environments. Ways to stand out from the crowd: Extensive experience with vLLM, TRT-LLM or SGLang internals along with contributions to their upstream projects. Experience building Kubernetes-native infrastructure including operators, Helm charts, and GPU observability tooling (DCGM, dcgm-exporter, PyNVML). Background in competitive benchmarking frameworks such as MLPerf or equivalent industry-standard evaluation systems. History leading or making meaningful contributions to active open-source projects with external communities. Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until June 1, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry. Learn more about NVIDIA.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Engineering Manager, Inference Benchmarking — AI Perf in Washington State vacancy
  • $228.4k - $289.2k

     ...advance the state of AI for highvolume, realtime...  ...and Cisco's global engineering capabilities. Our work...  ...a Senior Engineering Manager, AI , you will lead a...  ...for experimentation, benchmarking, model evaluation, observability...  ...distributed training, inference, tool integration, and... 
    Suggested
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    Seattle, WA
    2 days ago
  •  ...Principal Rust Engineer - ML Infrastructure (AI Training) About the Role What if your deep expertise...  ...~ Deep understanding of memory management and zero-copy deserialization with...  ..., model training pipelines, or benchmarking infrastructure Experience building... 
    Suggested
    Hourly pay
    Ongoing contract
    Contract work
    Freelance
    Remote work
    Flexible hours

    Alignerr

    Seattle, WA
    4 days ago
  • $291.5k - $369.1k

     ...hybrid, multi‑cloud environments. Join the AI Models team at Splunk, where we advance...  ...excellence of Splunk and Cisco’s global engineering capabilities. Our work spans networking,...  ..., distributed training pipelines, and inference efficiency to minimize cost and latency... 
    Suggested
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    Bellevue, WA
    4 days ago
  • $200k - $250k

     ...Metropolis is seeking a Senior Manager of Machine Learning Engineering within the Advanced Technologies Group...  ...systems that power our next generation of AI. You will oversee 4 critical pillars...  ..., model registries, and low-latency inference services. Ensure high availability... 
    Suggested
    Temporary work
    Work at office
    Local area

    Metropolis Corp

    Seattle, WA
    4 days ago
  • $185k - $220k

     ...About Curative AI, Inc. Curative AI, Inc. is an ambitious...  ...our customers in Revenue Cycle Management (RCM) and Clinical Operations...  ...experienced Principal Data Engineer for our rapidly growing company...  ...data available for training, inference, and real-time AI agents.... 
    Suggested
    Full time
    H1b

    Curative AI, Inc.

    Bellevue, WA
    1 day ago
  • $197.3k - $313.7k

     ...Salesforce Salesforce is the #1 AI CRM, where humans with agents...  ...data modeler to build and manage the data model(s) for our...  ...workloads, including feature engineering for ML models and real-time...  ...training pipelines, and real-time inference. ~ A proven track record... 
    Work at office

    Salesforce.Com Inc

    Seattle, WA
    1 day ago
  • $60k

     ...Principal Solutions Engineer (US West Remote) Join to apply for...  ...West Remote) role at Jobright.ai Principal Solutions Engineer...  ...feedback into Product Management based on field engagements to...  ...interviewing at Jobright.ai by 2x Inferred from the description for this... 
    Full time
    Remote work

    jobright.com

    Seattle, WA
    6 days ago
  • $240k - $280k

     ...A leading cybersecurity firm is looking for a Senior Manager, AI Engineering, to lead AI engineering teams focused on reducing cyber risk. The role involves guiding Engineering Managers and senior engineers while collaborating cross-functionally. Candidates should have... 
    Remote work
    Flexible hours

    HackerOne

    Seattle, WA
    1 day ago
  • $188.2k - $325.5k

     ...Security Engineering Manager We are the Apple Services Engineering (ASE) Security team. We build the secure systems and infrastructure that...  ...organizational boundaries without direct authority Familiarity with AI-assisted development practices and how they're changing... 
    Relocation

    Apple

    Seattle, WA
    3 days ago
  •  ...****10+ years of experience in software engineering, with significant experience in distributed...  ....***** ****Experience with Vertex AI, Gemini APIs, OpenAI APIs, or similar enterprise...  ...**** ****Knowledge of cost modeling and inference optimization techniques.***** ****... 
    Work at office
    Local area
    Remote work
    Work from home

    F5 Networks

    Seattle, WA
    1 day ago
  • $206.5k

     ...worldwide, helping them modernize through shared AI expertise and operational discipline. The...  ...experienced and hands-on Principal Engineer to drive the engineering and execution...  ...) using Terraform for provisioning and managing large-scale cloud environments. Deep expertise... 
    Permanent employment
    Worldwide

    Banyan Software

    Seattle, WA
    3 days ago
  • $206k - $303k

     ...CoreWeave is the AI Hyperscaler™, delivering a cloud platform...  ...We’re seeking a Principal Engineer to serve as the hands-on technical...  ...for our next-generation Inference Platform . As a senior individual...  ..., and performance benchmarks across gRPC/ CUDA Graphs, and... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Bellevue, WA
    more than 2 months ago
  •  ...Founding Head of Engineering, Agentic AI About the Company Reputable legal technology (LegalTech) company using AI to provide affordable...  ...Head of Engineering will also lead a team, including hiring, managing, and scaling the AI engineering team, and will be instrumental... 

    Confidential

    Seattle, WA
    5 days ago
  • $200.1k - $270.6k

     ...Agentic AI drives innovation at the forefront of artificial intelligence, enabling customers...  ..., applied scientists, software engineers, and solution architects who work backwards...  ...solving intrinsically hard problems in context management, state coordination, and autonomous... 
    Worldwide
    Flexible hours
    Shift work

    Amazon

    Seattle, WA
    4 days ago
  • $141.3k - $191.1k

     ...solutions, leveraging automation and AI-driven predictive modeling to...  ...Automation Technical Program Manager (TPM) to lead the development...  ...This role shapes how data, engineering, and operations work together...  ...the industry- leading benchmark for workplace health and... 
    Flexible hours
    Shift work

    Amazon

    Bellevue, WA
    2 days ago
  •  ...of the hardest challenges in AI infrastructure today: enabling...  ..., model training, and scaled inference spanning thousands of GPUs,...  ...leader building a high-performing engineering team. In this role, you...  ...experience leading, managing, growing and coaching a team... 
    Temporary work
    H1b
    Work at office
    Flexible hours

    Union

    Bellevue, WA
    2 days ago
  • $175.1k - $236.9k

     ...Amazon Web Services is looking for an experienced Security Engineering Manager to join the Security Analytics and AI Research group within AWS Security Services. This group is entrusted with researching and developing core detection and machine learning algorithms for... 
    Flexible hours

    Amazon

    Seattle, WA
    2 days ago
  •  ...About Aurelian Aurelian builds AI tools that help 911 centers handle more with...  ...and CORA makes every emergency call more manageable for the people handling it. Aurelian...  ...Join Us We're hiring a Tech Lead / Engineering Manager to lead a small pod and own a key... 
    Full time
    Live in
    Work at office
    Relocation package

    Aurelian

    Seattle, WA
    5 days ago
  •  ...A global consulting firm is seeking a Manager in Application Design and Development to lead projects ensuring quality and risk management. The role requires deep technical skills in software engineering, with qualifications including a Bachelor's degree and 4-6 years... 
    Flexible hours

    Ernst & Young Oman

    Olympia, WA
    2 days ago
  • $142.8k - $274.8k

     ...Overview The Cloud & AI organization accelerates Microsoft's mission and bold ambitions to ensure that our company and...  ...impact on service continuity and trust. The Principal Security Engineering Manager role leads a team responsible for improving the security... 
    Ongoing contract
    Local area

    Microsoft Corporation

    Redmond, WA
    9 days ago
  • $134.2k - $258.3k

     ...peers across Development & Engineering and Architecture teams, collaborating...  ...CI/CD delivery using code management, configuration management...  ...systems that combine AI models with external tools or...  .../B testing, and performance benchmarking Ideally, you’ll also Microsoft... 
    Summer holiday
    Local area
    Flexible hours

    EY

    Seattle, WA
    6 days ago
  •  ...Principal Systems Engineer (C++) - AI Infrastructure About the Role What if your C++...  ...build system architecture and package management strategies ~ Experienced designing...  ...AI/ML workflows, model training, or benchmarking infrastructure Experience with distributed... 
    Hourly pay
    Ongoing contract
    Contract work
    Freelance
    Remote work
    Flexible hours

    Alignerr

    Seattle, WA
    4 days ago
  •  ...Senior Director, Design Engineering Req ID: 134544 Hiring Manager: Randy Clark Band: 14  Remote Position: Yes  Region: Americas  Country: USA Summary...  ...This position is for a Senior Principal Engineer, AI/ML System Architect. As system architect,one will define... 
    Local area
    Remote work

    Celestica

    Seattle, WA
    5 days ago
  •  ...Docker is seeking a Senior Principal Engineer to serve as the technical visionary and...  ...usage-based pricing, our expansion into AI and security products, and our growth from...  ...account ownership, organization lifecycle management, and namespace separation Define the Centralized... 
    Contract work
    Immediate start
    Remote work
    Home office

    Docker

    Seattle, WA
    6 days ago
  • $99.6k - $223.4k

     ...Oracle Video @ Edge Engineer Oracle Cloud Infrastructure (OCI) is building Oracle Video @ Edge (OVE), a next-generation video delivery...  ...everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers... 
    Temporary work
    Flexible hours

    Oracle

    Seattle, WA
    5 days ago
  • $99.6k - $223.4k

     ...delivery Work with a highly technical, distributed systems-focused engineering team Responsibilities Responsibilities Design and...  ...from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn... 
    Temporary work
    Flexible hours

    Oracle

    Olympia, WA
    4 days ago
  • $165k - $242k

     ...Security Engineering Manager, Network Security Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators... 
    Temporary work
    Flexible hours

    CoreWeave

    Bellevue, WA
    3 days ago
  • $370k

     ...Analytics Data Engineering Manager, Product San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    Seattle, WA
    9 days ago
  • $228.7k - $306.7k

     ...Senior Principal Machine Learning Engineer, Ad Platforms Technology...  ...Machine Learning and AI patterns, platforms and infrastructure...  ...~ Model optimization and inference (TensorRT, ONNX, DeepSpeed)...  ...augmented generation (RAG), context management, and multi-agent systems and... 
    Work experience placement

    Disney

    Seattle, WA
    1 day ago
  • $228.7k - $306.7k

     ...Sr Principal Machine Learning Engineer Technology is at the heart...  ...Machine Learning and AI patterns, platforms and infrastructure...  ...~ Model optimization and inference (TensorRT, ONNX, DeepSpeed)...  ...augmented generation (RAG), context management, and multi-agent systems and... 
    Work experience placement
    Local area

    The Walt Disney Studios

    Seattle, WA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Engineering Manager, Inference Benchmarking — AI Perf. Be the first to apply!