Engineering Manager, Inference Benchmarking — AI Perf
$224k - $356.5k
NVIDIA
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s an outstanding legacy of innovation that’s fueled by great technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

NVIDIA’s open-source benchmarking platform, AIPerf, is the growing standard for assessing LLM serving performance across various inference frameworks. Hyperscalers, cloud providers, and enterprises use AIPerf to inform decisions on production inference, including choosing GPUs, optimizing costs, reducing latency, improving efficiency, and scaling.

As Technical Lead Manager, you will lead the engineering team within NVIDIA’s Dynamo organization. Your responsibility is to build and advance the platform so AIPerf becomes the leading benchmarking tool for datacenter, local, and edge use cases, spanning LLM, multimodal, diffusion, and computer vision inference. This position combines hands-on leadership with expertise in systems engineering, inference infrastructure, and open-source communities. It has a direct effect on how AI performance is measured and pushed forward.

What you'll be doing:
- Driving the technical roadmap for AIPerf's core infrastructure: load generation, ZMQ-based microservices, GPU telemetry (DCGM/PyNVML), Prometheus metrics, statistical confidence intervals, and Kubernetes-native deployment.
- Taking ownership of the accuracy and statistical soundness of benchmark results that engineering groups throughout the industry depend on to inform production infrastructure decisions.
- Advising on upstream engine integrations involving vLLM, TRT-LLM, and SGLang in partnership with NVIDIA's Dynamo and NIM teams to maintain AIPerf's relevance across emerging hardware, workload categories, and inference configurations.
- Hiring, mentoring, and growing a team of senior engineers operating in a high-velocity open-source environment with active external contributors worldwide.

What we need to see:
- Bachelor's degree in Computer Science, Electrical Engineering, or related field, or equivalent experience.
- 8+ years of overall software engineering experience building performance-critical infrastructure, ML tooling, or distributed systems.
- 3+ years of engineering leadership experience as a tech lead, TLM, or engineering manager.
- Deep understanding of LLM inference mechanics (TTFT, ITL, KV caching, Prefill/Decode, speculative decoding) and the ability to reason about measurement correctness and reproducibility.
- Proven track record of collaborating across multi-functional groups and delivering production-quality output in high-velocity, high-external-visibility environments.

Ways to stand out from the crowd:
- Extensive experience with vLLM, TRT-LLM, or SGLang internals along with contributions to their upstream projects.
- Experience building Kubernetes-native infrastructure including operators, Helm charts, and GPU observability tooling (DCGM, dcgm-exporter, PyNVML).
- Background in competitive benchmarking frameworks such as MLPerf or equivalent industry-standard evaluation systems.
- History of leading or making meaningful contributions to active open-source projects with external communities.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 1, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry. Learn more about NVIDIA.

