
Manager, Software Engineering - Production AI Inference

$224k - $356.5k
Full-time

NVIDIA

NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a deeply technical software manager to lead production AI inference for NVIDIA Inference Microservices (NIM), the production runtime through which customers deploy optimized, enterprise-supported AI inference across cloud, data center, and edge environments. NIM makes state-of-the-art AI models available as a production-ready software stack, combining optimized inference engines, model profiles/recipes, validated runtime configurations, and security hardening. This role leads the team accountable for turning fast-moving model and inference engine work into reliable NIM releases that customers can operate with confidence.

This is a hands-on engineering management role for someone who can run production execution without managing from a distance. You will lead engineers working across model onboarding, serving stack integration, performance profiling/optimization, release quality, security readiness, automation, observability, and operational health. You will partner closely with the product, solution architect, security, research, and other internal engineering teams to make day-0 model launches repeatable and to raise the production bar for every NIM release.

What you'll be doing:
  • Lead the team responsible for shipping production-ready LLM NIMs, including planning, new model onboarding, validated serving recipes, release readiness, and post-release follow-through.
  • Build a predictable operating model for the team through roadmap planning, a weekly execution rhythm, launch checklists, clear ownership boundaries, collaborator communication, and issue management.
  • Own project execution by anticipating schedule, staffing, and dependency risks. Adapt plans under pressure and collaborate with peer managers to dynamically prioritize engineering timelines to remain agile in the fast-paced AI industry.
  • Drive continuous improvement in production workflows through RCCA and partner feedback, removing unnecessary and redundant work while keeping the team passionate about production outcomes.
  • Build and maintain a world-class AI inference engineering team by building an innovative culture, setting clear expectations, maintaining active feedback loops, and mentoring engineers and emerging leaders.

What we need to see:
  • 10+ overall years building production software, including 3+ years of managing software engineering teams.
  • Experience delivering production software with strong quality, reliability, and release expectations.
  • Experience driving process improvements and improving operational efficiency.
  • Excellent communication and collaborator management; ability to influence executive leadership across product, research, security, and operations.
  • Deep understanding of AI/ML fundamentals, innovative model architectures, inference engine/kernel, performance optimization strategies, accelerated computing, large-scale distributed systems, and security hardening.
  • A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience.

Ways to stand out from the crowd:
  • Built and managed globally distributed organizations; established durable engineering processes that significantly improved quality and velocity across multiple teams.
  • Recognized industry leader with contributions to open-source ecosystems (e.g., vLLM, SGLang, TensorRT-LLM, Dynamo, Triton, PyTorch), technical publications, or talks in containers, Kubernetes, GPU, or inference communities.
  • Drove measurable performance improvements for large-scale LLM inference systems, including latency, throughput, GPU utilization, cost efficiency, and performance regression prevention across production releases.
  • Hands-on experience with core GPU technologies such as CUDA, cuDNN, CUTLASS, cuBLAS, NCCL, NIXL, NVLink, and GPUDirect RDMA.
  • Hands-on experience delivering enterprise or government-ready AI software, including FedRAMP, air-gapped deployments, regulated environments, security hardening, compliance evidence, and production support expectations.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most hard-working and talented people in the world working for us. If you're creative and passionate about developing cloud services, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 3, and 272,000 USD - 431,250 USD for Level 4. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 6, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry. Learn more about NVIDIA.

Vacancy posted 6 hours ago
Similar jobs that could be interesting for you
Based on the Manager, Software Engineering - Production AI Inference in Santa Clara, CA vacancy
  • $229k - $343k

     ...other digital services. Snap Engineering teams build fun and technically sophisticated products that reach hundreds of...  ...forefront. We're looking for a Manager, Software Engineering, ML Inference to join Snap Inc.! What...  ...cost management Utilize AI tools and high velocity engineering... 
    Suggested
    Full time
    Live in
    Work at office
    Local area

    Snap Inc.

    Palo Alto, CA
    1 day ago
  • $244k - $330k

     ...Overview Come join Intuit's Consumer Group as a Manager 3, Software Engineering, helping power TurboTax — the #1 tax prep...  ...Consumer Group is reinventing the tax experience with AI, GenAI, and an expert-in-the-product model that combines world-class software with virtual... 
    Suggested
    Work experience placement

    Intuit Inc.

    Mountain View, CA
    10 hours ago
  • $272k - $431.25k

     ...for the most ambitious AI research, and the...  ...NVIDIA seeks a Senior Engineering Manager to define and drive NVIDIA...  ..., post-training, inference, and robotics, bridging...  ...systems that improve products and processes What we...  ...10+ overall years of software engineering... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $245k - $325k

     ...Director, Software Engineering Own and scale the SambaStack platform engineering efforts to deliver reliable, production-grade AI inference services Location: San Jose, California, United States...  ...hiring a Software Engineering Manager for the SambaStack platform. We... 
    Suggested
    Temporary work
    Local area
    Flexible hours

    jobs.frontdoordefense.com - Jobboard

    San Jose, CA
    2 days ago
  • $160.2k - $263.7k

    The Role: We are looking for a Manager, Software Engineer to elevate the quality, trust, and operational...  ...maintenance, build confidence in test products, and drive cross-functional alignment...  ...Experience using or integrating AI-powered development tools (e.g., GitHub... 
    Suggested
    Remote work
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  •  ...SambaNova is hiring a Software Engineering Manager for the SambaStack platform We help enterprises...  ...and service providers host their own AI inference platforms, powered by our state‑of‑...  ...scaling of foundation model workloads at production scale As an Engineering Manager... 
    Flexible hours

    SambaNova Systems

    San Jose, CA
    1 day ago
  • $271.32k - $301.47k

     ...Cohesity is a leader in AI-powered data security and management. Aided by an extensive ecosystem...  ...for Innovation, Product Strength, and Simplicity...  ...our industry. As a Senior Engineering Manager at Cohesity, you...  ...for seasoned, innovative software engineering leaders who are... 
    Full time
    Work at office
    2 days per week
    3 days per week

    Madrona Venture Labs

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

    Manager, Software Performance Engineering - Autonomous Vehicles...  ...into the unlimited potential of AI to define the next era of computing....  ...DRIVE Autonomous Solutions (NDAS) products, spanning multiple customer programs... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $165k - $267.5k

     ...Job Summary As an engineering leader, you will: Be...  ...part of a world‑class software engineering team that...  ...in all phases of the product development cycle, from...  ...functionally with Product Management, Software, and Quality...  ...experience with AI‑based conversational UI... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    2 days ago
  • $271.32k - $301.47k

     ...that craft and develop products for both on‑prem and SaaS...  ...high‑quality software development. Mentor, lead, and grow engineers on the team. Set clear...  ...Work closely with product management, program management, sales...  ...Security, Data Management, AI/ML, and SaaS. Pay... 
    Work at office
    Flexible hours

    Cohesity Inc.

    Santa Clara, CA
    3 days ago
  • $166.5k - $291.4k

     ...Description It all started when engineer Fred Luddy wrote code that...  .... Today, ServiceNow is the AI control tower for business reinvention...  ...the Team: As the Manager of Software Engineering, you will lead the...  ...(ITAM) supporting a product that helps enterprise organizations... 
    Full time
    Contract work
    Work at office
    Immediate start
    Remote work
    Flexible hours

    ServiceNow

    Santa Clara, CA
    1 day ago
  •  ...Description Primary Function of Position The manager in this position will lead a team of talented software engineers tasked with enabling the manufacturing of new...  ...the software infrastructure and tools to enable production. The manager will foster mutual trust and respect... 
    Temporary work

    Intuitive

    Sunnyvale, CA
    4 days ago
  • $185k - $298k

     ...Palo Alto Networks is seeking a Senior Manager, Software Engineering to lead multiple teams building cloud...  ...growth, partnering across engineering, product, design, and leadership teams to...  ...organization Champion the adoption of AI-assisted engineering practices to improve... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    5 days ago
  • $320k

    Within NVIDIA's Edge AI, Metropolis, and Blueprints...  ...team is the execution engine behind NVIDIA’s Vision...  ...from model onboarding to production deployment. We...  ...delivering robust, low‑latency inference at scale. You have led...  ...expertise in the embedded software sector, holding... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $199.3k - $249.1k

     ...Role Summary Our Enterprise Software team powers the digital backbone...  ...on time from planning and production to logistics and supplier...  ...We’re looking for a Software Engineering Manager to lead the teams that design...  ...of enterprise systems and AI. Responsibilities Lead, manage... 
    Full time
    Contract work
    Temporary work
    Part time
    Local area
    Shift work

    Rivian

    Palo Alto, CA
    4 days ago
  • $250k - $344.5k

     ...Network Security (NetSec) Engineering – Our team is at the...  ...and customer-facing AI-enabled solutions designed...  ...specialized group of software engineers dedicated to...  ...DNA of our NetSec product portfolio and beyond....  ...developing top-tier talent and managers focused on the... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    2 days ago
  •  ...is to build great products that accelerate next...  ...experiences—from AI and data centers,...  ...LLM and Multimodal inference at scale across...  ...across internal GPU software teams and engage with...  ...PERSON: Skilled engineer with strong...  ...to define goals, manage development efforts... 

    Advanced Micro Devices

    Santa Clara, CA
    5 days ago
  • $163k - $237k

     ...Technical Program Manager III, Software Engineering, Augmented Reality Mid Experience driving progress,...  ...technologies in one of the following: AI/ML, AR/VR, silicon or chip development...  ...solved for all. That's why Googlers build products that help create opportunities for... 

    Google

    San Jose, CA
    5 days ago
  • $170k - $240k

     ...Technical Program Manager, Software Engineering The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value...  ...dynamic Technical Program Manager (TPM) to drive product delivery at Sambanova Systems. You will play a critical... 
    Local area

    SambaNova Systems

    San Jose, CA
    1 day ago
  • Advanced Micro Devices in Santa Clara, California, seeks a strategic software engineering lead. This role entails developing techniques for optimizing key applications, particularly for large-scale inference within the K8s ecosystem. Successful candidates should possess... 

    Advanced Micro Devices

    Santa Clara, CA
    5 days ago
  •  ...builds the world's largest AI chip, 56 times larger...  ...-leading training and inference speeds; over 10 times...  ...re hiring a Principal Engineer for our Inference...  ...architecture, and write production code on critical paths...  ...years of experience in software engineering, with... 

    Cerebras Systems, Inc.

    Sunnyvale, CA
    1 day ago
  •  ...Inc. is seeking a Principal Engineer to lead their Inference Cloud Platform team. This...  ..., and contributing production code to enhance performance...  ...candidate has over 10 years of software engineering experience and...  ...to work on groundbreaking AI technology and be part of... 

    Cerebras Systems, Inc.

    Sunnyvale, CA
    1 day ago
  • $245k - $325k

     ...Director, Software Engineering SambaNova Systems San Jose...  ...SambaNova Systems AI is changing the world...  ...Software Engineering Manager for the SambaStack platform...  ...host their own AI inference platforms, powered by...  ...model workloads at production scale. As an... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Softbank Investment Advisers

    San Jose, CA
    3 days ago
  • $185k - $298k

     ...Job Summary The Senior Software Engineering Manager will oversee a team of innovative engineers designing and developing security features for our...  ...the organization. You will drive the building of global products and bring new ideas to the security discipline. Key Responsibilities... 
    Remote work
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $156k - $229k

    Technical Program Manager III, Software Engineering, Platforms and Devices Google...  ...for all. That’s why Googlers build products that help create opportunities for...  ...Services team combines the best of Google AI, Software, and Hardware to create... 
    Full time

    Google Inc.

    Mountain View, CA
    2 days ago
  • $184k - $287.5k

     ...Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme...  ...compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization. Define... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...Networking Systems & Software Architecture group is solving some of AI’s hardest infrastructure...  ...plug directly into production AI stacks. The team's...  ...computing interconnects. This Engineering Manager role leads a team...  ...distributed training and inference patterns. Preferred... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $300k - $375k

     ...autonomous machines, advanced software, and human expertise...  ...exceptionally strong engineering executive in the role...  ...Engineering & AI with end-to-end responsibility...  ...engineering: fleet management, observability, real-...  ...Company’s AI strategy into production – agentic AI, computer... 
    Full time

    Knightscope, Inc

    Sunnyvale, CA
    2 days ago
  •  ...safely bring GenAI-powered products, agents, automation,...  ...high-throughput batch inference, and fine-tuning on autoscaling...  ...for Generative AI at DoorDash, leading the...  ...serving and inference engines, fine-tuning and training...  ...industry experience in software engineering Deep... 
    Hourly pay
    Work at office
    Local area
    Remote work
    Flexible hours

    DoorDash USA

    Sunnyvale, CA
    6 hours ago
  •  ...that powers industrial AI. AI doesn't work...  ...We're a growth-stage software company helping manufacturers...  ..., and optimize production. Backed by leading...  ...experienced operators, engineers, and leaders who have...  ...teams - including product managers, OT/controls engineers... 
    Work experience placement
    Remote work
    Shift work

    Litmus

    Santa Clara, CA
    3 days ago
