Manager, Large Language Model Inference

$184k - $287.5k

Dormont Manufacturing Company

At NVIDIA, we aren’t just powering the AI revolution; we’re accelerating it. The TensorRT inference platform is the backbone of modern AI, delivering the industry’s fastest and most efficient deployment of cutting‑edge deep learning models on every NVIDIA GPU. With demand for AI exploding, particularly for large language models (LLMs) and vision‑language models (VLMs, VLAs), we are significantly expanding our team.

We’re seeking a highly skilled and driven Engineering Manager to lead the development of the next generation of LLM/VLM/VLA inference software technologies that will define the future of AI. This is a high‑impact, hands‑on leadership role at the intersection of deep technical expertise and world‑class management. You won’t just manage; you’ll architect and guide a brilliant team of engineers who are building the core LLM inference runtime. Your work will be highly collaborative, interfacing directly with NVIDIA Researchers, GPU Architects, and other teams across the company to ensure we ship production‑grade, lightning‑fast software that sets the global standard for AI performance.

What You’ll Be Doing
  • Lead and grow a team responsible for specialized kernel development, runtime optimizations, and frameworks for LLM inference.
  • Drive the design, development, and delivery of production inference software, targeting NVIDIA’s next‑generation enterprise and edge hardware platforms.
  • Integrate cutting‑edge technologies developed at NVIDIA and offer an intuitive developer experience for LLM deployment.
  • Lead software development execution, with responsibility for project planning, milestone delivery, and cross‑functional coordination.

What We Need to See
  • MS, PhD, or equivalent experience in Computer Science, Computer Engineering, AI, or a related technical field.
  • 7+ years of overall software engineering experience, including 3+ years of technical leadership experience.
  • Proven ability to lead and scale high‑performing engineering teams, especially across distributed and cross‑functional groups.
  • Strong background in C++ or Python, with expertise in software design and delivering production‑quality software libraries.
  • Demonstrated expertise in large language models (LLMs) and/or vision‑language models (VLMs).

Ways to Stand Out from the Crowd
  • Deep understanding of GPU architecture, CUDA programming, and system‑level performance tuning.
  • Background in LLM inference or working with frameworks such as TensorRT‑LLM, vLLM, or SGLang.
  • Passion for building scalable, user‑friendly APIs and enabling developers in the AI ecosystem.
  • A proven track record of growing and managing a team that encourages idea sharing, empowers team members, and provides opportunities for professional growth.

We are widely considered to be one of the technology world’s most desirable employers, and we have some of the most forward‑thinking and hardworking people in the world working with us. Due to outstanding growth, our best‑in‑class teams are expanding rapidly. If you’re a creative self‑starter with a real passion for technology, then come join us.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 2, and 224,000 USD - 356,500 USD for Level 3. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until January 13, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal‑opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 1 day ago
