Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer, GeForce G-Assist

$184k - $287.5k

NVIDIA

GeForce G-Assist Engineer

At NVIDIA, we're building GeForce G-Assist — an on-device AI assistant that combines Small Language Models (SLMs), retrieval systems, and hybrid cloud capabilities to deliver responsive, context-aware assistance inside the GeForce ecosystem. We work closely across engineering and product teams to ensure G-Assist performs reliably in real-world scenarios.

What you'll be doing:

  • Together, we focus on how models behave in production, not just on benchmarks. Evaluate and improve Small Language Models used in GeForce G-Assist, with an emphasis on accuracy, robustness, and conversational reliability. Identify and mitigate conversation and context contamination, including state drift, prompt leakage, and retrieval cross-talk.
  • Work with SLM and VLM architectures to support text and multimodal interactions. Collaborate on hybrid architectures that combine local SLMs with cloud-based models. We value engineers who enjoy thinking across the full system—from model behavior to runtime performance.
  • Optimize local inference using llama.cpp, including quantization, memory usage, and performance tuning. Read, write, and optimize C/C++ code in performance-critical paths.
  • Design and integrate retrieval-augmented generation (RAG) systems that ground responses in system and user context. Support agentic AI workflows, enabling planning, tool use, and multi-step execution.

What we need to see:

  • 8+ years of validated experience in system software or a related field, with an M.S. or higher degree in Computer Science, Data Science, Engineering, or a related field (or equivalent experience). We're looking for teammates who enjoy solving real problems, learning as they go, and collaborating in a tight-knit environment.
  • Strong ability to read and write C/C++ code in systems-level or performance-sensitive environments, along with proficiency in Python. Hands-on experience with llama.cpp or similar local inference frameworks.
  • Hands-on experience evaluating Small Language Models, including task-based and conversational testing, with an understanding of conversation dynamics, long-context behavior, and contamination challenges.
  • Knowledge of SLM and VLM architectures and their trade-offs, experience with retrieval technologies and language-model integration, and familiarity with agentic AI patterns such as tool use and planning.

Ways to stand out from the crowd:

  • Experience contributing to language or multimodal models that power user-facing products, features, or workflows.
  • A track record of collaborating with product, platform, or systems teams to balance model capability, performance, and user experience.
  • Demonstrated ability to translate user needs or feedback into measurable improvements in model behavior or system reliability.

We are widely considered to be one of the technology world's most desirable employers, and as a result, we have some of the most forward-thinking and hardworking people in the world working for us. If you're passionate, creative, and driven, we'd love to have you join the team. With competitive salaries and a generous benefits package, we are considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us, and due to unprecedented growth, our exclusive engineering teams are rapidly growing. We want to hear from you if you're a creative and autonomous engineer with a real passion for technology.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 20, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer, GeForce G-Assist in Santa Clara, CA vacancy
  • $150k

     ..., data scientists, and engineers, tackling the most fundamental...  ...computing in deep learning, driving impactful...  ...The Role As a Machine Learning Engineer at the...  ...ensure best practices (e.g., style guidelines, checking...  ...Leave *Employee Assistance Program *Life insurance... 
    Suggested
    Worldwide
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  •  ...organizations that keep the world running. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity. We thrive on...  .../Flink). ~ Hands on experience with agentic frameworks (e.g., AutoGen, CrewAI, or custom orchestration layers), RAG, MCP, fine... 
    Suggested
    Immediate start

    Illumio

    Sunnyvale, CA
    4 days ago
  • $147.4k - $272.1k

     ...Machine Learning Engineer - Agentic AI The VCV organization has pioneered human-centric, real-time...  ...environments. Strong proficiency with LLM-assisted coding, including using AI tools to...  ...across multiple components (e.g., services, data pipelines, integrations... 
    Suggested
    Relocation

    Apple

    Sunnyvale, CA
    2 days ago
  •  ...seeking a talented and experienced Senior Machine Learning Engineer. The ideal candidate will have a...  ...regulations, and ordinances. If you need assistance and/or a reasonable accommodation due...  ...experience with machine learning frameworks (e.g., TensorFlow, PyTorch). Experience... 
    Suggested

    Insight Global

    Sunnyvale, CA
    1 day ago
  • $184k - $287.5k

     ...Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people are no longer science...  ...Senior Perception Engineer to develop and productize NVIDIA...  ...using deep learning frameworks (e.g., PyTorch). ~ Experience in data... 
    Suggested
    Odd job
    Work experience placement

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $150k

     ...Distributed ML Engineer We are a dedicated research lab for building...  ...computing in deep learning, driving impactful discoveries...  ...to ensure best practices (e.g., style guidelines, checking...  ...Parental Leave ~ Employee Assistance Program ~ Life insurance and... 
    Work experience placement
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $19 - $65 per hour

     ...company's existing simulation engine. Develop metrics to validate...  ...Familiarity with deep learning frameworks (PyTorch preferred)...  ...understanding of generative models (e.g., VAEs, GANs, diffusion models...  ...responses. These tools assist our recruitment team but do not... 
    Internship

    PlusAI, Inc.

    Santa Clara, CA
    3 days ago
  • $170k - $240k

     ...development initiatives. As a Senior ML Engineer, you will collaborate closely with machine learning engineers, research scientists,...  ...in AI/ML infrastructure, e.g., enabling distributed training...  ...vacation & holidays, tuition assistance programs, employee assistance program... 
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    8 hours ago
  • $174.72k - $295.68k

     ...Senior Machine Learning Engineer - Foundation Model Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation...  ...Design and implement large-scale multi-modal architectures (e.g., vision–language–action transformers) for end-to-end... 
    Full time

    XPENG

    Santa Clara, CA
    4 days ago
  • $185.1k - $335.3k

     ...The Role We are looking for a Staff Machine Learning Engineer to serve as a technical leader for...  ...validate, and maintain map primitives (e.g., lanes, boundaries, traffic controls,...  ...insurance, paid vacation & holidays, tuition assistance programs, employee assistance program,... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    3 days ago
  •  ...Staff Machine Learning Engineer It started with a simple idea: what if surgery could be less invasive...  .... As a global leader in robotic-assisted surgery and minimally invasive care, our...  ...inference, GPU/throughput optimization (e.g., TensorRT, ONNX Runtime, mixed... 
    Work at office
    Local area
    Worldwide
    Flexible hours

    Intuitive

    Sunnyvale, CA
    2 days ago
  • $291.5k - $369.1k

     ...volume, real‑time, multi‑modal machine‑generated data — including...  ...of Splunk and Cisco’s global engineering capabilities. Our work spans...  ...and unstructured data, deep learning‑based time series modeling, advanced...  ...deep learning frameworks (e.g., PyTorch, TensorFlow)... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    Sunnyvale, CA
    16 hours ago
  • $160k - $200k

     ...Senior ML Infrastructure Engineer at Plus, you will...  ...state-of-the-art deep learning frameworks like PyTorch...  ...of what's possible in machine learning infrastructure...  ...experiment tracking tools (e.g., Docker, Kubernetes, multiprocessing...  ...responses. These tools assist our recruitment team... 

    PlusAI, Inc.

    Santa Clara, CA
    8 hours ago
  • $272k - $431.25k

     ...NVIDIA is looking for a Machine Learning (ML) Engineer to join the GPU accelerated Apache Spark team. Apache...  ...Develop AI-based agents and tools to assist with fixing system issues and application...  ...boosted tree model solutions (e.g., XGBoost). Ways to stand out from... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $130k - $220k

     ...-growing teams. We're looking for a machine learning engineer to train and deploy the latest generation...  ...training with modern frameworks (e.g. PyTorch) ~ A rigorous approach to model...  ..., or assessing responses. These tools assist our recruitment team but do not replace... 

    PlusAI, Inc.

    Santa Clara, CA
    4 days ago
  • $296.3k

     ...We are seeking a Principal AI Engineer to lead the design and advancement...  ...experience in AI/ML domain (e.g., enabling distributed...  ...vacation & holidays, tuition assistance programs, employee assistance...  ...on realizing your ambitions. Learn how GM supports a rewarding career... 
    Local area
    Remote work
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    3 days ago
  • $150k

     ..., data scientists, and engineers, tackling the most fundamental...  ...computing in deep learning, driving impactful...  ...foundation models to unlock machine intelligence beyond...  ...infrastructure on AWS (e.g., compute, storage, networking...  ...developer and AI-assisted coding (e.g., Codex,... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  •  ...Summary Join Apple’s Information Security Machine Learning (ISML) team, where we are redefining...  ...motivated and talented Machine Learning Engineer to join our dynamic and growing team....  ...Familiarity with cloud platforms (e.g., AWS, GCP) and their security offerings... 
    Local area

    Apple

    Sunnyvale, CA
    2 days ago
  • $157.2k - $254.1k

     ...Machine Learning Engineer We are seeking a Machine Learning Engineer to join our pioneering security...  ...and deep learning architectures (e.g., Sequence models, GNNs, Transformers)...  ...individuals with a disability. If you require assistance or accommodation due to a disability... 

    Palo Alto Networks

    Santa Clara, CA
    3 days ago
  • $150k

     ...researchers, data scientists, and engineers, tackling the most fundamental...  ...performance computing in deep learning, driving impactful discoveries...  ...training frameworks (e.g., DeepSpeed, FSDP, FairScale,...  ...• Experience with large-scale machine learning workloads (strong ML... 
    Flexible hours

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $19 - $65 per hour

     ...enthusiastic and driven Simulation/ML Engineer Intern to join our team. In...  ...ll help build an internal AI assistant that lets employees instantly...  .... Required Skills Machine Learning & NLP: Solid understanding of...  ...quantizing open‑source models (e.g., Qwen, LLaMA, Mistral) using... 
    Hourly pay
    Internship

    PlusAI

    Santa Clara, CA
    5 days ago
  • $171k

     ...Partner with platform, product, and security engineering teams to enable the successful deployment of the latest machine learning techniques into production. Basic Qualifications...  .... Familiar with modern AI/ML frameworks (e.g., PyTorch). Preferred Qualifications... 
    Full time
    Work experience placement
    Work at office
    Remote work

    Uber

    Sunnyvale, CA
    5 days ago
  • $181.1k - $272.1k

     ...technology for artificial intelligence, machine learning and natural language processing. The features...  ...are looking for. Our universal search engine powers search features across Apple...  ...of-the-art LLM fine-tuning techniques (e.g., SFT with Rejection Sampling, RLHF, Reward... 
    Relocation package

    Apple Inc.

    Santa Clara, CA
    5 days ago
  • $181.1k - $318.4k

     ...Machine Learning Engineer – Computer Vision & Data Systems At Apple, we are dedicated to creating technologies that enrich people's lives. Our...  ...Experience building or optimizing large-scale data pipelines (e.g., distributed ETL, dataset generation, annotation workflows,... 
    Relocation

    Apple

    Sunnyvale, CA
    4 days ago
  • $184k - $287.5k

     ...Scientist For Voice Of The Customer GeForce NOW (GFN) provides high-performance gaming...  ..., unstructured datasets into precise engineering actions. We are looking for a validated...  ...Proficiency in both supervised and unsupervised learning, with a specific focus on time-series... 

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $148.91k - $252k

     ...Machine Learning Engineer - LLM, AI & Robotics Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation,...  ...use cases in Humanoid Robots and Autonomous Driving Cars, e.g. chat, API calling, etc. Work on efficient LLMs (e.g. small... 
    Full time

    XPENG

    Santa Clara, CA
    4 days ago
  • $224k - $356.5k

     ...We are seeking exceptional Senior Machine Learning and Simulation Engineers to join NVIDIA's Autonomous Vehicles (AV) Simulation team! This role requires...  ...environments, and job scheduling/orchestration tools (e.g., Kubernetes, SLURM). Ways to stand out from the crowd... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $125k - $201.25k

     ...profoundly impact health for humanity. Learn more at jnj.com As guided by Our Credo...  ...for the best talent for Senior Machine Learning Engineer - Robotics to be in Santa Clara, CA....  ...proficiency in deep learning frameworks (e.g., PyTorch, TensorFlow). ~ Experience... 
    Work experience placement
    Local area
    Immediate start

    Johnson and Johnson

    Santa Clara, CA
    7 days ago
  • $147.4k - $272.1k

     ...Applied Machine Learning Research Engineer - Multimodal for Human Understanding We're starting to see the incredible potential of multimodal foundation...  ...Experience with at least one deep learning framework (e.g., PyTorch, JAX, or equivalent). Master's degree in... 
    Worldwide
    Relocation

    Apple

    Sunnyvale, CA
    8 hours ago
  • $257k

    About the Role We are looking for an experienced Senior Staff Machine Learning Engineer to join the Account Integrity team within Trusted Identity...  ...real‑world problems. Industry experience in ML frameworks (e.g. Tensorflow, Pytorch, or JAX) and complex data pipelines;... 

    Uber

    Sunnyvale, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer, GeForce G-Assist. Be the first to apply!