Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Solutions Architect, LLM Model Builder

$152k - $241.5k

NVIDIA

NVIDIA is seeking an outstanding Solutions Architect, Foundation Models to join our growing team focused on partner enablement for reasoning models, multimodal models, and production inference! In this role, you will act as both a strategic technical expert and a hands-on advisor, helping partners build, benchmark, fine-tune, optimize, and deploy foundation model solutions for customer workloads.

The Partner Solutions Architecture team acts as a trusted advisor to the ecosystem. We enable partners to translate customer requirements into architectures, benchmark recipes, cluster test plans, compute sizing, and production readiness—accelerating time to value through the full-stack accelerated computing platform.

What you'll be doing:

  • Serve as the lead technical advisor for partners delivering reasoning, multimodal, fine-tuning, and model-serving solutions.

  • Guide partners to the right approach for customer workloads across fine-tuning, distillation, quantization, compression, benchmarking, and evaluation.

  • Define benchmark plans, synthetic data and evaluation workflows, and repeatable validation recipes.

  • Advise on compute planning, including cluster sizing, GPU and network selection, storage, memory tradeoffs, latency and throughput targets, and production-readiness testing.

  • Guide inference architecture across prefill and decode tradeoffs, batching, routing, disaggregated inference, and serving efficiency.

  • Develop reference architectures, playbooks, benchmark recipes, TCO calculators, and sizing models across CUDA, NeMo, Nemotron, Dynamo, TensorRT-LLM, Triton, NIMs, and related tooling.

  • Support pre- and post-sales engagements by translating complex model and infrastructure topics for partner and customer teams.

What we need to see:

  • MSc, PhD in Computer Science, Electrical Engineering, Software Engineer, ML Engineer, or related fields (or equivalent experience).

  • 5+ years of relevant experience working with LLMs, VLMs, and large-scale inference systems, with hands-on expertise in fine-tuning, benchmarking, evaluation, optimization, and production deployment as a Research Engineer, Deep Learning Engineer, or equivalent.

  • Strong understanding of foundation models across data preparation, fine-tuning, post-training, evaluation, and inference.

  • Familiarity with reasoning models, reinforcement learning, and synthetic data generation and evaluation workflows.

  • Strong programming skills in Python and hands-on experience with PyTorch, JAX, or TensorFlow.

  • Familiarity with Nemotron, NeMo, Dynamo, TensorRT-LLM, Triton, vLLM, and similar inference and optimization stacks.

  • Strong communication and presentation skills, with the ability to advise both technical teams and executives.

Ways to stand out from the crowd:

  • Experience helping partners or customers deploy large-scale AI systems in production.

  • Built benchmark suites, fine-tuning recipes, sizing calculators, or TCO models for AI workloads.

  • Strong knowledge of GPU infrastructure, including NVLink, InfiniBand, MPI, NCCL, or adjacent cluster technologies.

  • Active OSS contributions in model tooling, inference, evaluation, or performance optimization.

  • Comfortable moving between deep technical reviews, architecture guidance, benchmarking, and partner enablement.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits ( .

Applications for this job will be accepted at least until April 11, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Solutions Architect, LLM Model Builder in Santa Clara, CA vacancy
  • $184k - $287.5k

     ...Join our team at NVIDIA and help bring AI solutions to our largest customers. We are seeking an expert Solutions Architect to assist customers in building AI/ML and HPC software...  ...aspects related to tasks like large scale LLM training and inference. Conducting regular... 
    Suggested

    NVIDIA

    Santa Clara, CA
    1 day ago
  • CloudAct Inc. seeks a talented Product Designer to shape their LLM infrastructure. You will own end-to-end surfaces, collaborating closely with engineering to create high-fidelity designs and maintain the Nemo design system. The ideal candidate has at least 4 years of experience... 
    Suggested

    CloudAct Inc.

    Sunnyvale, CA
    1 day ago
  • $100k - $180k

     ...Job Title: SOLUTION ARCHITECT L1 City: Sunnyvale State/Province: California Posting Start Date: 6/2/26 Wipro...  ...requirementsFamiliarity with AI agentic frameworks, MCPs and LLM libraries (e.g., Anthropic SDK). Understanding of LLM... 
    Suggested
    Minimum wage
    Local area

    Wipro

    Sunnyvale, CA
    5 days ago
  • $152k - $241.5k

     ...out and enhance AI inference solutions at scale, demonstrating NVIDIA...  ...and Kubernetes. As a Solutions Architect focused on inference, you’ll collaborate...  ...pipelines using TensorRT-LLM, vLLM, SGLang, and other...  ...Inference Server, or TensorRT-LLM for model optimization and serving. ~... 
    Suggested

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...you to help shape that work. As a Senior Solution Architect on our Telco AI team, you will design...  ...operations using the latest generative models, NLP, RAG pipelines, and large-scale distributed...  ..., and continuously improving agentic LLM applications targeting Telco Network... 
    Suggested
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

    **What You Will Be Doing:*** Partnering with other solution architects, engineering, product and business teams. Understanding their strategies...  ...* Excellent knowledge of the theory and practice of LLM and DL inference* Excellent presentation, communication and collaboration... 

    NVIDIA

    Santa Clara, CA
    6 hours ago
  •  ...looking for an expert AV and Robotics Solutions Architect who can help customers accelerate Physical...  ...AV perception and planning/policy models, simulations, synthetic data generation...  ...such as model training and validations, LLM, VLA, VLM, World Models, video encoding... 

    NVIDIA Gruppe

    Santa Clara, CA
    6 hours ago
  • $152k - $241.5k

     ...simulation, synthetic data generation, multi‑step model training, and inference, all on a large scale! We are seeking a hands‑on Solutions Architect with deep expertise in backend...  ...inference pipelines using NVIDIA NIM, TensorRT‑LLM, vLLM, SGLang, and other engines to... 

    NVIDIA Gruppe

    Santa Clara, CA
    6 hours ago
  • $140k - $210k

     ...built on CoreWeave. We hire technical, AI Solution Architects who want to operate the full stack, own...  ...’s full platform: infrastructure, Models, Weave, observability, and inference. You...  ...deep learning models, including modern LLM architectures Experience designing and... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • $152k - $241.5k

     ...seeking an outstanding AI Engineer or Solutions Architect to join our growing team focused on ecosystem...  ...work experience in deploying AI models at scale as a Software Engineer or Deep...  ...learning, with a particular emphasis on LLM and VLM. ~ Hands-on experience with LLM... 
    Work experience placement

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

     ...counterparts to develop accelerated compute and AI solutions for their customers? We are looking for a hardworking and self-starting Solutions Architect with a passion for accelerated compute,...  ...projects. Background in Agentic AI, LLM deployment, and building ingestion/... 
    Remote work

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...Join NVIDIA as a Solutions Architect to own the evolution of Agentic AI for the enterprise. You will collaborate with top-tier enterprise software...  ...grade deployment. As a trusted advisor, you’ll transform raw LLM capabilities into high-performance, industry-focused... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building...  ...on Generative AI and Large Language Models (LLMs). You will also collaborate with...  ...Inference Server ( , TensorRT ( , TensorRT-LLM ( Excellent C/C++ programming skills,... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...customers in the world? NVIDIA is looking for an experienced Solutions Architect to assist customers with adoption of GPU hardware and software...  ...with NVIDIA GPUs and SDKs (i.e. CUDA, Triton, TensorRT-LLM, etc.) Deep understanding of the full software development... 
    Local area
    Remote work

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

     ...Senior Solutions Architect - AI Factory Deployment We are seeking an ambitious Senior Solutions Architect - AI Factory Deployment to join...  ...factories end to end. You will focus on running and debugging AI/LLM workloads and benchmarks on Linux-based GPU clusters, using... 

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

     ...are looking for an expert AV and GenAI Solutions Architect to help assist customers with adoption...  ...for building AV perception and planning models and pipelines, simulations, synthetic data...  ..., DiT, etc. Experience in deploying LLM models at scale on mainstream cloud... 

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

     ...world. NVIDIA is seeking an experienced Solutions Architect to be a trusted technical advisor,...  ...by integrating libraries, frameworks, models, and software applications. Deliver GenAI...  ...Triton Inference Server, TensorRT, TensorRT-LLM, NVIDIA CUDA-X Hands-on expertise... 
    Remote work

    NVIDIA

    Santa Clara, CA
    18 days ago
  •  ...Senior Solutions Architect, Generative AI Specialist Job Type: Full-time, Remote Responsibilities...  ...agentic AI systems and complex LLM‑powered workflows. BS/MS/PhD in Computer...  ...MLOps principles, including CI/CD for models, automated training pipelines, and production... 
    Full time
    Remote work

    Descon Inc

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

     ...Partner (ATP) team as a Senior Solutions Architect who thrives at the...  ...full AI stack, from initial model selection to large-scale deployment...  ...ML systems and sophisticated LLM workflows. ~ MS or advanced...  ...body of work demonstrating your builder foundation: open-source contributions... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...Technical Architect - AI/ML, LLM Location: Santa Clara, CA 95054 (Onsite) Full-Time...  ...strong hands-on experience in Large Language Models (LLMs) to design, build, and deploy production-grade AI solutions . This role focuses on advanced model development... 
    Full time

    Lorven Technologies

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe is seeking a Principal Systems Engineer to define the vision for memory management in large-scale LLM and storage systems. This role involves designing a unified memory layer and integrating with leading LLM serving engines. The ideal candidate has 15+ years... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $220.2k - $330.4k

     ...industries through intelligent edge solutions that combine connectivity,...  ...customer data, fine‑tuned models, and inference loads to...  ...solutions for Generative AI (LLM, VLM, VLA), Agentic AI, Voice...  ...Principal Systems Solutions Architect, you will define, develop, document... 
    Work experience placement
    Work at office

    Qualcomm

    Santa Clara, CA
    4 days ago
  • $19 - $65 per hour

    PlusAI, located in Santa Clara, is seeking a Simulation Engineer Intern to join its innovative team. You will leverage large language models to generate driving scenarios and interface directly with simulation environments, ensuring they adhere to traffic laws. The role... 
    Hourly pay
    Internship

    PlusAI

    Santa Clara, CA
    3 days ago
  • $251k - $377k

     ...systems. You Are: You are an accomplished and visionary architect with a passion for solving complex technical challenges and enabling...  ..., influencing technology roadmaps, and delivering tailored solutions that drive business impact. You have a proventrack recordof... 
    Remote work
    Worldwide

    Synopsys

    Sunnyvale, CA
    4 days ago
  •  ...expansion by identifying and addressing customer business and technical needs, providing consulting and planning services, formulating solutions, overseeing delivery and implementation, and leading pre-sales POC tests. Create customer benchmark projects in industries such... 

    VeloDB (Powered by Apache Doris)

    Sunnyvale, CA
    1 day ago
  • $170k

     ...Solutions Architect, PS Location: Sunnyvale, CA Experience: 7-10 Years Fortune 500 clients and government agencies trust eGain AI knowledge solution to improve customer experience and reduce cost of service. Top rated by Gartner, eGain AI Knowledge Hub orchestrates... 
    Work at office

    eGain Corporation

    Sunnyvale, CA
    3 days ago
  • $130.7k - $261.3k

     ...scientists. THE OPPORTUNITY This Senior Cloud Solutions Architect position can work out of our Santa...  ...key functional gaps, data models, interface, transformation and technical...  ...support, imaging). Agentic AI workflows and LLM-based systems. RAG architectures with vector... 
    Contract work

    Abbott Laboratories company

    Santa Clara, CA
    6 hours ago
  • $65k - $400k

     ...Decisive Point is seeking a RTOS Solutions Architect in Sunnyvale, CA. This role involves developing real-time operating systems for next-generation vehicle intelligence and software. The ideal candidate will have over 10 years of experience in embedded software development... 

    Decisive Point

    Sunnyvale, CA
    6 hours ago
  •  ...NVIDIA Gruppe is seeking a Solutions Architect in Robotics Simulation to lead innovations driven by AI and 3D simulation. You will guide partners in overcoming robotics technical challenges using NVIDIA's groundbreaking technologies. The position demands a Bachelor’s degree... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 hours ago
  •  ...headquartered in metropolitan Atlanta, GA with prime emphasis on the following service offerings: Staff Augmentation Lifecycle IT solutions Application Development & Support Outsourced Testing Mobile Development and Test Automation The company was incorporated in the State... 
    Worldwide

    Pyramid Consulting

    Sunnyvale, CA
    6 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Solutions Architect, LLM Model Builder. Be the first to apply!