Solutions Architect, LLM Model Builder

$152k - $241.5k

NVIDIA

NVIDIA is seeking an outstanding Solutions Architect, Foundation Models to join our growing team focused on partner enablement for reasoning models, multimodal models, and production inference! In this role, you will act as both a strategic technical expert and a hands-on advisor, helping partners build, benchmark, fine-tune, optimize, and deploy foundation model solutions for customer workloads.

The Partner Solutions Architecture team acts as a trusted advisor to the ecosystem. We enable partners to translate customer requirements into architectures, benchmark recipes, cluster test plans, compute sizing, and production readiness—accelerating time to value through the full-stack accelerated computing platform.

What you'll be doing:

Serve as the lead technical advisor for partners delivering reasoning, multimodal, fine-tuning, and model-serving solutions.
Guide partners to the right approach for customer workloads across fine-tuning, distillation, quantization, compression, benchmarking, and evaluation.
Define benchmark plans, synthetic data and evaluation workflows, and repeatable validation recipes.
Advise on compute planning, including cluster sizing, GPU and network selection, storage, memory tradeoffs, latency and throughput targets, and production-readiness testing.
Guide inference architecture across prefill and decode tradeoffs, batching, routing, disaggregated inference, and serving efficiency.
Develop reference architectures, playbooks, benchmark recipes, TCO calculators, and sizing models across CUDA, NeMo, Nemotron, Dynamo, TensorRT-LLM, Triton, NIMs, and related tooling.
Support pre- and post-sales engagements by translating complex model and infrastructure topics for partner and customer teams.

What we need to see:

MSc or PhD in Computer Science, Electrical Engineering, Software Engineering, Machine Learning, or a related field (or equivalent experience).
5+ years of relevant experience working with LLMs, VLMs, and large-scale inference systems, with hands-on expertise in fine-tuning, benchmarking, evaluation, optimization, and production deployment as a Research Engineer, Deep Learning Engineer, or equivalent.
Strong understanding of foundation models across data preparation, fine-tuning, post-training, evaluation, and inference.
Familiarity with reasoning models, reinforcement learning, and synthetic data generation and evaluation workflows.
Strong programming skills in Python and hands-on experience with PyTorch, JAX, or TensorFlow.
Familiarity with Nemotron, NeMo, Dynamo, TensorRT-LLM, Triton, vLLM, and similar inference and optimization stacks.
Strong communication and presentation skills, with the ability to advise both technical teams and executives.

Ways to stand out from the crowd:

Experience helping partners or customers deploy large-scale AI systems in production.
Experience building benchmark suites, fine-tuning recipes, sizing calculators, or TCO models for AI workloads.
Strong knowledge of GPU infrastructure, including NVLink, InfiniBand, MPI, NCCL, or adjacent cluster technologies.
Active OSS contributions in model tooling, inference, evaluation, or performance optimization.
Comfortable moving between deep technical reviews, architecture guidance, benchmarking, and partner enablement.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until April 11, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 2 days ago

