Solutions Architect, LLM Model Builder
$152k - $241.5kNVIDIA
NVIDIA is seeking an outstanding Solutions Architect, Foundation Models to join our growing team focused on partner enablement for reasoning models, multimodal models, and production inference! In this role, you will act as both a strategic technical expert and a hands-on advisor, helping partners build, benchmark, fine-tune, optimize, and deploy foundation model solutions for customer workloads.
The Partner Solutions Architecture team acts as a trusted advisor to the ecosystem. We enable partners to translate customer requirements into architectures, benchmark recipes, cluster test plans, compute sizing, and production readiness—accelerating time to value through the full-stack accelerated computing platform.
What you'll be doing:
Serve as the lead technical advisor for partners delivering reasoning, multimodal, fine-tuning, and model-serving solutions.
Guide partners to the right approach for customer workloads across fine-tuning, distillation, quantization, compression, benchmarking, and evaluation.
Define benchmark plans, synthetic data and evaluation workflows, and repeatable validation recipes.
Advise on compute planning, including cluster sizing, GPU and network selection, storage, memory tradeoffs, latency and throughput targets, and production-readiness testing.
Guide inference architecture across prefill and decode tradeoffs, batching, routing, disaggregated inference, and serving efficiency.
Develop reference architectures, playbooks, benchmark recipes, TCO calculators, and sizing models across CUDA, NeMo, Nemotron, Dynamo, TensorRT-LLM, Triton, NIMs, and related tooling.
Support pre- and post-sales engagements by translating complex model and infrastructure topics for partner and customer teams.
What we need to see:
MSc, PhD in Computer Science, Electrical Engineering, Software Engineer, ML Engineer, or related fields (or equivalent experience).
5+ years of relevant experience working with LLMs, VLMs, and large-scale inference systems, with hands-on expertise in fine-tuning, benchmarking, evaluation, optimization, and production deployment as a Research Engineer, Deep Learning Engineer, or equivalent.
Strong understanding of foundation models across data preparation, fine-tuning, post-training, evaluation, and inference.
Familiarity with reasoning models, reinforcement learning, and synthetic data generation and evaluation workflows.
Strong programming skills in Python and hands-on experience with PyTorch, JAX, or TensorFlow.
Familiarity with Nemotron, NeMo, Dynamo, TensorRT-LLM, Triton, vLLM, and similar inference and optimization stacks.
Strong communication and presentation skills, with the ability to advise both technical teams and executives.
Ways to stand out from the crowd:
Experience helping partners or customers deploy large-scale AI systems in production.
Built benchmark suites, fine-tuning recipes, sizing calculators, or TCO models for AI workloads.
Strong knowledge of GPU infrastructure, including NVLink, InfiniBand, MPI, NCCL, or adjacent cluster technologies.
Active OSS contributions in model tooling, inference, evaluation, or performance optimization.
Comfortable moving between deep technical reviews, architecture guidance, benchmarking, and partner enablement.
Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.
You will also be eligible for equity and benefits ( .
Applications for this job will be accepted at least until April 11, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
$184k - $287.5k
...Join our team at NVIDIA and help bring AI solutions to our largest customers. We are seeking an expert Solutions Architect to assist customers in building AI/ML and HPC software... ...aspects related to tasks like large scale LLM training and inference. Conducting regular...Suggested- CloudAct Inc. seeks a talented Product Designer to shape their LLM infrastructure. You will own end-to-end surfaces, collaborating closely with engineering to create high-fidelity designs and maintain the Nemo design system. The ideal candidate has at least 4 years of experience...Suggested
$100k - $180k
...Job Title: SOLUTION ARCHITECT L1 City: Sunnyvale State/Province: California Posting Start Date: 6/2/26 Wipro... ...requirementsFamiliarity with AI agentic frameworks, MCPs and LLM libraries (e.g., Anthropic SDK). Understanding of LLM...SuggestedMinimum wageLocal area$152k - $241.5k
...out and enhance AI inference solutions at scale, demonstrating NVIDIA... ...and Kubernetes. As a Solutions Architect focused on inference, you’ll collaborate... ...pipelines using TensorRT-LLM, vLLM, SGLang, and other... ...Inference Server, or TensorRT-LLM for model optimization and serving. ~...Suggested$184k - $287.5k
...you to help shape that work. As a Senior Solution Architect on our Telco AI team, you will design... ...operations using the latest generative models, NLP, RAG pipelines, and large-scale distributed... ..., and continuously improving agentic LLM applications targeting Telco Network...SuggestedRemote work$184k - $287.5k
**What You Will Be Doing:*** Partnering with other solution architects, engineering, product and business teams. Understanding their strategies... ...* Excellent knowledge of the theory and practice of LLM and DL inference* Excellent presentation, communication and collaboration...- ...looking for an expert AV and Robotics Solutions Architect who can help customers accelerate Physical... ...AV perception and planning/policy models, simulations, synthetic data generation... ...such as model training and validations, LLM, VLA, VLM, World Models, video encoding...
$152k - $241.5k
...simulation, synthetic data generation, multi‑step model training, and inference, all on a large scale! We are seeking a hands‑on Solutions Architect with deep expertise in backend... ...inference pipelines using NVIDIA NIM, TensorRT‑LLM, vLLM, SGLang, and other engines to...$140k - $210k
...built on CoreWeave. We hire technical, AI Solution Architects who want to operate the full stack, own... ...’s full platform: infrastructure, Models, Weave, observability, and inference. You... ...deep learning models, including modern LLM architectures Experience designing and...Permanent employmentTemporary workCasual workWork at officeFlexible hours$152k - $241.5k
...seeking an outstanding AI Engineer or Solutions Architect to join our growing team focused on ecosystem... ...work experience in deploying AI models at scale as a Software Engineer or Deep... ...learning, with a particular emphasis on LLM and VLM. ~ Hands-on experience with LLM...Work experience placement$152k - $241.5k
...counterparts to develop accelerated compute and AI solutions for their customers? We are looking for a hardworking and self-starting Solutions Architect with a passion for accelerated compute,... ...projects. Background in Agentic AI, LLM deployment, and building ingestion/...Remote work$152k - $241.5k
...Join NVIDIA as a Solutions Architect to own the evolution of Agentic AI for the enterprise. You will collaborate with top-tier enterprise software... ...grade deployment. As a trusted advisor, you’ll transform raw LLM capabilities into high-performance, industry-focused...$184k - $287.5k
...NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building... ...on Generative AI and Large Language Models (LLMs). You will also collaborate with... ...Inference Server ( , TensorRT ( , TensorRT-LLM ( Excellent C/C++ programming skills,...$152k - $241.5k
...customers in the world? NVIDIA is looking for an experienced Solutions Architect to assist customers with adoption of GPU hardware and software... ...with NVIDIA GPUs and SDKs (i.e. CUDA, Triton, TensorRT-LLM, etc.) Deep understanding of the full software development...Local areaRemote work$184k - $287.5k
...Senior Solutions Architect - AI Factory Deployment We are seeking an ambitious Senior Solutions Architect - AI Factory Deployment to join... ...factories end to end. You will focus on running and debugging AI/LLM workloads and benchmarks on Linux-based GPU clusters, using...$184k - $287.5k
...are looking for an expert AV and GenAI Solutions Architect to help assist customers with adoption... ...for building AV perception and planning models and pipelines, simulations, synthetic data... ..., DiT, etc. Experience in deploying LLM models at scale on mainstream cloud...$184k - $287.5k
...world. NVIDIA is seeking an experienced Solutions Architect to be a trusted technical advisor,... ...by integrating libraries, frameworks, models, and software applications. Deliver GenAI... ...Triton Inference Server, TensorRT, TensorRT-LLM, NVIDIA CUDA-X Hands-on expertise...Remote work- ...Senior Solutions Architect, Generative AI Specialist Job Type: Full-time, Remote Responsibilities... ...agentic AI systems and complex LLM‑powered workflows. BS/MS/PhD in Computer... ...MLOps principles, including CI/CD for models, automated training pipelines, and production...Full timeRemote work
$184k - $287.5k
...Partner (ATP) team as a Senior Solutions Architect who thrives at the... ...full AI stack, from initial model selection to large-scale deployment... ...ML systems and sophisticated LLM workflows. ~ MS or advanced... ...body of work demonstrating your builder foundation: open-source contributions...- ...Technical Architect - AI/ML, LLM Location: Santa Clara, CA 95054 (Onsite) Full-Time... ...strong hands-on experience in Large Language Models (LLMs) to design, build, and deploy production-grade AI solutions . This role focuses on advanced model development...Full time
- NVIDIA Gruppe is seeking a Principal Systems Engineer to define the vision for memory management in large-scale LLM and storage systems. This role involves designing a unified memory layer and integrating with leading LLM serving engines. The ideal candidate has 15+ years...
$220.2k - $330.4k
...industries through intelligent edge solutions that combine connectivity,... ...customer data, fine‑tuned models, and inference loads to... ...solutions for Generative AI (LLM, VLM, VLA), Agentic AI, Voice... ...Principal Systems Solutions Architect, you will define, develop, document...Work experience placementWork at office$19 - $65 per hour
PlusAI, located in Santa Clara, is seeking a Simulation Engineer Intern to join its innovative team. You will leverage large language models to generate driving scenarios and interface directly with simulation environments, ensuring they adhere to traffic laws. The role...Hourly payInternship$251k - $377k
...systems. You Are: You are an accomplished and visionary architect with a passion for solving complex technical challenges and enabling... ..., influencing technology roadmaps, and delivering tailored solutions that drive business impact. You have a proventrack recordof...Remote workWorldwide- ...expansion by identifying and addressing customer business and technical needs, providing consulting and planning services, formulating solutions, overseeing delivery and implementation, and leading pre-sales POC tests. Create customer benchmark projects in industries such...
$170k
...Solutions Architect, PS Location: Sunnyvale, CA Experience: 7-10 Years Fortune 500 clients and government agencies trust eGain AI knowledge solution to improve customer experience and reduce cost of service. Top rated by Gartner, eGain AI Knowledge Hub orchestrates...Work at office$130.7k - $261.3k
...scientists. THE OPPORTUNITY This Senior Cloud Solutions Architect position can work out of our Santa... ...key functional gaps, data models, interface, transformation and technical... ...support, imaging). Agentic AI workflows and LLM-based systems. RAG architectures with vector...Contract work$65k - $400k
...Decisive Point is seeking a RTOS Solutions Architect in Sunnyvale, CA. This role involves developing real-time operating systems for next-generation vehicle intelligence and software. The ideal candidate will have over 10 years of experience in embedded software development...- ...NVIDIA Gruppe is seeking a Solutions Architect in Robotics Simulation to lead innovations driven by AI and 3D simulation. You will guide partners in overcoming robotics technical challenges using NVIDIA's groundbreaking technologies. The position demands a Bachelor’s degree...
- ...headquartered in metropolitan Atlanta, GA with prime emphasis on the following service offerings: Staff Augmentation Lifecycle IT solutions Application Development & Support Outsourced Testing Mobile Development and Test Automation The company was incorporated in the State...Worldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Solutions Architect, LLM Model Builder. Be the first to apply!
- senior cloud solutions architect Santa Clara, CA
- anaplan senior solutions architect Santa Clara, CA
- contact center solution architect Santa Clara, CA
- entry level aws solution architect Santa Clara, CA
- senior solution manager Santa Clara, CA
- business solutions architect Santa Clara, CA
- sap solution architect Santa Clara, CA
- senior solutions architect Santa Clara, CA
- solutions architect Santa Clara, CA
- aws solution architect Santa Clara, CA


