Solution Architect (AI/LLM Inference)
Baseten
ABOUT BASETEN
Baseten powers mission‑critical inference for the world’s most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting‑edge models into production. We’re growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.
THE ROLE
As a Solution Architect (AI/LLM Inference) at Baseten you will partner closely with Sales and customers to translate business needs into technical solutions, run technical discovery, and guide repeatable deployments and proofs of value for customers. This role is a great fit for entrepreneurial, customer‑facing technical professionals who want a front‑row view into how modern companies adopt AI at scale, and who enjoy working across technical discovery, solution design, demos, deployment scoping, and hands‑on customer implementations, in close partnership with Sales and Engineering.
RESPONSIBILITIES
- Partner with Sales on customer discovery calls (most often second calls, occasionally first calls for large accounts).
- Lead demos and technical scoping to align on success criteria, architecture, and deployment approach.
- Own benchmarking and repeatable deployments, including:
  - Handling standard deployment patterns and configurations across many modalities: LLMs, embeddings, image and video generation, VoiceAI, etc.
  - Advising on tradeoffs like H100s vs B200s and latency‑optimized vs throughput‑optimized setups.
  - Driving consistent “playbook” style deployments for common models and use cases.
- Become a power user of different runtimes such as vLLM, SGLang, and TRT‑LLM, including the common configurations and tradeoffs between them.
- Drive POC and project execution, including:
  - Scoping POCs and keeping stakeholders aligned on timeline, deliverables, and next steps.
  - Acting as the “ringleader” or project manager for POCs.
  - Pulling in Forward Deployed Engineering (FDE) support when deeper or more complex technical work is needed.
REQUIREMENTS
- AI/ML background and the ability to credibly discuss AI/ML topics with technical stakeholders.
- Strong customer‑facing communication skills, including the ability to run structured discovery and clarify ambiguous requirements.
- Technical depth to scope solutions, without needing to write production code.
- Ability to script and prototype as needed, including comfort “vibe coding” to move quickly in technical workflows.
NICE TO HAVE
- Experience running or supporting benchmarks for ML inference deployments.
- Familiarity with infrastructure tradeoffs relevant to inference performance and cost (for example GPU selection and latency versus throughput tuning).
- Experience serving as a cross‑functional technical lead for customer POCs, including coordination across Sales and Engineering.
BENEFITS
- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employees and dependents.
- Flexible PTO policy, including a company‑wide Winter Break (our offices are closed from Christmas Eve to New Year’s Day!).
- Paid parental leave.
- Fertility and family‑building stipend through Carrot.
- Company‑facilitated 401(k).
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward‑thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).

Vacancy posted 3 days ago

