Solution Architect (AI/LLM Inference)

Baseten

ABOUT BASETEN

Baseten powers mission‑critical inference for the world’s most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting‑edge models into production. We’re growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform that engineers rely on to ship AI products.

THE ROLE

As a Solution Architect (AI/LLM Inference) at Baseten you will partner closely with Sales and customers to translate business needs into technical solutions, run technical discovery, and guide repeatable deployments and proofs of value for customers. This role is a great fit for entrepreneurial, customer‑facing technical professionals who want a front‑row view into how modern companies adopt AI at scale, and who enjoy working across technical discovery, solution design, demos, deployment scoping, and hands‑on customer implementations, in close partnership with Sales and Engineering.

RESPONSIBILITIES

  • Partner with Sales on customer discovery calls (most often second calls, occasionally first calls for large accounts).
  • Lead demos and technical scoping to align on success criteria, architecture, and deployment approach.
  • Own benchmarking and repeatable deployments, including:
    • Handling standard deployment patterns and configurations across many modalities: LLMs, embeddings, image and video generation, VoiceAI, etc.
    • Advising on tradeoffs like H100s vs B200s and latency‑optimized vs throughput‑optimized setups.
    • Driving consistent “playbook”‑style deployments for common models and use cases.
  • Become a power user of different runtimes such as vLLM, SGLang, and TRT‑LLM, and learn the common configurations and tradeoffs between them.
  • Drive POC and project execution, including:
    • Scoping POCs and keeping stakeholders aligned on timeline, deliverables, and next steps.
    • Acting as the “ringleader” or project manager for POCs.
    • Pulling in Forward Deployed Engineering (FDE) support when deeper or more complex technical work is needed.

REQUIREMENTS

  • AI/ML background and the ability to credibly discuss AI/ML topics with technical stakeholders.
  • Strong customer‑facing communication skills, including the ability to run structured discovery and clarify ambiguous requirements.
  • Technical depth to scope solutions, without needing to write production code.
  • Ability to script and prototype as needed, including comfort “vibe coding” to move quickly in technical workflows.

NICE TO HAVE

  • Experience running or supporting benchmarks for ML inference deployments.
  • Familiarity with infrastructure tradeoffs relevant to inference performance and cost (for example, GPU selection and latency versus throughput tuning).
  • Experience serving as a cross‑functional technical lead for customer POCs, including coordination across Sales and Engineering.

BENEFITS

  • Competitive compensation, including meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employees and dependents.
  • Flexible PTO policy, including a company‑wide Winter Break (our offices are closed from Christmas Eve to New Year’s Day!).
  • Paid parental leave.
  • Fertility and family‑building stipend through Carrot.
  • Company‑facilitated 401(k).
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward‑thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).

Vacancy posted 3 days ago