Applied AI Inference - Forward Deployed Engineer

Baseten

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE ROLE

As a Forward Deployed Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. You'll own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes.

This role is a great fit for entrepreneurial engineers who want a front-row view into how modern companies adopt AI at scale and who enjoy working across product, software development, performance engineering, and customer-facing implementations.

To be clear, this is an engineering role with hands-on coding and software development. It also includes aspects of product management, technical customer success, and pre-sales solution engineering.

EXAMPLE INITIATIVES

Take a look at these blog posts written by members of our Forward Deployed Engineering team:

  • Forward Deployed Engineering on the frontier of AI
  • The fastest, most accurate Whisper transcription
  • Deploy production-ready model servers from Docker images
  • Deploy custom ComfyUI workflows as APIs
RESPONSIBILITIES
  • Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects.
  • Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers' engineering teams at every stage of the customer journey, including sales, implementation, and expansion.
  • Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers.
  • Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs.
  • Own products and customer projects end-to-end, functioning as engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution.
  • Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity.
  • Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates.
REQUIREMENTS
  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.
  • 2+ years of professional work experience in a fast-paced, high-growth environment.
  • Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python.
  • Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment.
  • Strong communication skills, particularly on complex technical topics.
  • Experience in building or optimizing AI/ML projects is highly valued.
BENEFITS
  • Competitive compensation, including meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employees and dependents.
  • Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!).
  • Paid parental leave.
  • Fertility and family-building stipend through Carrot.
  • Company-facilitated 401(k).
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).
Vacancy posted 1 day ago