Applied AI Inference - Forward Deployed Engineer
Baseten
ABOUT BASETEN
Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers rely on to ship AI products.
THE ROLE
As a Forward Deployed Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. You'll own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes. This role is a great fit for entrepreneurial engineers who want a front-row view into how modern companies adopt AI at scale and who enjoy working across product, software development, performance engineering, and customer-facing implementations. To be clear, this is an engineering role with hands-on coding and software development that also includes aspects of product management, technical customer success, and pre-sales solution engineering.
EXAMPLE INITIATIVES
Take a look at these blog posts written by members of our Forward Deployed Engineering team:
- Forward Deployed Engineering on the frontier of AI
- The fastest, most accurate Whisper transcription
- Deploy production-ready model servers from Docker images
- Deploy custom ComfyUI workflows as APIs
RESPONSIBILITIES
- Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects.
- Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers' engineering teams at every stage of the customer journey, including sales, implementation, and expansion.
- Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers.
- Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs.
- Own products and customer projects end-to-end, functioning as engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution.
- Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity.
- Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates.
REQUIREMENTS
- Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.
- 2+ years of professional work experience in a fast-paced, high-growth environment.
- Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python.
- Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment.
- Strong communication skills, particularly on complex technical topics.
- Experience in building or optimizing AI/ML projects is highly valued.
BENEFITS
- Competitive compensation, including meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employees and dependents
- Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
- Paid parental leave
- Fertility and family-building stipend through Carrot
- Company-facilitated 401(k)
- Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.
Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you. At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status. We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).
Vacancy posted 1 day ago