Member of Technical Staff, Forward Deployed AI Engineer

$175k - $275k

Inception LLC

Forward Deployed AI Engineer

Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today's LLMs, with best-in-class quality. We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO.

The Role

Inception is hiring Forward Deployed AI Engineers to help enterprise customers deliver the highest quality AI experiences using our diffusion-based language models. This role sits at the intersection of product engineering, customer implementation, evals, data collection, model optimization, and enterprise deployment ownership. You will work directly with enterprise customers to identify high-value AI workflows, collect and structure customer data, build LLM-as-judge evaluation systems, tune model and product behavior for customer-specific goals, and turn fast proof-of-concepts into production deployments. This is not a traditional solutions engineering role, a pure research role, or a long-cycle consulting implementation role. We are looking for full-stack engineers who can operate close to customers, build real systems, communicate clearly, and move fast — including running fast POC cycles that take weeks to produce customer impact rather than exploratory research projects that take months. As an early member of the team responsible for turning Mercury models into high-value enterprise deployments and building the customer data flywheel that improves our models, products, and go-to-market motion. You will work closely with platform, serving, post-training, product engineering, and GTM teams to translate customer deployment learnings into model, product, and infrastructure improvements.

Key Responsibilities

Enterprise customer deployments: Work directly with strategic enterprise customers to identify high-value AI workflows and turn them into production deployments.
Rapid prototyping: Build and run fast proof-of-concepts, iterating on customer requirements and technical constraints on 2-week cycles.
Production AI applications: Build full-stack AI applications, agentic workflows, integrations, internal tools, and customer-facing systems that bring Inception models into real enterprise environments.
Data collection & feedback loops: Collect, structure, and operationalize customer data to improve model and product performance on customer use cases.
Measurement and Evaluation: Define success metrics for customer deployments and design LLM-as-judge workflows, evaluation harnesses, and feedback loops for customer-specific use cases.
Model and product optimization: Tune and customize Mercury models, prompts, workflows, and system architecture to meet customer-specific performance goals.
Agentic workflows: Build and optimize agentic workflows including subagents involving classification, routing, context compaction, search, coding agents, voice, and other latency-sensitive applications.
Build, prove, and generalize: Turn customer-specific deployments into repeatable product patterns, eval frameworks, implementation playbooks, and platform capabilities that improve Inception's core product.

Qualifications

BS/MS/PhD in Computer Science, Machine Learning, or a related field (or equivalent experience).
Strong engineering skills in Python and modern full-stack development, including APIs, backend systems, and ideally TypeScript/JavaScript.
Experience building, deploying, or integrating AI/LLM products with real users or customers.
Familiarity with LLM evaluation, LLM-as-judge workflows, data pipelines, model tuning, prompt optimization, or agentic workflows.
Customer-facing experience with enterprise, strategic, or high-value accounts.
Experience deploying software or AI systems in enterprise environments with security, privacy, reliability, compliance, or integration constraints.
Strong communication and discovery skills, with the ability to translate ambiguous customer needs into concrete technical solutions.
Ability to operate across engineering, product, sales, and customer success without requiring heavy process or handholding.
Willingness to work directly with customers in person when needed, including occasional travel for strategic deployments, workshops, and executive technical sessions.

Preferred Skills

Experience with RAG, search, voice AI, coding agents, or agentic workflow systems.
Experience deploying AI systems for Fortune 500 or large enterprise customers.
Track record owning technical pre-sales, post-sales, implementation, or customer expansion for million-dollar enterprise accounts.
Familiarity with LLM serving, latency optimization, model evaluation, or production ML systems.
Experience with data engineering, synthetic data generation, or feedback loops for model improvement.
Background in product engineering, ML product engineering, applied AI, or forward deployed engineering.
Experience working with customer-specific evals, benchmarks, and performance targets.
Familiarity with latency-sensitive applications, especially voice systems where response speed is critical.

A Note on the Role

This role is for builders who want to be close to customers and close to the product. We are not looking for traditional solutions engineers who only configure demos, nor researchers who primarily want to work on open-ended model experiments. The strongest candidates are full-stack engineers with enough ML fluency to work across LLM systems, evals, data, tuning, deployment, and production application development — and enough customer instinct to discover what matters, build quickly, and drive real adoption. This role is also not just about serving one-off customer requests. The best FDEs will identify repeatable patterns across deployments and turn those learnings into better product surfaces, platform capabilities, evals, playbooks, and model feedback loops.

A Note on Startup Fit

This is an in-office role at an early-stage company moving with high velocity. We're looking for engineers who are actively seeking a startup environment — comfortable with ambiguity, customer-facing work, rapid iteration, and end-to-end ownership. The team is small and high-leverage. You should be excited to work directly with enterprise customers, own ambiguous problems, and build the systems that convert customer demand into production AI deployments.

Compensation

The annual base salary range for this role is $175,000 – $275,000 USD. Final compensation is determined based on experience, skills, and qualifications. Equity and benefits are included in the total package.

Why Join Inception

Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers
Shape Foundational Technology: Your decisions will influence how the next generation of AI products are built and used
Immediate Impact: Join at the ground floor where your contributions directly shape product direction and company trajectory

Perks & Benefits

Competitive salary and equity in a rapidly growing startup
Flexible vacation and paid time off (PTO)
Health, dental, and vision insurance
401k match
Catered meals (breakfast, lunch, & dinner)
Commuter subsidies
A collaborative and inclusive culture

About Us

Inception creates the world's fastest, most efficient AI models. Today's autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception's diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality. Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO, UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning, and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion. We pioneered the application of diffusion to language, with world's first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well. Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft's venture fund), Snowflake Ventures,

Apply

Vacancy posted 3 hours ago

Similar jobs that could be interesting for youBased on the Member of Technical Staff, Forward Deployed AI Engineer in Palo Alto, CA vacancy

Forward-Deployed AI Engineer: Prototyping & Customer Impact
A leader in AI-driven e-commerce solutions seeks a Forward Deployed Engineer in Mountain View, CA. This role involves building prototypes with customers to address real-world problems while integrating data from various sources. Candidates should have 4-6 years of software...
Suggested
CommerceIQ
Mountain View, CA
14 hours ago
Forward Deployed AI Engineer (Agent Engineering)
$106.9k - $229.4k
...software: working directly with customers to design, build, and deploy AI agents that automate real workflows and decision‑making in production within weeks—not quarters. As a Forward Deployed AI Engineer, you will partner directly with customers and product managers to...
Suggested
SAP SE
Palo Alto, CA
3 days ago
Forward-Deployed AI Engineer: Enterprise Agent Systems
$106.9k - $229.4k
SAP SE is seeking a Forward Deployed AI Engineer in Palo Alto, California. You will work directly with customers to design and deploy production-grade AI systems that enhance workflows and decision-making processes. This hands-on role requires you to own the end-to-end...
Suggested
SAP SE
Palo Alto, CA
3 days ago
Remote AI Forward‑Deployed Engineer
A leading technology company is seeking an AI Forward Deployed Engineer to design AI solutions, develop integrated systems, and communicate results effectively. Candidates should have substantial experience in AI/ML, cloud services, and problem-solving capabilities. This...
Suggested
Remote job
IBM Computing
Santa Clara, CA
3 days ago
AI Forward Deployed Engineer
...helping them shape their hybrid cloud and AI journeys. With support from our strategic... ...Your role and responsibilities As an AI Forward Deployed Engineer, you will work with customers to understand their workflows, technical environments, and business objectives, translating...
Suggested
Worldwide
IBM Computing
Santa Clara, CA
1 day ago
Forward Deployed AI Engineer (Post-Sales)
$230k - $300k
...that cost more to train and deploy. At DatologyAI, we’ve... ...Partners, Microsoft, Amazon, and AI visionaries like Geoff... ...data research and data engineering necessary to solve this incredibly... ...are looking for a highly technical, customer-obsessed Forward Deployed AI Engineer (Post...
Full time
Work at office
Relocation package
Datology
Redwood City, CA
1 day ago
AI Forward-Deployed Engineer
$147.7k - $215k
...opportunity in UCaaS, Contact Center and AI-powered adjacencies. We invest more... ...in. We’re currently looking for: an AI Forward-Deployed Engineer This role is responsible for closing... ...acquisition and value realization. This is a technical deployment role, not a services or...
Full time
Local area
Remote work
Flexible hours
RingCentral
Belmont, CA
3 days ago
Enterprise AI Forward-Deployed Engineer
$90k - $145k
A leading Enterprise AI software company in Redwood City is seeking a Forward Deployed Engineer to provide expert product support for C3 AI applications. Candidates should have application development experience, capable of troubleshooting and customer interaction, and...
C3 AI
Redwood City, CA
1 day ago
Member of Technical Staff (AI Inference Engineer)
$190k - $250k
...Employment Type Full time Location Type Hybrid Department AI We are looking for an AI Inference engineer to join our growing team. Our current stack is Python... ...You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference....
Full time
Kindredventures
Palo Alto, CA
2 days ago
Member of Technical Staff (AI Infrastructure Engineer)
...AI Infra Engineer We are looking for an AI Infra engineer to join our growing team. We work with Kubernetes, Slurm, Python, C++, PyTorch... ...partnering closely with our Inference and Research teams to build, deploy, and optimize our large-scale AI training and inference...
Perplexity AI
Palo Alto, CA
2 days ago
Member of Technical Staff - AI Product Engineer - Web
$119.8k - $234.7k
...continues to push the boundaries of AI, we are on the lookout for... ...seeking experienced Product Engineers to help build the Copilot web... ...web applications Plan and deploy front end infrastructure necessary... ...computer science, or related technical discipline AND 4+ years...
Ongoing contract
Work at office
Local area
3 days per week
Microsoft Corporation
Mountain View, CA
6 days ago
Member of Technical Staff
...Member Of Technical Staff Memphis, TN About XAI XAI's mission is to create AI systems that can accurately understand... ...motivated, and focused on engineering excellence. This... ..., develop, and deploy scalable code and services... ...to drive forward-thinking solutions....
Xai
Palo Alto, CA
1 day ago
Member of Technical Staff - Backend Software Engineer
$140k - $160k
...Member Of Technical Staff - Backend Software Engineer This role is based in Palo Alto, California,... ...was written by humans or AI, and whether it's 40-year... ...end to end, from IDE to deployment, fueling the next... ...ensuring alignment and forward momentum. Adaptable Innovator...
Shift work
Endor Labs
Palo Alto, CA
3 days ago
Member of Technical Staff, AI Platform Engineer
$119.8k - $234.7k
...continues to push the boundaries of AI, we are on the lookout for... ...(MS AI) is seeking skilled engineers to advance the next... ...and machine learning, bringing technical expertise and a methodical problem... ...including experience using, deploying, and?scaling language learning...
Ongoing contract
Work at office
Local area
3 days per week
Microsoft Corporation
Mountain View, CA
14 hours ago
Software Systems Engineering
$135k - $200k
...Forward Deployed Software Engineer - Edge Autonomous Systems Title of Role: Forward Deployed Software Engineer - Edge Autonomous Systems Location... ..., focusing on latency and resilience. Act as a technical liaison between engineering teams and end users, clarifying...
Work at office
Recruiting from Scratch
Palo Alto, CA
1 day ago
Software Systems Engineering
$135k - $200k
...missing children, and more. The Role We are seeking a Forward Deployed Software Engineer to join a newly-formed team focused on developing advanced... ...Autonomy C2 solutions into Palantir platform and with AI and autonomy software solutions such sensor and data fusion...
Work experience placement
Work at office
Remote work
Work from home
Relocation package
Palantir Technologies
Palo Alto, CA
1 day ago
Forward‑Deployed AI Engineer | GenAI Production Expert
An innovative AI solutions company in Palo Alto is looking for an Applied AI Engineer. The role involves facilitating customer adoption of AI products by addressing complex technical challenges. You will manage daily relations, onboard clients on use products, and work...
Mistral AI
Palo Alto, CA
4 days ago
Senior AI Solutions Engineer for Enterprise Deployments
Uniphore Technologies North America Inc is seeking a Senior Forward Deployed Engineer in Palo Alto, CA. This role involves leading the architecture and delivery of AI solutions and requires strong technical ownership and teamwork. Ideal candidates should have a deep understanding...
Uniphore Technologies North America Inc
Palo Alto, CA
3 days ago
X Developer Platform - Forward Deployed Engineer, X API
$180k
...xAI's mission is to create AI systems that can accurately understand... ...motivated, and focused on engineering excellence. This organization... ...are seeking an exceptional Forward Deployed Engineer who will work at the intersection of deep technical implementation and world-...
Temporary work
Flexible hours
xAI
Palo Alto, CA
22 days ago
Member of Technical Staff, Software Engineer
$200k - $350k
...Member Of Technical Staff, Software Engineer Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially... ...Actions, Jenkins, BuildKite) and deployment automation. ~ Familiarity with...
Immediate start
Flexible hours
Inception LLC
Palo Alto, CA
3 days ago
Member of Technical Staff, AI Product, Android Engineer
$119.8k - $234.7k
...Overview Microsoft AI is looking for a talented Android engineer to help build the next wave of capabilities of our personalized AI assistant, Copilot... ...~ Bachelor’s degree in computer science, or related technical discipline AND 4+ years technical engineering experience...
Ongoing contract
Work at office
Local area
Microsoft Corporation
Mountain View, CA
5 days ago
Field AI Vision Engineer - VLM & VFM Deployment
...(VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer in Palo Alto, CA. The role involves training and deploying vision-centric AI systems, defining performance benchmarks... ...hold a Bachelor’s degree in a relevant technical field with proficiency in Python and experience...
Flexible hours
Matroid Inc.
Palo Alto, CA
4 days ago
Enterprise AI Deployment Engineer (Hybrid, Equity)
A leading AI-driven company in Palo Alto is seeking a Forward Deployed Engineer to bridge their advanced platform with strategic enterprise customers. In this hybrid role, you'll drive technical success from initial deployment to production optimization, involving work...
Nectar
Palo Alto, CA
14 hours ago
Member of Technical Staff - Inference
$180k
...xAI's mission is to create AI systems that can accurately... ...motivated, and focused on engineering excellence. This organization... ...reliability. As a Member of Technical Staff - Inference, you will design... ...infrastructure for seamless endpoint deployment, image publishing, and...
Temporary work
xAI
Palo Alto, CA
a month ago
Member of Technical Staff - AI Product - Backend Engineer
$119.8k - $234.7k
...Overview Microsoft AI is looking for a talented Backend engineer to help build the next wave of capabilities of our personalized AI assistant, Copilot... ...~ Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with...
Ongoing contract
Work at office
Local area
3 days per week
Microsoft Corporation
Mountain View, CA
14 hours ago
Member of Technical Staff, Software Engineering
...making it uniquely positioned for AI to have an enormous impact. Stage... ...radiology datasets. About the Role As a Member of Technical Staff in Software Engineering, you will build the full stack... ...design and implementation through deployment and operation. What You Bring We...
Cognita Imaging Inc.
Palo Alto, CA
2 days ago
Forward-Deployed AI & Cloud Enterprise Engineer
...Palo Alto is seeking a skilled professional to design and develop AI native enterprise applications. In this role, you will work... ...ideal candidate will possess extensive experience in software engineering and a strong understanding of architectural principles. Join a...
SAP SE
Palo Alto, CA
4 days ago
Member of Technical Staff - Supercomputing
...About the Role RadixArk is hiring a Member of Technical Staff - Supercomputing to help build, deploy, and operate production-grade AI infrastructure for frontier-scale... ...This role sits at the intersection of engineering, deployment, reliability, and customer...
Flexible hours
RadixArk
Palo Alto, CA
4 days ago
Sr. Member of Technical Staff, Architecture
...Powered by the Illumio AI Security Graph, our... ...Team's Vision Our Engineering team is driven by a culture... ...that drives us forward in the ever-evolving landscape... ...and become productive members of the team. ~... ...their successful deployment and usage by customers...
Immediate start
Illumio
Sunnyvale, CA
1 day ago
Member of Technical Staff, Reinforcement Learning
$200k - $350k
...Member Of Technical Staff, Reinforcement Learning Inception creates the world... ...'s fastest, most efficient AI models. Our Mercury model is... ...are the AI researchers and engineers behind such breakthrough AI... ...Mercury. We are currently deploying our large-scale diffusion LLMs...
Immediate start
Flexible hours
Inception LLC
Palo Alto, CA
4 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, Forward Deployed AI Engineer. Be the first to apply!