Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, Forward Deployed AI Engineer

$175k - $275k

Inception LLC

Forward Deployed AI Engineer

Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today's LLMs, with best-in-class quality. We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO.

The Role

Inception is hiring Forward Deployed AI Engineers to help enterprise customers deliver the highest quality AI experiences using our diffusion-based language models. This role sits at the intersection of product engineering, customer implementation, evals, data collection, model optimization, and enterprise deployment ownership. You will work directly with enterprise customers to identify high-value AI workflows, collect and structure customer data, build LLM-as-judge evaluation systems, tune model and product behavior for customer-specific goals, and turn fast proof-of-concepts into production deployments. This is not a traditional solutions engineering role, a pure research role, or a long-cycle consulting implementation role. We are looking for full-stack engineers who can operate close to customers, build real systems, communicate clearly, and move fast — including running fast POC cycles that take weeks to produce customer impact rather than exploratory research projects that take months. As an early member of the team responsible for turning Mercury models into high-value enterprise deployments and building the customer data flywheel that improves our models, products, and go-to-market motion. You will work closely with platform, serving, post-training, product engineering, and GTM teams to translate customer deployment learnings into model, product, and infrastructure improvements.

Key Responsibilities
  • Enterprise customer deployments: Work directly with strategic enterprise customers to identify high-value AI workflows and turn them into production deployments.
  • Rapid prototyping: Build and run fast proof-of-concepts, iterating on customer requirements and technical constraints on 2-week cycles.
  • Production AI applications: Build full-stack AI applications, agentic workflows, integrations, internal tools, and customer-facing systems that bring Inception models into real enterprise environments.
  • Data collection & feedback loops: Collect, structure, and operationalize customer data to improve model and product performance on customer use cases.
  • Measurement and Evaluation: Define success metrics for customer deployments and design LLM-as-judge workflows, evaluation harnesses, and feedback loops for customer-specific use cases.
  • Model and product optimization: Tune and customize Mercury models, prompts, workflows, and system architecture to meet customer-specific performance goals.
  • Agentic workflows: Build and optimize agentic workflows including subagents involving classification, routing, context compaction, search, coding agents, voice, and other latency-sensitive applications.
  • Build, prove, and generalize: Turn customer-specific deployments into repeatable product patterns, eval frameworks, implementation playbooks, and platform capabilities that improve Inception's core product.
Qualifications
  • BS/MS/PhD in Computer Science, Machine Learning, or a related field (or equivalent experience).
  • Strong engineering skills in Python and modern full-stack development, including APIs, backend systems, and ideally TypeScript/JavaScript.
  • Experience building, deploying, or integrating AI/LLM products with real users or customers.
  • Familiarity with LLM evaluation, LLM-as-judge workflows, data pipelines, model tuning, prompt optimization, or agentic workflows.
  • Customer-facing experience with enterprise, strategic, or high-value accounts.
  • Experience deploying software or AI systems in enterprise environments with security, privacy, reliability, compliance, or integration constraints.
  • Strong communication and discovery skills, with the ability to translate ambiguous customer needs into concrete technical solutions.
  • Ability to operate across engineering, product, sales, and customer success without requiring heavy process or handholding.
  • Willingness to work directly with customers in person when needed, including occasional travel for strategic deployments, workshops, and executive technical sessions.
Preferred Skills
  • Experience with RAG, search, voice AI, coding agents, or agentic workflow systems.
  • Experience deploying AI systems for Fortune 500 or large enterprise customers.
  • Track record owning technical pre-sales, post-sales, implementation, or customer expansion for million-dollar enterprise accounts.
  • Familiarity with LLM serving, latency optimization, model evaluation, or production ML systems.
  • Experience with data engineering, synthetic data generation, or feedback loops for model improvement.
  • Background in product engineering, ML product engineering, applied AI, or forward deployed engineering.
  • Experience working with customer-specific evals, benchmarks, and performance targets.
  • Familiarity with latency-sensitive applications, especially voice systems where response speed is critical.
A Note on the Role

This role is for builders who want to be close to customers and close to the product. We are not looking for traditional solutions engineers who only configure demos, nor researchers who primarily want to work on open-ended model experiments. The strongest candidates are full-stack engineers with enough ML fluency to work across LLM systems, evals, data, tuning, deployment, and production application development — and enough customer instinct to discover what matters, build quickly, and drive real adoption. This role is also not just about serving one-off customer requests. The best FDEs will identify repeatable patterns across deployments and turn those learnings into better product surfaces, platform capabilities, evals, playbooks, and model feedback loops.

A Note on Startup Fit

This is an in-office role at an early-stage company moving with high velocity. We're looking for engineers who are actively seeking a startup environment — comfortable with ambiguity, customer-facing work, rapid iteration, and end-to-end ownership. The team is small and high-leverage. You should be excited to work directly with enterprise customers, own ambiguous problems, and build the systems that convert customer demand into production AI deployments.

Compensation

The annual base salary range for this role is $175,000 – $275,000 USD. Final compensation is determined based on experience, skills, and qualifications. Equity and benefits are included in the total package.

Why Join Inception
  • Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers
  • Shape Foundational Technology: Your decisions will influence how the next generation of AI products are built and used
  • Immediate Impact: Join at the ground floor where your contributions directly shape product direction and company trajectory
Perks & Benefits
  • Competitive salary and equity in a rapidly growing startup
  • Flexible vacation and paid time off (PTO)
  • Health, dental, and vision insurance
  • 401k match
  • Catered meals (breakfast, lunch, & dinner)
  • Commuter subsidies
  • A collaborative and inclusive culture
About Us

Inception creates the world's fastest, most efficient AI models. Today's autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception's diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality. Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO, UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning, and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion. We pioneered the application of diffusion to language, with world's first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well. Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft's venture fund), Snowflake Ventures,

Vacancy posted 3 hours ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, Forward Deployed AI Engineer in Palo Alto, CA vacancy
  • A leader in AI-driven e-commerce solutions seeks a Forward Deployed Engineer in Mountain View, CA. This role involves building prototypes with customers to address real-world problems while integrating data from various sources. Candidates should have 4-6 years of software... 
    Suggested

    CommerceIQ

    Mountain View, CA
    14 hours ago
  • $106.9k - $229.4k

     ...software: working directly with customers to design, build, and deploy AI agents that automate real workflows and decision‑making in production within weeks—not quarters. As a Forward Deployed AI Engineer, you will partner directly with customers and product managers to... 
    Suggested

    SAP SE

    Palo Alto, CA
    3 days ago
  • $106.9k - $229.4k

    SAP SE is seeking a Forward Deployed AI Engineer in Palo Alto, California. You will work directly with customers to design and deploy production-grade AI systems that enhance workflows and decision-making processes. This hands-on role requires you to own the end-to-end... 
    Suggested

    SAP SE

    Palo Alto, CA
    3 days ago
  • A leading technology company is seeking an AI Forward Deployed Engineer to design AI solutions, develop integrated systems, and communicate results effectively. Candidates should have substantial experience in AI/ML, cloud services, and problem-solving capabilities. This... 
    Suggested
    Remote job

    IBM Computing

    Santa Clara, CA
    3 days ago
  •  ...helping them shape their hybrid cloud and AI journeys. With support from our strategic...  ...Your role and responsibilities As an AI Forward Deployed Engineer, you will work with customers to understand their workflows, technical environments, and business objectives, translating... 
    Suggested
    Worldwide

    IBM Computing

    Santa Clara, CA
    1 day ago
  • $230k - $300k

     ...that cost more to train and deploy. At DatologyAI, we’ve...  ...Partners, Microsoft, Amazon, and AI visionaries like Geoff...  ...data research and data engineering necessary to solve this incredibly...  ...are looking for a highly technical, customer-obsessed Forward Deployed AI Engineer (Post... 
    Full time
    Work at office
    Relocation package

    Datology

    Redwood City, CA
    1 day ago
  • $147.7k - $215k

     ...opportunity in UCaaS, Contact Center and AI-powered adjacencies. We invest more...  ...in. We’re currently looking for: an AI Forward-Deployed Engineer This role is responsible for closing...  ...acquisition and value realization. This is a technical deployment role, not a services or... 
    Full time
    Local area
    Remote work
    Flexible hours

    RingCentral

    Belmont, CA
    3 days ago
  • $90k - $145k

    A leading Enterprise AI software company in Redwood City is seeking a Forward Deployed Engineer to provide expert product support for C3 AI applications. Candidates should have application development experience, capable of troubleshooting and customer interaction, and... 

    C3 AI

    Redwood City, CA
    1 day ago
  • $190k - $250k

     ...Employment Type Full time Location Type Hybrid Department AI We are looking for an AI Inference engineer to join our growing team. Our current stack is Python...  ...You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.... 
    Full time

    Kindredventures

    Palo Alto, CA
    2 days ago
  •  ...AI Infra Engineer We are looking for an AI Infra engineer to join our growing team. We work with Kubernetes, Slurm, Python, C++, PyTorch...  ...partnering closely with our Inference and Research teams to build, deploy, and optimize our large-scale AI training and inference... 

    Perplexity AI

    Palo Alto, CA
    2 days ago
  • $119.8k - $234.7k

     ...continues to push the boundaries of AI, we are on the lookout for...  ...seeking experienced Product Engineers to help build the Copilot web...  ...web applications Plan and deploy front end infrastructure necessary...  ...computer science, or related technical discipline AND 4+ years... 
    Ongoing contract
    Work at office
    Local area
    3 days per week

    Microsoft Corporation

    Mountain View, CA
    6 days ago
  •  ...Member Of Technical Staff Memphis, TN About XAI XAI's mission is to create AI systems that can accurately understand...  ...motivated, and focused on engineering excellence. This...  ..., develop, and deploy scalable code and services...  ...to drive forward-thinking solutions.... 

    Xai

    Palo Alto, CA
    1 day ago
  • $140k - $160k

     ...Member Of Technical Staff - Backend Software Engineer This role is based in Palo Alto, California,...  ...was written by humans or AI, and whether it's 40-year...  ...end to end, from IDE to deployment, fueling the next...  ...ensuring alignment and forward momentum. Adaptable Innovator... 
    Shift work

    Endor Labs

    Palo Alto, CA
    3 days ago
  • $119.8k - $234.7k

     ...continues to push the boundaries of AI, we are on the lookout for...  ...(MS AI) is seeking skilled engineers to advance the next...  ...and machine learning, bringing technical expertise and a methodical problem...  ...including experience using, deploying, and?scaling language learning... 
    Ongoing contract
    Work at office
    Local area
    3 days per week

    Microsoft Corporation

    Mountain View, CA
    14 hours ago
  • $135k - $200k

     ...Forward Deployed Software Engineer - Edge Autonomous Systems Title of Role: Forward Deployed Software Engineer - Edge Autonomous Systems Location...  ..., focusing on latency and resilience. Act as a technical liaison between engineering teams and end users, clarifying... 
    Work at office

    Recruiting from Scratch

    Palo Alto, CA
    1 day ago
  • $135k - $200k

     ...missing children, and more. The Role We are seeking a Forward Deployed Software Engineer to join a newly-formed team focused on developing advanced...  ...Autonomy C2 solutions into Palantir platform and with AI and autonomy software solutions such sensor and data fusion... 
    Work experience placement
    Work at office
    Remote work
    Work from home
    Relocation package

    Palantir Technologies

    Palo Alto, CA
    1 day ago
  • An innovative AI solutions company in Palo Alto is looking for an Applied AI Engineer. The role involves facilitating customer adoption of AI products by addressing complex technical challenges. You will manage daily relations, onboard clients on use products, and work... 

    Mistral AI

    Palo Alto, CA
    4 days ago
  • Uniphore Technologies North America Inc is seeking a Senior Forward Deployed Engineer in Palo Alto, CA. This role involves leading the architecture and delivery of AI solutions and requires strong technical ownership and teamwork. Ideal candidates should have a deep understanding... 

    Uniphore Technologies North America Inc

    Palo Alto, CA
    3 days ago
  • $180k

     ...xAI's mission is to create AI systems that can accurately understand...  ...motivated, and focused on engineering excellence. This organization...  ...are seeking an exceptional Forward Deployed Engineer who will work at the intersection of deep technical implementation and world-... 
    Temporary work
    Flexible hours

    xAI

    Palo Alto, CA
    22 days ago
  • $200k - $350k

     ...Member Of Technical Staff, Software Engineer Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially...  ...Actions, Jenkins, BuildKite) and deployment automation. ~ Familiarity with... 
    Immediate start
    Flexible hours

    Inception LLC

    Palo Alto, CA
    3 days ago
  • $119.8k - $234.7k

     ...Overview Microsoft AI is looking for a talented Android engineer to help build the next wave of capabilities of our personalized AI assistant, Copilot...  ...~ Bachelor’s degree in computer science, or related technical discipline AND 4+ years technical engineering experience... 
    Ongoing contract
    Work at office
    Local area

    Microsoft Corporation

    Mountain View, CA
    5 days ago
  •  ...(VLM) & Visual Foundation Model (VFM) Forward Deployed Engineer in Palo Alto, CA. The role involves training and deploying vision-centric AI systems, defining performance benchmarks...  ...hold a Bachelor’s degree in a relevant technical field with proficiency in Python and experience... 
    Flexible hours

    Matroid Inc.

    Palo Alto, CA
    4 days ago
  • A leading AI-driven company in Palo Alto is seeking a Forward Deployed Engineer to bridge their advanced platform with strategic enterprise customers. In this hybrid role, you'll drive technical success from initial deployment to production optimization, involving work... 

    Nectar

    Palo Alto, CA
    14 hours ago
  • $180k

     ...xAI's mission is to create AI systems that can accurately...  ...motivated, and focused on engineering excellence. This organization...  ...reliability. As a Member of Technical Staff - Inference, you will design...  ...infrastructure for seamless endpoint deployment, image publishing, and... 
    Temporary work

    xAI

    Palo Alto, CA
    a month ago
  • $119.8k - $234.7k

     ...Overview Microsoft AI is looking for a talented Backend engineer to help build the next wave of capabilities of our personalized AI assistant, Copilot...  ...~ Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with... 
    Ongoing contract
    Work at office
    Local area
    3 days per week

    Microsoft Corporation

    Mountain View, CA
    14 hours ago
  •  ...making it uniquely positioned for AI to have an enormous impact. Stage...  ...radiology datasets. About the Role As a Member of Technical Staff in Software Engineering, you will build the full stack...  ...design and implementation through deployment and operation. What You Bring We... 

    Cognita Imaging Inc.

    Palo Alto, CA
    2 days ago
  •  ...Palo Alto is seeking a skilled professional to design and develop AI native enterprise applications. In this role, you will work...  ...ideal candidate will possess extensive experience in software engineering and a strong understanding of architectural principles. Join a... 

    SAP SE

    Palo Alto, CA
    4 days ago
  •  ...About the Role RadixArk is hiring a Member of Technical Staff - Supercomputing to help build, deploy, and operate production-grade AI infrastructure for frontier-scale...  ...This role sits at the intersection of engineering, deployment, reliability, and customer... 
    Flexible hours

    RadixArk

    Palo Alto, CA
    4 days ago
  •  ...Powered by the Illumio AI Security Graph, our...  ...Team's Vision Our Engineering team is driven by a culture...  ...that drives us forward in the ever-evolving landscape...  ...and become productive members of the team. ~...  ...their successful deployment and usage by customers... 
    Immediate start

    Illumio

    Sunnyvale, CA
    1 day ago
  • $200k - $350k

     ...Member Of Technical Staff, Reinforcement Learning Inception creates the world...  ...'s fastest, most efficient AI models. Our Mercury model is...  ...are the AI researchers and engineers behind such breakthrough AI...  ...Mercury. We are currently deploying our large-scale diffusion LLMs... 
    Immediate start
    Flexible hours

    Inception LLC

    Palo Alto, CA
    4 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, Forward Deployed AI Engineer. Be the first to apply!