Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, Training Infra

$200k - $350k

Inception LLC

Member Of Technical Staff, Training Infra Bay Area Ai Systems

Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world's fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today's LLMs, with best-in-class quality. We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO.

The Role

We're looking for engineers and scientists to design, optimize, and maintain the core systems that enable scalable, efficient training of LLM. Your goal is to make experimentation and training at Inception fast and reliable so our team can focus on science, not system bottlenecks.

Key Responsibilities
  • Design, implement, and optimize distributed training systems that scale across thousands of GPUs and nodes.
  • Develop high-performance optimizations to maximize throughput and efficiency.
  • Develop reusable frameworks and libraries to improve training reproducibility, reliability, and scalability for new model architectures.
Qualifications
  • BS/MS/PhD in Computer Science, Engineering, or a related field (or equivalent experience).
  • Understanding of ML frameworks (PyTorch, TensorFlow) from a systems perspective.
  • Strong engineering skills — ability to contribute performant, maintainable code and debug in complex codebases.
  • Proficiency in Python and at least one systems programming language (C++/Rust/Go).
  • Experience with containerization (Docker), orchestration (Kubernetes), and CI/CD pipelines.
Preferred Skills
  • Experience building and maintaining large-scale language models with tens of billions of parameters or more.
  • Experience with ML workflow orchestration tools (Kubeflow, Airflow).
  • Background in performance optimization and profiling of ML systems (Prometheus, Grafana, OpenTelemetry).
  • Familiarity with distributed frameworks such as PyTorch/XLA, DeepSpeed, Megatron-LM.
Compensation

The annual base salary range for this role is $200,000 – $350,000 USD. Final compensation is determined based on experience, skills, and qualifications. Equity and benefits are included in the total package.

Why Join Inception
  • Work with World-Class Talent : Collaborate with the inventors of diffusion models and leading AI researchers
  • Shape Foundational Technology : Your decisions will influence how the next generation of AI products are built and used
  • Immediate Impact : Join at the ground floor where your contributions directly shape product direction and company trajectory
Perks & Benefits
  • Competitive salary and equity in a rapidly growing startup
  • Flexible vacation and paid time off (PTO)
  • Health, dental, and vision insurance
  • 401k match
  • Catered meals (breakfast, lunch, & dinner)
  • Commuter subsidies
  • A collaborative and inclusive culture
About Us

Inception creates the world's fastest, most efficient AI models. Today's autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception's diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality. Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO, UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning, and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion. We pioneered the application of diffusion to language, with world's first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well. Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft's venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt.

If you are talented, innovative, and ambitious, come help us invent the future of AI. We are an equal opportunity employer and encourage candidates of all backgrounds to apply.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, Training Infra in San Francisco, CA vacancy
  •  ...come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond. About the Role Build and scale distributed training systems that power frontier model pre-training. Work closely with research teams to design and operate large-scale training runs... 
    Technical training
    Full time
    Relocation package

    B Capital

    San Francisco, CA
    4 days ago
  • Member of Technical Staff - Post‑Training Join to apply for the Member of Technical Staff - Post‑Training role at Reflection AI . Our Mission Reflection...  ...execution. Able to work fluidly across research and infra boundaries. Strong communication capabilities and comfort... 
    Technical training
    Full time
    Relocation package

    Reflection AI

    San Francisco, CA
    1 day ago
  •  ...and beyond. About the Role Design, build, and operate large-scale GPU infrastructure for high-throughput model inference and mid-training workloads. Develop systems that power synthetic data generation and reinforcement learning pipelines at scale. Build high-... 
    Technical training
    Relocation package

    Reflection

    San Francisco, CA
    4 days ago
  • $150k - $300k

     ...infrastructure that lets anyone create, train, and deploy them. We aggregate...  ...runs the jobs. Core Technical Responsibilities Hosted...  ...new models, training methods, infra patterns - and the ability to...  ...and encourage team members to contribute to the broader AI... 
    Technical training
    Work at office
    Local area
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect

    San Francisco, CA
    3 days ago
  • $180k

     ...Member Of Technical Staff - Pre-Training Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence... 
    Technical training
    Temporary work

    Xai

    San Francisco, CA
    3 days ago
  • Member of Technical Staff, Infrastructure and Training Systems Location: SF Bay Area or Tokyo, Japan Type: Full-time About Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics research to the challenges... 
    Technical training
    Full time

    Radical Numerics

    San Francisco, CA
    4 days ago
  • Member of Technical Staff, Post-Training Location: SF Bay Area or Tokyo, Japan Type: Full-time About Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics research to the challenges of biology. We are... 
    Technical training
    Full time

    Radical Numerics

    San Francisco, CA
    4 days ago
  • Member of Technical Staff, Pretraining Science Member of Technical Staff, Pre-Training Science Location: SF Bay Area or Tokyo, Japan Type: Full-time About Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and... 
    Technical training
    Full time

    Radical Numerics

    San Francisco, CA
    1 day ago
  •  ...mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises...  ...New York but also embrace being remote-friendly! As a Member of Technical Staff, you will: Design and write high-performant and scalable... 
    Technical training
    Full time
    Work at office
    Remote work
    Flexible hours

    Jaide Health

    San Francisco, CA
    3 days ago
  • Pixeltable, Inc. is seeking a Member of Technical Staff based in San Francisco, CA. As a founding member of our engineering team, you will directly influence the design and development of a revolutionary AI data platform. With over 5 years of experience in systems engineering... 
    Flexible hours

    Pixeltable, Inc.

    San Francisco, CA
    4 days ago
  • $225k

     ...code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About the Role As a Software... 
    Technical training
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    3 days ago
  •  ...challenging problem You are obsessed about building incredibly useful models You are a power user of AI models If you previously trained models used by millions of people it’s a big plus, but modeling experience is not required You take pride in your work and thrive... 
    Technical training

    xAI

    San Francisco, CA
    1 day ago
  • We are looking for strong engineers with experience training production machine learning models. If you are interested in contributing to open-source projects and evolving Modal's infrastructure to train the next generation of language models, we'd love to hear from you... 
    Technical training
    Work at office

    Modal Labs

    San Francisco, CA
    4 days ago
  •  ...deep learning literature. Lead small research projects independently while collaborating on larger initiatives Optimize the training infrastructure for efficient scaling. Contribute across the entire stack, from low-level optimizations to high-level model design... 
    Technical training
    Relocation package

    Reflection AI, Inc

    San Francisco, CA
    2 days ago
  •  ...models and real-world deployment. You'll own the applied post-training work that adapts LFM2.5-Audio for customer use cases end-to-end...  ...as design parameters, not blockers. The Work Act as the technical owner for enterprise audio post-training engagements. Translate... 
    Technical training

    Liquid AI

    San Francisco, CA
    1 day ago
  •  ...language models and real-world deployment. You'll own applied post-training work for VLMs end-to-end for some of the world's largest...  ...Communicates clearly: Can translate between customer needs and internal technical teams, and push back when needed. The Work Act as the... 
    Technical training

    Liquid AI

    San Francisco, CA
    1 day ago
  •  ...Job Title Responsibilities Design, deploy, and maintain large distributed ML training and inference clusters Develop efficient, scalable end-to-end pipelines to manage petabyte-scale datasets and model training throughout the entire ML lifecycle Research... 

    Causal Labs

    San Francisco, CA
    3 days ago
  •  ...narrative around document understanding. About the Role The Infra team at LlamaIndex owns the foundations that our product is...  ...volume data platform for AI applications. We are looking for team members who love building enabling systems that empower our engineers... 
    Work at office

    LlamaIndex

    San Francisco, CA
    1 day ago
  •  ...enterprise customers Own the technical relationship with enterprise...  ...intercept ~ Shipped real infra at a startup and lived with the...  ...As a founding member, you'll help define the technical...  ...researchers and engineers who have trained frontier-scale models and widely... 
    Visa sponsorship
    Flexible hours

    NeoSigma

    San Francisco, CA
    1 day ago
  • $170k - $220k

    Member of Technical Staff - Infrastructure & LLMs Location: San Francisco, CA (Hybrid) Compensation: $170,000 - $220,000 base + 1-3% equity Work...  ...Role is Unique Massive Technical Ownership: Drive core infra design with zero red tape. Frontier Engineering: Work on distributed... 
    Full time
    Temporary work
    Immediate start
    Visa sponsorship
    Work visa

    Amadeus Search

    San Francisco, CA
    2 days ago
  •  ...code and architecture reviews for other members of the team Help establish engineering best...  ...meets their needs Requirements Technical 2+ YOE: 2+ years of full-time experience...  ..., we encourage you to apply to our MTS - Infra role. Strong engineering fundamentals :... 
    Full time
    Work experience placement
    Relocation
    Relocation package
    Shift work

    PetsApp

    San Francisco, CA
    3 days ago
  • $150k - $300k

     ...from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and...  ...into our RL training stack. Core Technical Responsibilities LLM Serving Multi...  ...development and encourage team members to contribute to the broader AI community... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime-Intellect

    San Francisco, CA
    5 days ago
  • $150k - $300k

     ...from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and...  ..., and reliable at scale. Core Technical Responsibilities Infrastructure Development...  ...development and encourage team members to contribute to the broader AI... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours

    Prime Intellect, Inc.

    San Francisco, CA
    2 days ago
  • $176k - $253k

    Senior Member of Technical Staff, Developer Experience Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI...  ...building software, with developer-platform / CI-CD / build-infra / dev-environment experience that other engineers actually... 
    Work at office
    Local area
    Immediate start
    Remote work
    Relocation
    Weekend work

    Harper Group

    San Francisco, CA
    3 days ago
  • $187k - $264k

     ...Senior Member of Technical Staff, Harness Engineering Harper is an AI-native commercial insurance company in San Francisco. We're not bolting...  ..., multi-step orchestration. Ship execution-environment infra. Sandbox lifecycle, isolation, blast-radius limits, filesystem... 
    Work at office
    Relocation

    Harper Group

    San Francisco, CA
    1 day ago
  •  ...future of AI. About the role Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern). As an AI Researcher (Intern)...  ..., you will have significant ownership, work alongside highly technical engineers, and help shape both the systems we build and how... 
    Internship

    Gimlet Labs

    San Francisco, CA
    1 day ago
  •  ...massive scale and help define the infrastructure layer for the future of AI. About the role Gimlet Labs is seeking a Member of Technical Staff (Intern) to help develop Gimlet's platform for deploying and monitoring AI workloads. In this role, you will be applying... 
    Internship

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • $180k

     ...analysis, empowering over 5,000 scientists across 150+ R&D labs to handle data from instrument-to-insights. We're seeking a Member of Technical Staff for Therapeutics to lead our therapeutics bench, pushing its capabilities to the frontier of what artificial intelligence... 
    Full time
    Work at office

    LatchBio

    San Francisco, CA
    4 days ago
  •  ...Member Of Technical Staff – Applied AI, Fullstack Stuut is transforming accounts receivable for B2B companies—making collections smarter and faster for companies that have historically relied on manual processes that are labor intensive and costly. Our platform is gaining... 
    Full time
    Flexible hours

    Stuut

    San Francisco, CA
    1 day ago
  •  ...Member of Technical Staff, Autonomous AI Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that...  ...possible. We are building a frontier AI research company and training our own models end-to-end. Our work spans areas such as... 

    Mirendil

    San Francisco, CA
    35 minutes ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, Training Infra. Be the first to apply!