Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff, Reinforcement Learning

Inception LLC

The Role

We seek experienced scientists and engineers with deep expertise in post-training large language models through reinforcement learning. You will design and implement RL training pipelines for our diffusion LLMs, develop reward modeling strategies, and build the algorithms that align model behavior with human intent at scale.

Key Responsibilities

  • Design, develop, and optimize RL training pipelines (PPO, DPO, RLHF, and novel approaches) for diffusion-based LLMs.
  • Build and iterate on reward models, reward shaping strategies, and evaluation of reward quality.
  • Implement innovative approaches for fine-tuning and scaling generative AI models.
  • Work on data preprocessing pipelines, model evaluation, and alignment to enterprise use cases.
  • Research and implement techniques for controlled text generation and constraint satisfaction.
  • Improve training stability, efficiency, and reproducibility of RL workloads.
Qualifications
  • BS/MS/PhD in Computer Science or a related field (or equivalent experience).
  • At least 2 years of experience working on ML projects in PyTorch (or equivalent), preferably in a research lab or engineering role.
  • Excellent familiarity with transformers and core LLM concepts (autoregressive pretraining, instruction tuning, in-context learning, KV caching).
  • Hands-on experience with reinforcement learning from human feedback (RLHF), PPO, DPO, or related post-training methods.
  • Familiarity with training and inference in diffusion models.
  • Experience training deep learning models at scale in distributed computing environments.
Preferred Skills
  • Extensive experience training transformer-based language models from scratch.
  • Experience designing and implementing reward models or preference learning systems.
  • Knowledge of advanced training techniques (mixed precision, gradient accumulation, etc.).
  • Background in optimization theory and neural network architecture design.
  • Experience with LLM serving frameworks like vLLM, SGLang, or TensorRT.
Why Join Inception
  • Work with World-Class Talent : Collaborate with the inventors of diffusion models and leading AI researchers
  • Shape Foundational Technology : Your decisions will influence how the next generation of AI products are built and used
  • Immediate Impact : Join at the ground floor where your contributions directly shape product direction and company trajectory
Perks & Benefits
  • Competitive salary and equity in a rapidly growing startup
  • Flexible vacation and paid time off (PTO)
  • Health, dental, and vision insurance
  • Catered meals (breakfast, lunch, & dinner)
  • Commuter subsidies
  • A collaborative and inclusive culture

About Us

Inception creates the world's fastest, most efficient AI models. Today's autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception's diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality.

Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO, UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning, and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion.

We pioneered the application of diffusion to language, with world's first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today's image and video AI, and we're making it the standard for LLMs as well.

Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft's venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt.

If you are talented, innovative, and ambitious, come help us invent the future of AI.

We are an equal opportunity employer and encourage candidates of all backgrounds to apply.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff, Reinforcement Learning in San Mateo, CA vacancy
  •  ...Job Title What You'll Do Develop and optimize a learning-based robotic manipulation control stack Design and maintain...  ...Train robotic policies for manipulation and locomotion with reinforcement learning and imitation learning Deploy robotic policies and... 
    Suggested

    GenesisAI

    San Carlos, CA
    1 hour ago
  •  ...role. Excellent familiarity with transformers and core LLM concepts (autoregressive pretraining, instruction tuning, in-context learning, KV caching). Familiarity with training and inference in diffusion models. Experience training deep learning models at scale... 
    Suggested
    Immediate start
    Flexible hours

    Inception LLC

    San Mateo, CA
    1 day ago
  • $175k - $220k

     ...Member of Technical Staff, Software Engineer San Mateo, CA About Us At Fireworks, we're building the future of generative AI infrastructure...  ...shapes the future of AI—no bureaucracy, just results. Learn from the Best: Collaborate with world-class engineers and... 
    Suggested

    Fireworks AI

    San Mateo, CA
    3 days ago
  • Introducing Moonlake, AI for creating real-time interactive content Mission : As an applied AI Research Engineer: Code agents (post training + systems) Scope of Work : - Agentic systems design: Tool catalogs, function calling, program synthesis/repair loops, ...
    Suggested

    Embedding VC

    San Mateo, CA
    3 days ago
  • Job Title Develop a high-throughput, GPU-based simulation pipeline (primarily rigid body simulation for robots) to train robotics foundation models Implement essential robotics features, including actuators, sensors, and controllers, in collaboration with the robotics...
    Suggested

    GenesisAI

    San Carlos, CA
    4 days ago
  • Job Title What You'll Do Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics Design and optimize distributed inference systems on GPU clusters, pushing throughput with large...

    GenesisAI

    San Carlos, CA
    4 days ago
  • Security Infrastructure Engineer What You'll Do Design, build, and scale security infrastructure from the ground up across our systems, networks, endpoints, and products Own and evolve security architecture across endpoint security, network security, application...
    Interim role

    GenesisAI

    San Carlos, CA
    3 days ago
  • What You’ll Do Design, build, and maintain large-scale data pipelines (batch and streaming) for robotics foundation model training and evaluation at petabyte scale Own core data infrastructure: data model, storage systems, ingestion pipelines, transformation frameworks...
    Remote work

    AI Chopping Block, Inc.

    San Carlos, CA
    2 days ago
  • What You’ll Do Drive down wall-clock time to convergence by profiling and eliminating bottlenecks across the foundation model training stack stack, from data pipelines to GPU kernels Design, build, and optimize distributed training systems (PyTorch) for multi-node GPU ...
    Remote work

    AI Chopping Block, Inc.

    San Carlos, CA
    1 day ago
  • The Role We're looking for engineers and scientists to design, optimize, and scale the systems that power our diffusion LLMs in production. Your work will make inference faster, more cost-effective, and more reliable. Key Responsibilities Build and optimize ...
    Immediate start
    Flexible hours

    Inception LLC

    San Mateo, CA
    3 days ago
  • What You'll Do Develop a high-throughput rendering pipeline for training robotics foundation models Design protocols and interfaces between the rendering pipeline, physics engine, and 3D generative models Build an efficient platform for large-scale robotics training and...
    Remote work

    AI Chopping Block, Inc.

    San Carlos, CA
    1 day ago
  •  ...paradigm of physical data synthesis— combining simulation, generative models, and autonomous agents Deep curiosity and strong technical ownership, with a track record of driving complex, open-ended projects from concept to implementation Experience with (multimodal... 

    GenesisAI

    San Carlos, CA
    4 days ago
  • $83.09 - $103.42 per hour

     ...Responsibilities # Implement the Learning Disabilities Program, consistent with the...  ...materials. # Train and direct the work of staff, including tutors who work with students...  ...supported by regular training for all members of selection and screening committees. We... 
    Hourly pay
    Part time
    Work experience placement
    H1b
    Work at office
    Local area
    Immediate start
    Remote work

    San Mateo County Community College District

    San Mateo, CA
    4 days ago
  • $35k - $50k

     ...and self-reflective practitioners who are open to teaching and learning using new, innovative methods. Our professionals create safe, supportive...  ...value a cooperative learning environment with all community members. The candidate will actively seek to build relationships with... 
    Full time
    Part time

    Ronald C. Wornick Jewish Day School

    Foster, CA
    19 hours ago
  •  ...their business. Bring your curiosity for learning, bold ideas, courage and passion to...  ...ambiguous business needs into concrete technical solutions that redefine what's possible....  ...guidance and mentorship to Associate team members. What You'll Bring A PhD Degree... 
    Work experience placement
    Work at office
    Local area
    Work from home
    Worldwide
    Flexible hours

    ZS Associates

    South San Francisco, CA
    19 hours ago
  •  ...inspired to do their best work. As a Technical Specialist, you offer technical support and...  ...differences and having the curiosity to learn. Demonstrate Apple’s values of inclusion...  ...and accountability with other team members. Be trusted with sensitive or confidential... 
    Local area
    Relocation
    Night shift

    Apple

    San Mateo, CA
    4 days ago
  •  ...The Role We're hiring a hands-on Staff Security Engineer to build the security foundation for a frontier AI platform serving...  ..., privacy, compliance, and infrastructure risk as we scale - a technical leader, not a friction point for the engineering team. What... 
    Immediate start
    Flexible hours

    Inception LLC

    San Mateo, CA
    2 days ago
  •  ...supply chain environments is required. Technical Skills: Experience with SAP and/or PLM...  ..., collaboration, and continuous learning. We offer quality career resources, training...  ...' as well, which an Everforth Apex team member can provide. Everforth Apex Systems is... 
    Contract work

    Apex Systems

    Foster, CA
    2 days ago
  •  ...working within Tier 1, 2, and 3 support models; able to quickly learn to use new systems Experience working for pharmaceutical / Biotech...  ...skills to build and maintain productive relationships with team members Provide constructive feedback during code reviews and be open to... 

    Vidorra LLC

    Foster, CA
    1 day ago
  •  ...operational efficiencies Mentor team members in AI/ML business analysis and product development...  ..., and foster a culture of continuous learning What This Role Is Not Not a...  ...or execution management Not a purely technical ML, data science or data analyst role... 

    Insight Global

    San Mateo, CA
    2 days ago
  • $75k - $115k

     ...Stanbridge Academy is seeking a skilled and collaborative Education Specialist to support students with mild to moderate learning differences, including specific learning disabilities, ADHD, executive functioning challenges, and related needs. This role is ideal for an... 

    Stanbridge Academy

    San Mateo, CA
    3 hours ago
  •  ...Network Security Job Summary The Technical Support Engineerindependently resolves complex...  .... Shares insights, lessons learned, and configuration guidance across the team...  ...business and what we look for in every team member: Trust is paramount. We deliver... 
    Full time
    Remote work

    Keyfactor

    San Mateo, CA
    25 days ago
  • $123k - $190.9k

     ...engineering roadmaps , user requirements, and technical specifications that enable breakthrough...  ...practices, and contributing to internal learning programs to elevate Visa's AI maturity...  ...and solution strategies to team members that improve the design and functionality... 
    Work experience placement
    Work at office
    Local area

    Visa

    San Mateo, CA
    3 hours ago
  • $178.1k - $267.1k

     ...excellence and creativity. Staff Business Analyst, PlayStation...  ...recommendations * Mentor team members in business analysis, AI / ML...  ..., and encourage continuous learning across teams What This Role...  ...management role * Not a purely technical ML, data science or data... 
    Remote work

    Sony Playstation Network

    San Mateo, CA
    2 days ago
  • $138k - $189.2k

     ...all disease to transforming how students learn. At CZI, you'll join a deeply collaborative...  ...Support Specialist to act as the senior technical lead for end-user support across our...  ...guidance and mentorship to support team members Support High Impact User Workflows (... 
    Work at office
    Local area
    Remote work
    Relocation package

    Chan Zuckerberg Initiative

    Redwood City, CA
    4 days ago
  •  ...Requirements This position involves working with a team of consultants and following the direction and guidance of a senior staff member and/or technical lead. Your customer service and people skills are paramount, but you will need to be comfortable working with... 
    Full time
    Part time
    Currently hiring
    Work at office
    Flexible hours

    Clarke Consulting LLC

    San Mateo, CA
    19 hours ago
  •  ...Help Desk Technician serves as an advanced technical support resource and escalation point...  ...standardization initiatives while mentoring Tier 1 staff and helping scale IT operations in a fast...  ...Amenities (In-Office Only) Want to learn more about what we are up to? Meet the... 
    Full time
    Temporary work
    Work at office
    Remote work
    Worldwide
    Monday to Friday
    Flexible hours

    Replit

    Foster, CA
    2 days ago
  • $128.45k - $183.5k

     ...support teams Be the techno-functional person who can provide technical solutions to business problems, by understanding the functional...  ...a proactive, can-do attitude with a strong willingness to learn. ~ Great organizational skills and attention to detail; able... 
    Full time
    Work at office
    Local area
    Flexible hours

    RingCentral

    Belmont, CA
    7 days ago
  • $41 - $47 per hour

     ...performance tracking. You'll work closely with cross-functional teams and learn how key performance indicators (KPIs), dashboards, and reporting...  ...Present findings in a clear and organized way to team members and stakeholders What Would Make You a Good Fit: Currently... 
    Hourly pay
    Full time
    Internship
    Local area

    Skydio

    San Mateo, CA
    19 hours ago
  • $160k - $240k

     ...The Senior Business Systems Analyst will be a key member of Snorkel's Revenue Operations team. In this...  ...ongoing success. Whether you're looking to deepen your technical expertise, explore leadership opportunities, or learn new skills across multiple functions, you're... 
    Local area
    Remote work

    Snorkel AI

    Redwood City, CA
    19 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff, Reinforcement Learning. Be the first to apply!