Member of Technical Staff, Reinforcement Learning
Inception LLC, San Mateo, CA
The Role
We seek experienced scientists and engineers with deep expertise in post-training large language models through reinforcement learning. You will design and implement RL training pipelines for our diffusion LLMs, develop reward modeling strategies, and build the algorithms that align model behavior with human intent at scale.
Key Responsibilities
- Design, develop, and optimize RL training pipelines (PPO, DPO, RLHF, and novel approaches) for diffusion-based LLMs.
- Build and iterate on reward models, reward shaping strategies, and evaluation of reward quality.
- Implement innovative approaches for fine-tuning and scaling generative AI models.
- Work on data preprocessing pipelines, model evaluation, and alignment to enterprise use cases.
- Research and implement techniques for controlled text generation and constraint satisfaction.
- Improve training stability, efficiency, and reproducibility of RL workloads.
Qualifications
- BS/MS/PhD in Computer Science or a related field (or equivalent experience).
- At least 2 years of experience working on ML projects in PyTorch (or equivalent), preferably in a research lab or engineering role.
- Strong familiarity with transformers and core LLM concepts (autoregressive pretraining, instruction tuning, in-context learning, KV caching).
- Hands-on experience with reinforcement learning from human feedback (RLHF), PPO, DPO, or related post-training methods.
- Familiarity with training and inference in diffusion models.
- Experience training deep learning models at scale in distributed computing environments.
- Extensive experience training transformer-based language models from scratch.
- Experience designing and implementing reward models or preference learning systems.
- Knowledge of advanced training techniques (mixed precision, gradient accumulation, etc.).
- Background in optimization theory and neural network architecture design.
- Experience with LLM serving frameworks like vLLM, SGLang, or TensorRT.
- Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers.
- Shape Foundational Technology: Your decisions will influence how the next generation of AI products is built and used.
- Immediate Impact: Join at the ground floor, where your contributions directly shape product direction and company trajectory.
- Competitive salary and equity in a rapidly growing startup
- Flexible vacation and paid time off (PTO)
- Health, dental, and vision insurance
- Catered meals (breakfast, lunch, & dinner)
- Commuter subsidies
- A collaborative and inclusive culture
Vacancy posted 3 days ago

