Research Engineer: RL & Reasoning for Next-Gen LMs

Zyphra

A cutting-edge AI company based in San Francisco is seeking a Research Engineer specializing in Agency and Reasoning. The role focuses on performing research in reinforcement learning and applying innovative ideas to the next generation of their language models. Candidates should have a postgraduate degree in a scientific field and be proficient in PyTorch and Python. The company values creativity and provides a dynamic work environment with excellent benefits, including comprehensive health plans and unlimited PTO.

Vacancy posted 1 day ago

Similar jobs that could be interesting for you
Based on the Research Engineer: RL & Reasoning for Next-Gen LMs in San Francisco, CA vacancy

Research Engineer/Research Scientist, RL/Reasoning
$310k
About the Team The RL and Reasoning team drives the core reasoning paradigm and has created... ...of reinforcement learning research, building next-generation generative models, and deploying... ...scale. About the Role As a Research Engineer/Research Scientist at OpenAI, you will...
Suggested
Work at office
Relocation package
Slope
San Francisco, CA
2 days ago
Research Engineer - Agency and Reasoning
...Francisco, California. The Role: As a Research Engineer - Agency and Reasoning, you will be a core contributor to... ...applying your ideas at scale to our next generation of language models.... ...model reasoning or more classical RL tasks Experience with language-model...
Suggested
Work at office
Relocation package
Zyphra
San Francisco, CA
25 days ago
Research Engineer - Agency and Reasoning
...Francisco, California. The Role: As a Research Engineer - Agency and Reasoning, you will be a core contributor to... ...applying your ideas at scale to our next generation of language models. What... ...model reasoning or more classical RL tasks Experience with language-model...
Suggested
Work at office
Relocation package
Zyphra
San Francisco, CA
5 days ago
Research Engineer - Reinforcement Learning
...plane and pair it with the full RL post-training stack:... ...async RL trainer. We enable researchers, startups and enterprises to... ...deployment contexts. As a Research Engineer in our Reasoning team, you'll play a crucial... ...and tools, synthetic data gen research and proactively...
Suggested
Remote work
Worldwide
Visa sponsorship
Relocation package
Flexible hours
Prime-Intellect
San Francisco, CA
4 days ago
Cybersecurity RL Research Engineer - Flexible Hours
A technology-driven AI company in San Francisco is hiring a Research Engineer for its Cybersecurity RL team. This role involves advancing AI capabilities in secure coding and vulnerability remediation by blending research with engineering tasks. Candidates should have...
Suggested
Flexible hours
Menlo Ventures
San Francisco, CA
4 days ago
Staff RL Research Engineer — Post-Training Environments
$200k
A technology company in San Francisco is seeking a Software Engineer for their RL Research & Environments team. The role focuses on designing and improving data and evaluation systems to enhance model capabilities. Candidates should have a strong software engineering background...
SupportFinity™
San Francisco, CA
4 days ago
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
$264.8k - $331k
...Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI San Francisco... ...MLRE, you will build out our next-gen Agent RL training platform. You'll build out the... ...committed to working with and providing reasonable accommodations to applicants with...
Full time
Scale AI
San Francisco, CA
5 days ago
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
$264.8k - $331k
...The Enterprise ML Research Lab works on the front lines... ...clients. As an ML Sys Research Engineer, you'll work on building out the algorithms for our next-gen Agent RL training platform, support large... ...working with and providing reasonable accommodations to applicants...
Full time
Scale AI
San Francisco, CA
5 days ago
Research Engineer
...infrastructure / Reinforcement Learning (RL) training data & evaluations... ...Opportunity Our partner is hiring a Research Engineer to help scale the quality assurance (QA... ...or any other protected characteristic. Reasonable accommodations are available throughout...
Remote work
talentpluto
San Francisco, CA
20 days ago
Research Engineer — Reinforcement Learning
$180k - $270k
Research Engineer (Focused on RL) You'll bring reinforcement learning to Firecrawl's core product — building... ...this week than one polished one next month. And when you have results, you... ...techniques, production instincts, and fast reasoning. Founder Chat (~30 min) — Culture,...
Full time
Temporary work
Remote work
Firecrawl
San Francisco, CA
5 days ago
Research Engineer, Virtual Collaborator at Anthropic - San Francisco, CA | New York City, NY | [...]
Research Engineer, Virtual Collaborator at Anthropic - San Francisco, CA | New York City, NY | Seattle... ...and preventing reward hacking in RL systems Translating product requirements... ...make you an offer, we will make every reasonable effort to get you a visa, and we retain...
Work at office
Visa sponsorship
Flexible hours
Victrays
San Francisco, CA
2 days ago
Research Engineer - Benchmarking, Evals & Failure Analysis
...team. You’ll work alongside researchers, operators, and AI companies... ...About the Role As a Research Engineer at Mercor, you’ll work at the... ...agentic behavior, and real-world reasoning. You’ll design and run evals,... ..., rubric design, or RL‑style workflows that use evals...
Work at office
Mercor
San Francisco, CA
2 days ago
Research Engineer — Search/IR
$180k - $270k
Research Engineer (Focused on Search/IR) You'll own and advance the search and... ...to tell you what to try next — you have a backlog of ideas... ...the Head of Research and the RL‑focused Research Engineer to... ...and when to use which. You can reason about relevance tradeoffs and...
Full time
Temporary work
Remote work
Firecrawl
San Francisco, CA
5 days ago
RL Research Engineer — Open AI Infra (Remote)
Prime-Intellect is seeking a Research Engineer in San Francisco to shape the technological direction of their AI infrastructure. This role demands expertise in AI/ML engineering and the ability to lead research efforts in synthetic data generation. You will optimize AI...
Remote job
Flexible hours
Prime-Intellect
San Francisco, CA
4 days ago
Research Engineer, Infrastructure
...the first AI software engineer, and Windsurf, an AI-native... ..., former founders, and researchers from the frontier of AI... ...what they need next, and build systems that... ...hold up at our largest RL training scales. Performance... ..., and the ability to reason about performance across...
Cognition
San Francisco, CA
3 days ago
Principal Research Engineer - Code
...Turing is the world’s leading research accelerator for frontier AI... ...pipelines that advance thinking, reasoning, coding, multimodality, and... ...and reinforcement learning (RL) environments that power post... ...: Environments for Software Engineering / coding agents UI-Environments...
For contractors
Flexible hours
Cerebras
San Francisco, CA
1 day ago
[Expression of Interest] Research Scientist / Engineer, Honesty
$315k - $340k
...a quickly growing group of committed researchers, engineers, policy experts, and business leaders... ...and honesty. Develop and test novel RL environments that reward truthful outputs... ...We sponsor visas. We will make every reasonable effort to obtain a visa if you are offered...
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
1 day ago
Research Engineering Manager — AI Agents & RL Leader
Adept in San Francisco is seeking a Research Engineering Manager to lead a team of research engineers and scientists. The role involves setting... ...developing a research agenda, and collaborating with leadership on next-generation AI agents. The ideal candidate has experience...
I did my part and supported the Regular Toilet
San Francisco, CA
5 days ago
Research Systems Engineer: Frontier RL for Enterprise
A leading AI company based in San Francisco is seeking a research systems engineer to train large language models and explore reinforcement learning techniques. The ideal candidate will work at the intersection of research and systems design experiments at scale. Benefits...
Applied Compute Inc.
San Francisco, CA
2 days ago
Real-World Robotics RL Research Engineer
Pantograph is looking for research engineers to build robots that learn through exploration in the real world. Ideal candidates will have strong foundations in reinforcement learning and experience working with large GPU clusters, Kubernetes, and complex distributed systems...
Pantograph
San Francisco, CA
5 days ago
Research Engineer: RL Data QA & Tooling
talentpluto is seeking a Research Engineer to enhance the quality assurance (QA) systems supporting training data for reinforcement learning. This position demands close collaboration with stakeholders to guarantee reliability and consistency in datasets. Key responsibilities...
talentpluto
San Francisco, CA
5 days ago
Senior Research Engineer - Experimental Engineering
$140k - $160k
...employees. OVERVIEW The Experimental Engineering team leads the engineering R&D efforts... ...We're seeking a highly capable Senior Research Engineer to join our team and drive... ...Consumer Reports will provide you with any reasonable assistance or accommodation for any...
Full time
Local area
Remote work
Relocation package
Consumer Reports
San Francisco, CA
12 days ago
Research Engineer
$200k - $250k
Research Engineer Location San Francisco (On-site) Compensation $200,000 - $250,000 + variable equity based on cash compensation Technologies... ...Research Engineer to design, prototype, and ship the core reasoning and agentic systems that power Lotus. You will turn messy...
Lotus Health AI
San Francisco, CA
4 days ago
Founding Applied Research Engineer
...for revenue is closing in the next few months. Rox is in market.... ...every day. We see exactly where research meets production and where the... ...rewards. Offline-to-online RL on multi-touch trajectories with... ...elite and competitive engineering minds. Translate findings into...
Relocation
Rox Data Corp
San Francisco, CA
2 days ago
Applied Research Engineer
$145.2k - $196.4k
...Francisco Employment Type Full time Department Engineering Job Summary Drata is seeking an Applied Research Engineer to drive the quality and effectiveness of... ...science behind how Drata's AI products retrieve, reason, and respond, and you'll work closely with AI and...
Full time
Flexible hours
Cacheflow
San Francisco, CA
2 days ago
Research Engineer - Evals
$160k - $240k
Research Engineer — Evals You’ll build the evaluation systems that tell us whether Firecrawl actually... ...system. Close the loop with models and RL. Evals here aren’t a reporting layer —... ...directly influence what gets trained next. Run fast experiments and communicate clearly...
Full time
Temporary work
Remote work
Firecrawl
San Francisco, CA
3 days ago
Founding Research Engineer
At Camfer, our research engineers are training models to intelligently interpret and edit parametric CAD designs in 3D space. This is a cutting... ..., evals to measure performance of generations in 3D space, or RL frameworks to train agents. We are looking for people who: Have...
Work at office
Camfer
San Francisco, CA
4 days ago
Applied Research Engineer
$197.3k - $313.7k
Applied Research Engineer. Remote type: Office Tech-Flexible. Locations... ...software and platform engineers to embed in our AI team to... ...want to build and deploy the next generation of services leveraging... ...Accommodations: If you need a reasonable accommodation during the...
Work at office
Salesforce, Inc.
San Francisco, CA
3 days ago
Research Engineer
...friction with seamless automation. As a Research Engineer at Capably, you’ll help define how... ...capability. You’ll explore new approaches in reasoning, planning, tool use, memory,... ...impact will be essential to shaping the next generation of Capably’s platform. What...
Capably
San Francisco, CA
5 days ago
Research Engineer, Codex
...the-art AI systems that can write code, reason about software, and act as intelligent... ...as ChatGPT and the API, as well as in next-generation tools specifically designed for agentic coding. We operate across research, engineering, product, and infrastructure—owning the...
Work at office
Relocation package
OpenAI
San Francisco, CA
1 day ago
