Research Engineer: RL & Reasoning for Next-Gen LMs
Zyphra
A cutting-edge AI company based in San Francisco is seeking a Research Engineer specializing in Agency and Reasoning. The role focuses on performing research in reinforcement learning and applying innovative ideas to the next generation of their language models. Candidates should have a postgraduate degree in a scientific field and be proficient in PyTorch and Python. The company values creativity and provides a dynamic work environment with excellent benefits, including comprehensive health plans and unlimited PTO. #J-18808-Ljbffr Zyphra
$310k
About the Team The RL and Reasoning team drives the core reasoning paradigm and has created... ...of reinforcement learning research, building next-generation generative models, and deploying... ...scale. About the Role As a Research Engineer/Research Scientist at OpenAI, you will...SuggestedWork at officeRelocation package- ...Francisco, California. The Role: As a Research Engineer - Agency and Reasoning , you will be a core contributor to... ...applying your ideas at scale to our next generation of language models. What... ...model reasoning or more classical RL tasks Experience with language-model...SuggestedWork at officeRelocation package
- ...plane and pair it with the full RL post-training stack:... ...async RL trainer. We enable researchers, startups and enterprises to... ...deployment contexts. As a Research Engineer in our Reasoning team, you'll play a crucial... ...and tools, synthetic data gen research and proactively...SuggestedRemote workWorldwideVisa sponsorshipRelocation packageFlexible hours
- A technology-driven AI company in San Francisco is hiring a Research Engineer for its Cybersecurity RL team. This role involves advancing AI capabilities in secure coding and vulnerability remediation by blending research with engineering tasks. Candidates should have...SuggestedFlexible hours
$200k
A technology company in San Francisco is seeking a Software Engineer for their RL Research & Environments team. The role focuses on designing and improving data and evaluation systems to enhance model capabilities. Candidates should have a strong software engineering background...Suggested$264.8k - $331k
...Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI San Francisco... ...MLRE, you will build out our next-gen Agent RL training platform. You'll build out the... ...committed to working with and providing reasonable accommodations to applicants with...Full time$264.8k - $331k
...The Enterprise ML Research Lab works on the front lines... ...clients. As an ML Sys Research Engineer, you'll work on building out the algorithms for our next-gen Agent RL training platform, support large... ...working with and providing reasonable accommodations to applicants...Full time$180k - $270k
Research Engineer (Focused on RL) You'll bring reinforcement learning to Firecrawl's core product — building... ...this week than one polished one next month. And when you have results, you... ...techniques, production instincts, and fast reasoning. Founder Chat (~30 min) — Culture,...Full timeTemporary workRemote work- ...infrastructure / Reinforcement Learning (RL) training data & evaluations... ...The Opportunity Our partner is hiring a Research Engineer to help scale the quality assurance (QA... ...or any other protected characteristic. Reasonable accommodations are available throughout...Remote work
- Research Engineer, Virtual Collaborator at Anthropic - San Francisco, CA | New York City, NY | Seattle... ...and preventing reward hacking in RL systems Translating product requirements... ...make you an offer, we will make every reasonable effort to get you a visa, and we retain...Work at officeVisa sponsorshipFlexible hours
- Prime-Intellect is seeking a Research Engineer in San Francisco to shape the technological direction of their AI infrastructure. This role demands expertise in AI/ML engineering and the ability to lead research efforts in synthetic data generation. You will optimize AI...Remote jobFlexible hours
- ...team. You’ll work alongside researchers, operators, and AI companies... ...About the Role As a Research Engineer at Mercor, you’ll work at the... ...agentic behavior, and real-world reasoning. You’ll design and run evals,... ..., rubric design, or RL‑style workflows that use evals...Work at office
$180k - $270k
Research Engineer (Focused on Search/IR) You'll own and advance the search and... ...to tell you what to try next — you have a backlog of ideas... ...the Head of Research and the RL‑focused Research Engineer to... ...and when to use which. You can reason about relevance tradeoffs and...Full timeTemporary workRemote work- ...the first AI software engineer, and Windsurf, an AI-native... ..., former founders, and researchers from the frontier of AI... ...what they need next, and build systems that... ...hold up at our largest RL training scales. Performance... ..., and the ability to reason about performance across...
- ...Turing is the world’s leading research accelerator for frontier AI... ...pipelines that advance thinking, reasoning, coding, multimodality, and... ...and reinforcement learning (RL) environments that power post... ...: Environments for Software Engineering / coding agents UI-Environments...For contractorsFlexible hours
$315k - $340k
...a quickly growing group of committed researchers, engineers, policy experts, and business leaders... ...and honesty. Develop and test novel RL environments that reward truthful outputs... ...We sponsor visas. We will make every reasonable effort to obtain a visa if you are offered...Work at officeVisa sponsorshipFlexible hours- Adept in San Francisco is seeking a Research Engineering Manager to lead a team of research engineers and scientists. The role involves setting... ...developing a research agenda, and collaborating with leadership on next-generation AI agents. The ideal candidate has experience...
- ...Francisco, California. The Role: As a Research Engineer - Agency and Reasoning , you will be a core contributor to... ...applying your ideas at scale to our next generation of language models.... ...model reasoning or more classical RL tasks Experience with language-model...Work at officeRelocation package
- A leading AI company based in San Francisco is seeking a research systems engineer to train large language models and explore reinforcement learning techniques. The ideal candidate will work at the intersection of research and systems design experiments at scale. Benefits...
- Pantograph is looking for research engineers to build robots that learn through exploration in the real world. Ideal candidates will have strong foundations in reinforcement learning and experience working with large GPU clusters, Kubernetes, and complex distributed systems...
- talentpluto is seeking a Research Engineer to enhance the quality assurance (QA) systems supporting training data for reinforcement learning. This position demands close collaboration with stakeholders to guarantee reliability and consistency in datasets. Key responsibilities...
- ...the-art AI systems that can write code, reason about software, and act as intelligent... ...as ChatGPT and the API, as well as in next-generation tools specifically designed for agentic coding. We operate across research, engineering, product, and infrastructure—owning the...Work at officeRelocation package
$197.3k - $313.7k
## Applied Research EngineerApplyremote type: Office Tech-Flexiblelocations... ...software and platform engineers to embed in our AI team to... ...want to build and deploy the next generation of services leveraging... ....AccommodationsIf you need a reasonable accommodation during the...Work at office- ...friction with seamless automation. As a Research Engineer at Capably, you’ll help define how... ...capability. You’ll explore new approaches in reasoning, planning, tool use, memory,... ...impact will be essential to shaping the next generation of Capably’s platform. What...
$200k - $250k
Research Engineer Location San Francisco (On-site) Compensation $200,000 - $250,000 + variable equity based on cash compensation Technologies... ...Research Engineer to design, prototype, and ship the core reasoning and agentic systems that power Lotus. You will turn messy...- ...for revenue is closing in the next few months. Rox is in market.... ...every day. We see exactly where research meets production and where the... ...rewards. Offline-to-online RL on multi-touch trajectories with... ...elite and competitive engineering minds. Translate findings into...Relocation
$145.2k - $196.4k
...Francisco Employment Type Full time Department Engineering Job Summary Drata is seeking an Applied Research Engineer to drive the quality and effectiveness of... ...science behind how Drata's AI products retrieve, reason, and respond, and you'll work closely with AI and...Full timeFlexible hours$160k - $240k
Research Engineer — Evals You’ll build the evaluation systems that tell us whether Firecrawl actually... ...system. Close the loop with models and RL. Evals here aren’t a reporting layer —... ...directly influence what gets trained next. Run fast experiments and communicate clearly...Full timeTemporary workRemote work- At Camfer, our research engineers are training models to intelligently interpret and edit parametric CAD designs in 3D space. This is a cutting... ..., evals to measure performance of generations in 3D space, or RL frameworks to train agents. We are looking for people who: Have...Work at office
$170k - $230k
...Center for AI Safety (CAIS) is a leading research and advocacy organization focused on... ...Safety Action Fund. As a Senior Research Engineer here, you’ll work at the intersection of... ...records for employment. If you require a reasonable accommodation during the application or...Work at officeLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Engineer: RL & Reasoning for Next-Gen LMs. Be the first to apply!
- research software engineer San Francisco, CA
- research assistant engineering San Francisco, CA
- deep learning research engineer San Francisco, CA
- senior research engineer San Francisco, CA
- research programmer San Francisco, CA
- ai research engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- research engineer San Francisco, CA
- microsoft research San Francisco, CA
- oncology research nurse San Francisco, CA

