Research Engineer: RL Data QA & Tooling
talentpluto
talentpluto is seeking a Research Engineer to enhance the quality assurance (QA) systems supporting training data for reinforcement learning. This position demands close collaboration with stakeholders to guarantee reliability and consistency in datasets. Key responsibilities include defining quality standards, building auditing workflows, and integrating learnings into processes. Ideal candidates will have strong Python skills, experience with Docker, and a proven ability to operate independently in fast-paced settings. Competitive compensation and benefits are offered. #J-18808-Ljbffr talentpluto
- ...infrastructure / Reinforcement Learning (RL) training data & evaluations Compensation:... ...company building the tooling and workflows that power reinforcement... ...Our partner is hiring a Research Engineer to help scale the quality assurance (QA) systems behind training data generated...SuggestedRemote work
$310k
About the Team The RL and Reasoning team drives the... ...reinforcement learning research, building next-... ...the Role As a Research Engineer/Research Scientist at OpenAI... ...an extremely powerful tool that must be created with... ...possession (including the data contained therein) upon...SuggestedWork at officeRelocation package- ...that are used by many engineers , Experience in building... ...automating workflows and data visualization ,... ...in rapid iteration of research to production cycles ,... ...environments for AI agents , Tools for automating common workflows... ...on large scale RL jobs , Refactoring existing...Suggested
$320k
...Anthropic Rl Data Engineer Anthropic's mission is to create reliable,... ...growing group of committed researchers, engineers, policy experts,... ...collection pipelines, human feedback tooling, the execution environments... ...Develop and improve QA frameworks to catch reward hacking...SuggestedVisa sponsorship- A technology-driven AI company in San Francisco is hiring a Research Engineer for its Cybersecurity RL team. This role involves advancing AI capabilities in secure coding and vulnerability remediation by blending research with engineering tasks. Candidates should have...SuggestedFlexible hours
$200k
A technology company in San Francisco is seeking a Software Engineer for their RL Research & Environments team. The role focuses on designing and improving data and evaluation systems to enhance model capabilities. Candidates should have a strong software engineering background...- Plaid Inc in San Francisco is looking for an Analytics Engineer to own and build data models, predictive frameworks, and analytics tools. The ideal candidate has at least 4 years of experience in analytics engineering or data science, advanced SQL skills, and proficiency...
- ...expertise in model innovation and systems engineering paired with a design‑minded product... ...global AI, our models must be trained on data that reflects the world’s diversity of languages... ...building scalable systems that bridge research and production. What We Offer...Work at officeRelocation package
- A technology company in San Francisco is looking for a candidate to drive research initiatives that influence engineering solutions. You'll build evaluations using real tool data, tackle search challenges for tools, and train systems for improved accuracy. Ideal candidates...
- ...petabytes of video, lidar, radar, and sensor data. But today's data platforms (Databricks,... ...in 2022 to close it. Our open‑source engine, Daft, is the distributed data engine purpose... ...Mission district office. Your Role As a Research Engineer on the Visual Understanding team...Hourly payWork at officeFlexible hoursNight shift1 day per week
$200k
...safe AGI lies in automating research and code generation to improve... ...pre‑training, domain‑specific RL, ultra‑long context, and... ...lead developer experience and data tooling for our pre‑training data team... ...organization, packaging, and engineering best practices What we’re looking...Work at officeRelocationVisa sponsorship- OpenAI is looking for a self-starter full stack engineer in San Francisco to rapidly prototype and develop internal tools for researchers. The role requires proficiency in Python and experience building maintainable codebases. Ideal candidates are comfortable with both...Relocation package
$180k - $270k
Research Engineer (Focused on RL) You'll bring reinforcement learning to Firecrawl's core product — building... ..., understanding, and structuring web data. This isn't theoretical RL research.... ...ll be frustrated here. You build the tools you need. People who only know one paradigm...Full timeTemporary workRemote work- ...pair it with the full RL post-training stack: environments... ...RL trainer. We enable researchers, startups and... ...adapting models to real tools, workflows, and deployment... .... As a Research Engineer in our Reasoning team,... ...working with synthetic data and teaching LLMs reasoning...Remote workWorldwideVisa sponsorshipRelocation packageFlexible hours
$150k - $300k
...pair it with the full rl post-training stack: environments... ...RL trainer. We enable researchers, startups and... ...adapting models to real tools, workflows, and deployment... .... As a Research Engineer working on Distributed... ...training techniques such as data, tensor & pipeline parallelism...Remote workWorldwideVisa sponsorshipRelocation packageFlexible hours$160k - $240k
Research Engineer — Evals You’ll build the evaluation systems that... ...structured, LLM‑ready data reliably — is hard to... ...build the human review tooling needed when automation... ...the loop with models and RL. Evals here aren’t a reporting... ...‑class product, not a QA gate. Researchers who...Full timeTemporary workRemote work- At Camfer, our research engineers are training models to intelligently interpret and edit parametric... ...of generations in 3D space, or RL frameworks to train agents. We are looking... ...everyone. Our first step is creating a tool that allows engineers to use text, images...Work at office
- Research Engineer, Virtual Collaborator at Anthropic - San Francisco, CA | New... ...on real organizational data and workflows. Your job will... ...platforms Working with enterprise tools and APIs (Google Workspace,... ...preventing reward hacking in RL systems Translating product...Work at officeVisa sponsorshipFlexible hours
- Prime-Intellect is seeking a Research Engineer in San Francisco to shape the technological direction of their AI infrastructure. This role demands... ...and the ability to lead research efforts in synthetic data generation. You will optimize AI inference workloads and contribute...Remote jobFlexible hours
- ...You’ll work alongside researchers, operators, and AI companies... ...the Role As a Research Engineer at Mercor, you’ll work... ...define how we measure tool use, agentic behavior,... ...-training, RLVR, and data pipelines. What You’ll... ...generation, rubric design, or RL‑style workflows that...Work at office
- A cutting-edge AI company based in San Francisco is seeking a Research Engineer specializing in Agency and Reasoning. The role focuses on performing research in reinforcement learning and applying innovative ideas to the next generation of their language models. Candidates...
$264.8k - $331k
...Staff Machine Learning Research Engineer, Agent Post-training - Enterprise... ...Scale has been the leading AI data foundry, helping fuel the most... ...arsenal of proprietary research, tools, and resources that serve all... ...build out our next-gen Agent RL training platform. You'll...Full time- A cutting-edge AI research lab in San Francisco seeks a Product Engineer to develop a video collection platform. In this role, you will build features across frontend and backend while improving platform reliability and performance. Ideal candidates are strong full stack...
$250k - $300k
...breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data-centric approaches that are... ...development: Enterprise Platform & Tools : Advanced annotation tools,... ...SFT) and reinforcement learning (RL) methodologies. Implement and...Work at officeFlexible hours2 days per week- ...the world’s leading research accelerator for... ...with high-quality data, specialized talent... ...reinforcement learning (RL) environments that... ...Retail, Developer Tools, Collaboration, Customer... ...for Software Engineering / coding agents UI... ...them and setting up QA processes. Hands‑on...For contractorsFlexible hours
$315k - $340k
[Expression of Interest] Research Scientist/Engineer, Honesty About Anthropic Anthropic... ...Design and implement novel data curation pipelines to... ...honesty Develop and test novel RL environments that reward truthful... ...fabricated claims Create tools to help human evaluators efficiently...Full timeWork at officeVisa sponsorshipFlexible hours- ...Devin, the first AI software engineer, and Windsurf, an AI-... ...programmers, former founders, and researchers from the frontier of AI,... ...experiment orchestration, data pipelines, and the tooling that turns raw compute... ...that hold up at our largest RL training scales....
$264.8k - $331k
...has been the leading AI data foundry, helping fuel... ...The Enterprise ML Research Lab works on the front... ...As an ML Sys Research Engineer, you'll work on building... ...for our next-gen Agent RL training platform, support... ...for multi-agent/multi-tool rollouts. Ideally you...Full time$119k - $299.93k
...processes and related controls. Those in data, analytics and technology solutions at PwC... ...wrangling technology, and automation tools to leverage data and focus on establishing... ...years of professional AI/ML development, engineering, or testing experience What Sets You...Full timeH1b- Adept in San Francisco is seeking a Research Engineering Manager to lead a team of research engineers and scientists. The role involves setting team goals, developing a research agenda, and collaborating with leadership on next-generation AI agents. The ideal candidate...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Engineer: RL Data QA & Tooling. Be the first to apply!
- research software engineer San Francisco, CA
- research assistant engineering San Francisco, CA
- deep learning research engineer San Francisco, CA
- senior research engineer San Francisco, CA
- research programmer San Francisco, CA
- ai research engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- research engineer San Francisco, CA
- staff data engineer San Francisco, CA
- data engineering intern summer San Francisco, CA


