Research Engineer, RL Infrastructure (Knowledge Work)

$350k

United States Digital Space LLC

About the company the company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role The Knowledge Work team builds the training environments and evaluations that make Claude effective at real-world professional workflows — searching, analyzing, and creating across the tools and documents knowledge workers use every day. As that work scales, the systems behind it need to be as rigorous as the research itself. We are looking for a Research Engineer to own the reliability, observability, and infrastructure foundation that the team's research depends on. You will be responsible for ensuring our training and evaluation runs remain stable, well‑instrumented, and high‑quality as they grow in scale and complexity. A core part of this role is shifting reliability work from reactive to proactive: hardening systems, stress‑testing at realistic scale, and building the observability and tooling that surface problems early — so researchers can stay focused on research rather than incident response. You will be the team's stable, context‑rich owner for environment health and evaluation integrity, and the primary point of contact for partner teams when issues arise. Where this role focuses: While you'll work closely with researchers building new training environments, the priority for this role is the reliability those environments depend on. It's best suited to an engineer who finds real ownership and impact in making critical systems dependable, and in being the person behind trustworthy evaluation results the entire organization relies on. Key Responsibilities Serve as the dedicated reliability owner for the Knowledge Work training environments, providing continuity of context and reducing the operational overhead of rotating ownership. Own a clean, canonical set of evaluation tools and processes for Knowledge Work capabilities, including the process used for model releases. Build and automate observability, dashboards, and operational tooling for our training environments and evaluation systems, with an emphasis on high signal‑to‑noise: a small set of trusted metrics and alerts rather than sprawling instrumentation. Proactively harden environments and evaluation systems through load testing, fault injection, and stress testing at realistic scale, so failures surface early rather than during critical training work. Act as the primary point of contact for partner training and infrastructure teams when issues in our environments arise, and drive incidents to resolution. Reduce the operational burden on researchers so they can stay focused on research. Minimum Qualifications Highly experienced Python engineer who ships reliable, well‑instrumented code that teammates trust in production. Demonstrated experience operating ML or distributed systems at scale, including significant on‑call and incident‑response experience. Strong SRE or production‑engineering mindset — reaching for SLOs, load tests, and failure injection before reaching for more dashboards. Foundational ML knowledge sufficient to understand what a training environment or evaluation is actually measuring, and recognize when an evaluation has become stale or gameable. Able to read research code and reason evaluation integrity. Preferred Qualifications 5+ years of experience operating ML or distributed systems at scale. Experience building or operating RL environments, agent harnesses, or LLM evaluation frameworks. Familiarity with reward modeling, evaluation design, or detecting and mitigating reward hacking. Experience with observability stacks (metrics, tracing, structured logging) and operational dashboard tooling. Background in chaos engineering, fault injection, or large‑scale load testing. Experience with data quality pipelines, drift detection, or evaluation‑set curation and versioning. Familiarity with large‑scale training or inference infrastructure (schedulers, multi‑agent orchestration, sandboxed execution). Prior experience as a dedicated reliability or operations owner embedded within a research team. Annual Salary Annual Salary: $350,000—$850,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience. Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience. Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position. Location‑based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. #J-18808-Ljbffr United States Digital Space LLC

Apply

Vacancy posted 17 hours ago

Similar jobs that could be interesting for youBased on the Research Engineer, RL Infrastructure (Knowledge Work) in San Francisco, CA vacancy

Research Engineer, Code RL (Reinforcement Learning)
...quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the RL Teams Our Reinforcement Learning... ...models Building scalable RL infrastructure and training methodologies Enhancing...
Suggested
Visa sponsorship
United States Digital Space LLC
San Francisco, CA
17 hours ago
Research Engineer, Performance RL (Reinforcement Learning)
$350k
...quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the RL Teams Our Reinforcement Learning... ...models Building scalable RL infrastructure and training methodologies Enhancing...
Suggested
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
1 day ago
Performance RL Research Engineer: Safe Code for Accelerators
$350k
Menlo Ventures is looking for a Research Engineer for their Code RL team. This role focuses on advancing AI models' coding capabilities while ensuring... ...$350,000 to $850,000, and benefits including flexible working hours and a lovely office in San Francisco. #J-18808-Ljbffr...
Suggested
Work at office
Flexible hours
Menlo Ventures
San Francisco, CA
5 days ago
Research Engineer, Code RL: Build & Validate AI Code
United States Digital Space LLC in San Francisco is hiring a Research Engineer for the Code RL team. This position aims to enhance AI models' software... ...and a passion for AI. The position offers a hybrid work policy and an annual salary range of $500,000 to $850,000...
Suggested
United States Digital Space LLC
San Francisco, CA
5 days ago
Cybersecurity RL Research Engineer - Flexible Hours
...technology-driven AI company in San Francisco is hiring a Research Engineer for its Cybersecurity RL team. This role involves advancing AI capabilities in... ...benefit from competitive compensation and a supportive work environment that values safety and collaboration. #J-18...
Suggested
Flexible hours
Menlo Ventures
San Francisco, CA
4 days ago
Research Engineer/Research Scientist, RL/Reasoning
$310k
About the Team The RL and Reasoning team drives the core reasoning... ...of reinforcement learning research, building next-generation generative... ...About the Role As a Research Engineer/Research Scientist at OpenAI,... ...cutting-edge RL methods. Your work will sit at the heart of...
Work at office
Relocation package
Slope
San Francisco, CA
2 days ago
Staff RL Research Engineer — Post-Training Environments
$200k
A technology company in San Francisco is seeking a Software Engineer for their RL Research & Environments team. The role focuses on designing and improving data and evaluation systems to enhance model capabilities. Candidates should have a strong software engineering background...
SupportFinity™
San Francisco, CA
4 days ago
Platform Research Engineer
The role As a platform research engineer, you’ll build the... ..., continual learning infrastructure, agent-building tooling... ...and integrating the RL stack that powers agent... ...engineers (who work directly with customers... ...combined with deep ML/AI knowledge Experience building...
Work at office
Visa sponsorship
Relocation package
Applied Compute
San Francisco, CA
2 days ago
Research Engineer - Synthetic RL Data & Systems
hillclimb is seeking a research engineer to work on synthetic data generation and maintain quality pipelines for RL environments. The ideal candidate will possess a strong understanding of NLP and RL techniques, alongside a solid grasp of data structures and modern programming...
hillclimb
San Francisco, CA
1 day ago
Research Engineer - Benchmarking, Evals & Failure Analysis
...defining the future of work. We partner with leading... ...students: by sharing knowledge, experience, and context... ...You’ll work alongside researchers, operators, and AI companies... ...the Role As a Research Engineer at Mercor, you’ll work... ..., rubric design, or RL‑style workflows that use...
Work at office
Mercor
San Francisco, CA
2 days ago
Research Engineer, Virtual Collaborator at Anthropic - San Francisco, CA | New York City, NY | [...]
Research Engineer, Virtual Collaborator at Anthropic - San Francisco, CA |... ...experts, and business leaders working together to build beneficial... ...at general tasks, a lot of knowledge work requires targeted... ...preventing reward hacking in RL systems Translating product...
Work at office
Visa sponsorship
Flexible hours
Victrays
San Francisco, CA
2 days ago
Research Engineer — End-to-End Curriculum Generation & RL Systems
Tykhe Inc in San Francisco, CA is seeking a Research Engineer who will be responsible for designing experiments and building task generation systems. You will work on generating realistic curricula and transforming research prototypes into reliable systems. The ideal candidate...
Tykhe Inc
San Francisco, CA
4 days ago
Research Engineer: RL & Reasoning for Next-Gen LMs
...cutting-edge AI company based in San Francisco is seeking a Research Engineer specializing in Agency and Reasoning. The role focuses on performing... ...Python. The company values creativity and provides a dynamic work environment with excellent benefits, including comprehensive...
Zyphra
San Francisco, CA
1 day ago
[Expression of Interest] Research Scientist / Engineer, Honesty
$315k - $340k
[Expression of Interest] Research Scientist/Engineer, Honesty About Anthropic Anthropic... ..., and business leaders working together to build... ...accuracy given the model\'s knowledge Develop specialized classifiers... ...honesty Develop and test novel RL environments that reward truthful...
Full time
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
1 day ago
[Expression of Interest] Research Scientist / Engineer, Honesty
$350k
...quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI... ...for accuracy given the model’s knowledge Develop specialized... ...honesty Develop and test novel RL environments that reward truthful...
Visa sponsorship
Anthropic
San Francisco, CA
1 day ago
Research Engineering Manager — AI Agents & RL Leader
Adept in San Francisco is seeking a Research Engineering Manager to lead a team of research engineers and scientists. The role involves setting team goals, developing a research agenda, and collaborating with leadership on next-generation AI agents. The ideal candidate...
I did my part and supported the Regular Toilet
San Francisco, CA
5 days ago
Research Systems Engineer: Frontier RL for Enterprise
A leading AI company based in San Francisco is seeking a research systems engineer to train large language models and explore reinforcement learning techniques. The ideal candidate will work at the intersection of research and systems design experiments at scale. Benefits...
Applied Compute Inc.
San Francisco, CA
2 days ago
Real-World Robotics RL Research Engineer
Pantograph is looking for research engineers to build robots that learn through exploration in the real world. Ideal candidates will have strong foundations in reinforcement learning and experience working with large GPU clusters, Kubernetes, and complex distributed systems...
Pantograph
San Francisco, CA
5 days ago
Research Engineer - AI Security & RL Systems
General Analysis, based in San Francisco, is seeking a Research Engineer to lead efforts in post-training models and adversarial simulations. This role requires expertise in reinforcement learning and a solid understanding of the entire training pipeline. The ideal candidate...
General Analysis
San Francisco, CA
5 days ago
Research Engineer: RL Data QA & Tooling
talentpluto is seeking a Research Engineer to enhance the quality assurance (QA) systems supporting training data for reinforcement learning. This position demands close collaboration with stakeholders to guarantee reliability and consistency in datasets. Key responsibilities...
talentpluto
San Francisco, CA
5 days ago
Frontier AI Research Engineer - Post-Training RL/Reasoning
$350k
Mirendil is seeking research engineers in San Francisco to build the post-training stack for frontier reasoning models. You will engage in... ...reinforcement learning methodologies, focusing on scalable infrastructure for large-scale AI research. The position offers a base...
Mirendil
San Francisco, CA
2 days ago
AI Research Engineer
...Path’s AI Therapist. Combine research, data science, and engineering to create models,... ...training data curation, building RL environments, new model... ..., and Clinical Guardrails Work with clinicians and internal... ...therapy or coaching Domain knowledge of psychology,...
The Path
San Francisco, CA
3 days ago
Founding AI Research Engineer - Robot Learning
...Jetson AGX Orin. Every research project will have a... ...our imitation learning infrastructure. Build the data flywheel... ...codebases (you'll work directly with open-source... ...: imitation learning, RL, vision-language models... ...actual robot. Working knowledge of ROS2 or equivalent...
Origin
San Francisco, CA
5 days ago
Senior/Staff ML Research Engineer
...Customer Support startup with their search for senior/staff ML research engineers. The role will be onsite in their SF office. What you'... ...synthetic data, performing supervised fine-tuning and RL, and working on evaluations and deployments. ~ Familiar with SFT, FL,...
Work at office
DRH Search
San Francisco, CA
3 hours ago
Hybrid SF RL Infrastructure Engineer GPU-Scale Orchestration
...seeking a Member of Technical Staff for RL Infrastructure in San Francisco. This role... ...Candidates should have strong software engineering experience, particularly in... ...inference or RL training, and be able to work closely with ML researchers. #J-18808-Ljbffr VMAX LLC
VMAX LLC
San Francisco, CA
3 days ago
RL Infrastructure Engineer — Frontier AI Research
$300k
A rare infrastructure role in a frontier RL research operation. Compensation: $300K-$500K base + equity. San Francisco, on-site. Hiring urgent. Opportunity... ...In a seed‑stage, well‑funded AI company, a small engineering team works with top researchers to automate task objectives...
H1b
Aionia Group
San Francisco, CA
5 days ago
Research Engineer
Introduction The Center for AI Safety is a research and field-building nonprofit located in... ...building and technical research. As a research engineer here, you will pursue a variety of... ...with teammates. Have co-authored an NLP or RL paper in a top conference. About Us The...
Work at office
Center for AI Safety
San Francisco, CA
3 days ago
Research Engineer
...the gap between cutting-edge research and production systems.... ...research papers and ideas into working prototypes Collaborate with... ...communication between research and engineering teams What We're Looking For... ...and prototyping novel ideas Knowledge of machine learning...
Engramme, Inc.
San Francisco, CA
2 days ago
Research Engineer - Speech & Realtime Models
$295k
Research Engineer - Speech & Realtime Models B2B Applications - San Francisco About the Team OpenAI... ...Team, you will have the opportunity to work with some of the brightest minds in AI.... ...AI, participate in code reviews, share knowledge, and lead by example. Monitor and...
Internship
OpenAI
San Francisco, CA
2 days ago
Research Engineer, Applied Finetuning
$315k
As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the... ...about the societal impacts of your work Have clear written and verbal communication... ...models Complex shared codebases and RL infrastructure Authoring research papers in machine...
Work at office
Home office
Visa sponsorship
Relocation package
Anthropic
San Francisco, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Engineer, RL Infrastructure (Knowledge Work). Be the first to apply!