Research Engineer, RL Infrastructure (Knowledge Work)
$350kUnited States Digital Space LLC
About the company the company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role The Knowledge Work team builds the training environments and evaluations that make Claude effective at real-world professional workflows — searching, analyzing, and creating across the tools and documents knowledge workers use every day. As that work scales, the systems behind it need to be as rigorous as the research itself. We are looking for a Research Engineer to own the reliability, observability, and infrastructure foundation that the team's research depends on. You will be responsible for ensuring our training and evaluation runs remain stable, well‑instrumented, and high‑quality as they grow in scale and complexity. A core part of this role is shifting reliability work from reactive to proactive: hardening systems, stress‑testing at realistic scale, and building the observability and tooling that surface problems early — so researchers can stay focused on research rather than incident response. You will be the team's stable, context‑rich owner for environment health and evaluation integrity, and the primary point of contact for partner teams when issues arise. Where this role focuses: While you'll work closely with researchers building new training environments, the priority for this role is the reliability those environments depend on. It's best suited to an engineer who finds real ownership and impact in making critical systems dependable, and in being the person behind trustworthy evaluation results the entire organization relies on. Key Responsibilities Serve as the dedicated reliability owner for the Knowledge Work training environments, providing continuity of context and reducing the operational overhead of rotating ownership. Own a clean, canonical set of evaluation tools and processes for Knowledge Work capabilities, including the process used for model releases. Build and automate observability, dashboards, and operational tooling for our training environments and evaluation systems, with an emphasis on high signal‑to‑noise: a small set of trusted metrics and alerts rather than sprawling instrumentation. Proactively harden environments and evaluation systems through load testing, fault injection, and stress testing at realistic scale, so failures surface early rather than during critical training work. Act as the primary point of contact for partner training and infrastructure teams when issues in our environments arise, and drive incidents to resolution. Reduce the operational burden on researchers so they can stay focused on research. Minimum Qualifications Highly experienced Python engineer who ships reliable, well‑instrumented code that teammates trust in production. Demonstrated experience operating ML or distributed systems at scale, including significant on‑call and incident‑response experience. Strong SRE or production‑engineering mindset — reaching for SLOs, load tests, and failure injection before reaching for more dashboards. Foundational ML knowledge sufficient to understand what a training environment or evaluation is actually measuring, and recognize when an evaluation has become stale or gameable. Able to read research code and reason evaluation integrity. Preferred Qualifications 5+ years of experience operating ML or distributed systems at scale. Experience building or operating RL environments, agent harnesses, or LLM evaluation frameworks. Familiarity with reward modeling, evaluation design, or detecting and mitigating reward hacking. Experience with observability stacks (metrics, tracing, structured logging) and operational dashboard tooling. Background in chaos engineering, fault injection, or large‑scale load testing. Experience with data quality pipelines, drift detection, or evaluation‑set curation and versioning. Familiarity with large‑scale training or inference infrastructure (schedulers, multi‑agent orchestration, sandboxed execution). Prior experience as a dedicated reliability or operations owner embedded within a research team. Annual Salary Annual Salary: $350,000—$850,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience. Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience. Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position. Location‑based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. #J-18808-Ljbffr United States Digital Space LLC
- ...quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the RL Teams Our Reinforcement Learning... ...models Building scalable RL infrastructure and training methodologies Enhancing...SuggestedVisa sponsorship
$350k
...quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the RL Teams Our Reinforcement Learning... ...models Building scalable RL infrastructure and training methodologies Enhancing...SuggestedWork at officeVisa sponsorshipFlexible hours$350k
Menlo Ventures is looking for a Research Engineer for their Code RL team. This role focuses on advancing AI models' coding capabilities while ensuring... ...$350,000 to $850,000, and benefits including flexible working hours and a lovely office in San Francisco. #J-18808-Ljbffr...SuggestedWork at officeFlexible hours- United States Digital Space LLC in San Francisco is hiring a Research Engineer for the Code RL team. This position aims to enhance AI models' software... ...and a passion for AI. The position offers a hybrid work policy and an annual salary range of $500,000 to $850,000...Suggested
- ...technology-driven AI company in San Francisco is hiring a Research Engineer for its Cybersecurity RL team. This role involves advancing AI capabilities in... ...benefit from competitive compensation and a supportive work environment that values safety and collaboration. #J-18...SuggestedFlexible hours
$310k
About the Team The RL and Reasoning team drives the core reasoning... ...of reinforcement learning research, building next-generation generative... ...About the Role As a Research Engineer/Research Scientist at OpenAI,... ...cutting-edge RL methods. Your work will sit at the heart of...Work at officeRelocation package$200k
A technology company in San Francisco is seeking a Software Engineer for their RL Research & Environments team. The role focuses on designing and improving data and evaluation systems to enhance model capabilities. Candidates should have a strong software engineering background...- The role As a platform research engineer, you’ll build the... ..., continual learning infrastructure, agent-building tooling... ...and integrating the RL stack that powers agent... ...engineers (who work directly with customers... ...combined with deep ML/AI knowledge Experience building...Work at officeVisa sponsorshipRelocation package
- hillclimb is seeking a research engineer to work on synthetic data generation and maintain quality pipelines for RL environments. The ideal candidate will possess a strong understanding of NLP and RL techniques, alongside a solid grasp of data structures and modern programming...
- ...defining the future of work. We partner with leading... ...students: by sharing knowledge, experience, and context... ...You’ll work alongside researchers, operators, and AI companies... ...the Role As a Research Engineer at Mercor, you’ll work... ..., rubric design, or RL‑style workflows that use...Work at office
- Research Engineer, Virtual Collaborator at Anthropic - San Francisco, CA |... ...experts, and business leaders working together to build beneficial... ...at general tasks, a lot of knowledge work requires targeted... ...preventing reward hacking in RL systems Translating product...Work at officeVisa sponsorshipFlexible hours
- Tykhe Inc in San Francisco, CA is seeking a Research Engineer who will be responsible for designing experiments and building task generation systems. You will work on generating realistic curricula and transforming research prototypes into reliable systems. The ideal candidate...
- ...cutting-edge AI company based in San Francisco is seeking a Research Engineer specializing in Agency and Reasoning. The role focuses on performing... ...Python. The company values creativity and provides a dynamic work environment with excellent benefits, including comprehensive...
$315k - $340k
[Expression of Interest] Research Scientist/Engineer, Honesty About Anthropic Anthropic... ..., and business leaders working together to build... ...accuracy given the model\'s knowledge Develop specialized classifiers... ...honesty Develop and test novel RL environments that reward truthful...Full timeWork at officeVisa sponsorshipFlexible hours$350k
...quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI... ...for accuracy given the model’s knowledge Develop specialized... ...honesty Develop and test novel RL environments that reward truthful...Visa sponsorship- Adept in San Francisco is seeking a Research Engineering Manager to lead a team of research engineers and scientists. The role involves setting team goals, developing a research agenda, and collaborating with leadership on next-generation AI agents. The ideal candidate...
- A leading AI company based in San Francisco is seeking a research systems engineer to train large language models and explore reinforcement learning techniques. The ideal candidate will work at the intersection of research and systems design experiments at scale. Benefits...
- Pantograph is looking for research engineers to build robots that learn through exploration in the real world. Ideal candidates will have strong foundations in reinforcement learning and experience working with large GPU clusters, Kubernetes, and complex distributed systems...
- General Analysis, based in San Francisco, is seeking a Research Engineer to lead efforts in post-training models and adversarial simulations. This role requires expertise in reinforcement learning and a solid understanding of the entire training pipeline. The ideal candidate...
- talentpluto is seeking a Research Engineer to enhance the quality assurance (QA) systems supporting training data for reinforcement learning. This position demands close collaboration with stakeholders to guarantee reliability and consistency in datasets. Key responsibilities...
$350k
Mirendil is seeking research engineers in San Francisco to build the post-training stack for frontier reasoning models. You will engage in... ...reinforcement learning methodologies, focusing on scalable infrastructure for large-scale AI research. The position offers a base...- ...Path’s AI Therapist. Combine research, data science, and engineering to create models,... ...training data curation, building RL environments, new model... ..., and Clinical Guardrails Work with clinicians and internal... ...therapy or coaching Domain knowledge of psychology,...
- ...Jetson AGX Orin. Every research project will have a... ...our imitation learning infrastructure. Build the data flywheel... ...codebases (you'll work directly with open-source... ...: imitation learning, RL, vision-language models... ...actual robot. Working knowledge of ROS2 or equivalent...
- ...Customer Support startup with their search for senior/staff ML research engineers. The role will be onsite in their SF office. What you'... ...synthetic data, performing supervised fine-tuning and RL, and working on evaluations and deployments. ~ Familiar with SFT, FL,...Work at office
- ...seeking a Member of Technical Staff for RL Infrastructure in San Francisco. This role... ...Candidates should have strong software engineering experience, particularly in... ...inference or RL training, and be able to work closely with ML researchers. #J-18808-Ljbffr VMAX LLC
$300k
A rare infrastructure role in a frontier RL research operation. Compensation: $300K-$500K base + equity. San Francisco, on-site. Hiring urgent. Opportunity... ...In a seed‑stage, well‑funded AI company, a small engineering team works with top researchers to automate task objectives...H1b- Introduction The Center for AI Safety is a research and field-building nonprofit located in... ...building and technical research. As a research engineer here, you will pursue a variety of... ...with teammates. Have co-authored an NLP or RL paper in a top conference. About Us The...Work at office
- ...the gap between cutting-edge research and production systems.... ...research papers and ideas into working prototypes Collaborate with... ...communication between research and engineering teams What We're Looking For... ...and prototyping novel ideas Knowledge of machine learning...
$295k
Research Engineer - Speech & Realtime Models B2B Applications - San Francisco About the Team OpenAI... ...Team, you will have the opportunity to work with some of the brightest minds in AI.... ...AI, participate in code reviews, share knowledge, and lead by example. Monitor and...Internship$315k
As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the... ...about the societal impacts of your work Have clear written and verbal communication... ...models Complex shared codebases and RL infrastructure Authoring research papers in machine...Work at officeHome officeVisa sponsorshipRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Engineer, RL Infrastructure (Knowledge Work). Be the first to apply!
- research assistant engineering San Francisco, CA
- ai research engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- research engineer San Francisco, CA
- research programmer San Francisco, CA
- deep learning research engineer San Francisco, CA
- research software engineer San Francisco, CA
- senior research engineer San Francisco, CA
- security infrastructure engineer San Francisco, CA
- infrastructure engineer San Francisco, CA

