Software Engineer, RL Data

$320k

Colorwave Inc

Anthropic RL Data Engineer

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About The Role

Anthropic's RL Data team builds the systems that produce high-quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data trustworthy at scale. Our goal is to make Claude genuinely great at complex, real-world work, and to point those capabilities at the things that matter most, including AI safety research and beneficial deployments of AI. (To be upfront: this is dual-use work. It advances general capabilities too, though we aim to differentially advance the beneficial ones.)

This is a foundational role on a new team: you'll help shape our technical direction and what we build first. The work is hands-on and varied. Some weeks you'll be deep in pipeline or infrastructure engineering; others you'll be tuning prompts until the output is good, or sitting with a research team that depends on your systems and shipping the fixes they need. We're looking for strong engineers who will also do whatever else it takes to make their systems succeed: reading transcripts, supporting users, and wrangling vendors.

Key Responsibilities
  • Own significant parts of our stack end-to-end, from technical architecture through the unglamorous operational work that makes it succeed
  • Build data collection pipelines, read the transcripts they produce, and iterate on prompts, evals, and graders until the output is good
  • Develop and improve QA frameworks to catch reward hacking and ensure environment quality
  • Build interfaces that make collecting human data fast and painless for the people providing it
  • Harden execution environments — sandboxing, snapshotting, tool coverage — so tasks hold up at training scale
  • Embed with the teams and domain experts who use our systems day-to-day: design pipelines and evals with them, support them directly, and ship the improvements they need
  • Work with operations, security, and compliance partners to roll our systems out to new users, and manage technical relationships with external data vendors
Minimum Qualifications
  • Strong software engineering skills and proficiency in at least one modern programming language — we mostly use Python and TypeScript, and care more that you pick new tools up quickly than that you know our exact stack
  • Experience designing, building, and running backend systems or infrastructure
  • Effective use of AI tools in your own day-to-day work
  • Willingness to own problems end-to-end, including the parts that aren't engineering
  • Proactive, open communication: you can be trusted to run a workstream, and to escalate early when something's off
  • Comfort iterating quickly in ambiguous, fast-changing situations
  • Care about the societal impacts of your work
Preferred Qualifications
  • Experience building LLM-powered systems: prompt pipelines, evals, or products with models in the loop
  • Experience with reinforcement learning on LLMs: creating environments, rewards, graders, or training data
  • Time as a forward deployed engineer, founder, or early startup engineer — roles where you owned the outcome, not just the code
  • Experience shipping user-facing products, or internal platforms people love: interviewing users, hunting down friction, measurably improving the experience
  • Experience building data pipelines or integrations that move, transform, and index data from many sources
  • Experience building connectors or integrations with third-party tools and APIs, such as MCP servers
  • Experience with containers, Kubernetes, or simulation infrastructure
  • Experience handling sensitive data or working under tight security controls
  • Experience working with external data vendors
  • Basic familiarity with AI safety or security research
Representative Projects
  • Take QA checks that a model has learned to game, and make them hold up under heavy optimization pressure
  • Build a review flow that lets a busy expert check an RL task in under five minutes
  • Cut the time from 'rough task idea' to 'QA-passed RL task' from days to hours
  • Sit for a week with a team that uses our platform, then ship the fixes that help them most
  • Harden a sandboxed environment so tasks behave correctly across millions of rollouts
  • Onboard a new data vendor, and fix the rough edges they hit

The annual compensation range for this role is listed below. For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary

$320,000—$485,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification.

Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us.

To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.

How We're Different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come Work With Us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

Vacancy posted 2 days ago