
Research Engineer, Environment Scaling

$350k

Anthropic

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

The Environment Scaling team is a team of researchers and engineers whose goal is to improve the intelligence of our public models for novel verticals and use cases. The team builds the training environments that fuel RL at scale. This is a unique role that combines executing directly on ML research, data operations, and project management to improve our models. You'll own the end-to-end process of creating RL environments for new capabilities: identifying high-value tasks, designing reward signals, managing vendor relationships, and measuring impact on model performance.
Responsibilities:
  • Improve and execute our fine-tuning strategies for adapting Claude to new domains and tasks
  • Manage technical relationships with external data vendors, including evaluation of data quality and reward design
  • Collaborate with domain experts to design data pipelines and evaluations
  • Explore novel ways of creating RL environments for high value tasks
  • Develop and improve QA frameworks to catch reward hacking and ensure environment quality
  • Partner with other RL research teams and product teams to translate capability goals into training environments and evals
You may be a good fit if you:
  • Have experience with fine-tuning large language models for specific domains or real-world use cases and/or domain expertise in an area where we would like to make our models more useful
  • Have experience with reinforcement learning, reward design, or training data curation for LLMs
  • Are comfortable managing technical vendor relationships and iterating quickly on feedback
  • Find value in reading through datasets to understand them and spot issues
  • Have strong project management and interpersonal skills
  • Are passionate about making AI more useful and accessible across different industries
  • Are excited about a role that includes a combination of ML research, data operations, and project management
Strong candidates may also:
  • Have experience training production ML systems
  • Be familiar with distributed systems and cloud infrastructure
  • Have domain expertise in an area where we would like to make our models more useful
  • Have experience working with external vendors or technical partners

The annual compensation range for this role is listed below.


For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary:

$350,000-$850,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links; visit anthropic.com/careers directly for confirmed position openings.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than working on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
Vacancy posted 5 hours ago