Research Engineer, Environment Scaling
$350k
Anthropic
About Anthropic
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role
The Environment Scaling team is a team of researchers and engineers whose goal is to improve the intelligence of our public models for novel verticals and use cases. The team builds the training environments that fuel RL at scale. This is a unique role that combines executing directly on ML research, data operations, and project management to improve our models. You'll own the end-to-end process of creating RL environments for new capabilities: identifying high-value tasks, designing reward signals, managing vendor relationships, and measuring impact on model performance.
Responsibilities:
- Improve and execute our fine-tuning strategies for adapting Claude to new domains and tasks
- Manage technical relationships with external data vendors, including evaluation of data quality and reward design
- Collaborate with domain experts to design data pipelines and evaluations
- Explore novel ways of creating RL environments for high value tasks
- Develop and improve QA frameworks to catch reward hacking and ensure environment quality
- Partner with other RL research teams and product teams to translate capability goals into training environments and evals
You may be a good fit if you:
- Have experience with fine-tuning large language models for specific domains or real-world use cases and/or domain expertise in an area where we would like to make our models more useful
- Have experience with reinforcement learning, reward design, or training data curation for LLMs
- Are comfortable managing technical vendor relationships and iterating quickly on feedback
- Find value in reading through datasets to understand them and spot issues
- Have strong project management and interpersonal skills
- Are passionate about making AI more useful and accessible across different industries
- Are excited about a role that includes a combination of ML research, data operations, and project management
Strong candidates may also:
- Have experience training production ML systems
- Be familiar with distributed systems and cloud infrastructure
- Have domain expertise in an area where we would like to make our models more useful
- Have experience working with external vendors or technical partners
For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.
Annual Salary: $350,000-$850,000 USD

Logistics
- Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience
- Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
- Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position
- Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
- Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links; visit anthropic.com/careers directly for confirmed position openings.

How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
Vacancy posted 5 hours ago