
Reinforcement Learning Engineer

$188k - $275k

CoreWeave

CoreWeave, the AI Hyperscaler™, acquired Weights & Biases to create the most powerful end-to-end platform to develop, deploy, and iterate AI faster. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe, and was ranked as one of the TIME100 most influential companies of 2024. By bringing together CoreWeave's industry-leading cloud infrastructure with the best-in-class tools AI practitioners know and love from Weights & Biases, we're setting a new standard for how AI is built, trained, and scaled.

The integration of our teams and technologies is accelerating our shared mission: to empower developers with the tools and infrastructure they need to push the boundaries of what AI can do. From experiment tracking and model optimization to high-performance training clusters, agent building, and inference at scale, we're combining forces to serve the full AI lifecycle - all in one seamless platform.

Weights & Biases has long been trusted by over 1,500 organizations - including AstraZeneca, Canva, Cohere, OpenAI, Meta, Snowflake, Square, Toyota, and Wayve - to build better models, AI agents, and applications. Now, as part of CoreWeave, that impact is amplified across a broader ecosystem of AI innovators, researchers, and enterprises.

As we unite under one vision, we're looking for bold thinkers and agile builders who are excited to shape the future of AI alongside us. If you're passionate about solving complex problems at the intersection of software, hardware, and AI, there's never been a more exciting time to join our team.

Our Team

The OpenPipe team at CoreWeave is building tools to help agents learn from experience. This is a critical step toward making agents reliable enough to perform long tasks autonomously, the way human employees do. We're systematically identifying and solving the major bottlenecks between today's tech and those future self-improving agents. So far, we've:
  • Released ART, the easiest library for getting started with RL.
  • Developed RULER, a general-purpose reward function that works across many diverse tasks.
  • Built Serverless RL, an elegant API that gives RL practitioners full control over their data, environment and reward function while letting them outsource the headaches of managing GPU infrastructure.
These releases have a theme: we're systematically tackling each major roadblock to successfully training self-improving agents. Several serious challenges remain. Building simulated environments often requires substantial human labor, and existing training methods are not data-efficient enough. We're laser-focused on solving these problems and making self-improvement a reality for agent developers.

In startup terms, this is a classic hard-tech bet. Our roadmap involves substantial technical risk; there are still major technical problems we're facing without a proven solution. However, there is very little market risk. We've worked closely with the teams building agents at many of the top AI-native startups as well as large enterprises. If we can build this, everyone will want it. A self-improving agent that learns from experience the way a human employee would could quickly capture a large fraction of the total inference market, which is worth tens of billions of dollars today and will be worth hundreds of billions in a few years.

About the Role

You have trained LLMs to be SOTA on specific tasks. You have opinions on whether sequence-level or token-level importance ratios are more effective. You probably shared the ScaleRL paper in your group chats, and kicked off a few ablations after you read it.

This is an applied research role. You will be expected to generate and investigate research ideas towards solving the remaining obstacles to continuous learning in production. You will work with the broader OpenPipe team to validate these research directions across real customer tasks. We are very GPU rich and are ready to direct an enormous amount of compute at this effort.

Beyond your role's specific qualifications, we're looking for strong engineers with great taste. The most important qualification by far is that you learn fast and can ship. This role will inevitably involve a lot of learning on the job; we're building this airplane as we fly it. Engineers on our team touch everything from CUDA kernels to high-performance LLM tracing dashboards, and you will have an opportunity to touch many parts of this stack.

Although we operate as part of a larger company, the OpenPipe team is small, has a large degree of autonomy, and drives its own roadmap and priorities. This is an excellent role for someone looking to found their own company in the future.
Required Qualifications
  • Bachelor's, Master's, or PhD degree in Computer Science, Machine Learning, Robotics, or a related field
  • 5+ years of experience in machine learning with a strong focus on reinforcement learning, or a PhD plus 2 years of experience
  • Strong programming skills in Python and experience with ML frameworks (PyTorch, TensorFlow, or JAX)
  • Strong understanding of RL fundamentals: MDPs, policy optimization, value functions, exploration/exploitation trade-offs
  • Experience building and deploying ML models in production environments
  • Strong problem-solving skills and ability to work in ambiguous, research-driven environments
Preferred Qualifications
  • Publications in top-tier ML/AI conferences (NeurIPS, ICML, ICLR)
  • Familiarity with distributed training, GPU/TPU acceleration, and large-scale data pipelines
  • Knowledge of MLOps practices, CI/CD for ML, and model monitoring
  • Experience with cloud platforms (AWS, GCP, Azure)
  • Experience leading projects or small teams
Our Stack

We strive to use the best tool for the job when building and deploying our production services. Sometimes that means writing our own custom code, and often it means leaning on the work of others. As part of building Serverless RL, we depend on the following libraries and frameworks (among many others):
  • Kubernetes
  • Megatron
  • Unsloth
  • Temporal
  • Postgres
  • FastAPI
Why CoreWeave?

We work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:
  • Be Curious at Your Core
  • Act Like an Owner
  • Empower Employees
  • Deliver Best-in-Class Client Experiences
  • Achieve More Together
We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!

The base salary range for this role is $188,000 to $275,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).


What We Offer

The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.

In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:
  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

California Applicants

California Consumer Privacy Act


Equal Opportunity & Accommodations

CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on click.appcast.io.


Export Control Compliance

This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
Vacancy posted 2 days ago