Research Engineer - Reinforcement Learning
Prime-Intellect
Building Open Superintelligence Infrastructure

Prime Intellect is building the open superintelligence stack, from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full RL post-training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups, and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.

As a Research Engineer on our Reasoning team, you'll play a crucial role in shaping our technological direction, focusing on our test-time compute scaling research. If you love working with synthetic data and teaching LLMs reasoning abilities, this role is for you. For more details about the project you would be working on, check out our outlook on decentralized training in the inference-compute paradigm.

Responsibilities
- Lead and participate in novel research to build a massive-scale synthetic data generation pipeline and orchestration solution
- Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances in compute and memory optimization techniques
- Contribute to the development of our open-source libraries and frameworks for synthetic data generation and distributed RL
- Publish research at top-tier AI conferences such as ICML and NeurIPS
- Distill highly technical project outcomes into approachable technical blog posts for our customers and developers
- Stay up to date with the latest advancements in AI/ML infrastructure, tooling, and synthetic data generation research, and proactively identify opportunities to enhance our platform's capabilities and user experience

Requirements
- Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for the inference or training of large-scale AI models
- Deep expertise in distributed inference techniques and frameworks (e.g. vLLM, SGLang) for optimizing the performance and scalability of AI workloads
- Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines
- Passion for advancing the state of the art in reasoning and democratizing access to AI capabilities for researchers, developers, and businesses worldwide

If you're not familiar with these but feel that you can contribute to our mission, and you're a high-energy person, get familiar with these resources here, here and here and please reach out!

Benefits & Perks
- Cash compensation range of $150-300k, including equity incentives, aligning your success with the growth and impact of Prime Intellect
- Flexible work arrangements, with the option to work remotely or in person at our offices in San Francisco
- Visa sponsorship and relocation assistance for international candidates
- Quarterly team off-sites, hackathons, conferences, and learning opportunities
- Opportunity to work with a talented, hard-working, and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI

