Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems.
We're training and deploying frontier models for enterprises who are building AI systems. We believe that our work is instrumental to the widespread adoption of AI and we are looking for folks that want to be part of that.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate about their craft.
We are a global technology company co-headquartered in Toronto and San Francisco, with key offices in London, New York City, Montreal, Seoul, Germany and Paris. Join us!
The integration team is responsible for developing and scaling machine learning algorithms and infrastructure for LLM post-training, with a focus on large-scale, distributed RL methods. We strive for excellence in both engineering and science by meticulously designing experiments and writing design docs. While tasks are assigned according to everyone's expertise, writing production code and supporting the team's research efforts is a shared, team-wide effort, shaped by individual interests and organizational needs.
In particular, this role aims to enhance the global quality of the post-training codebase by implementing new tools to ease and support research, optimizing post-training algorithms, and scaling distributed RL to unprecedented levels.
Please Note: We have offices in London, Paris, Toronto, San Francisco, and New York, but we are also remote-friendly! Applicants for this role may work anywhere between UTC−06:00 and UTC+01:00.
As a Member of Technical Staff, you will:
- Design and write high-performing and scalable software for training models.
- Develop new tools to support and accelerate research and LLM training.
- Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem.
- Craft and implement techniques to improve performance and speed up our training cycles across the SFT, offline preference, and RL regimes.
- Research, implement, and experiment with ideas on our cluster and data infrastructure.
- Collaborate, Collaborate, and Collaborate with other scientists, engineers, and teams!
You are an ideal candidate if you have:
- Extremely strong software engineering skills.
- An appreciation for test-driven development methods and clean code, and a drive to reduce technical debt at all levels.
- Proficiency in Python and related ML frameworks such as JAX, PyTorch, and/or XLA/MLIR.
- Experience using and debugging large-scale distributed training strategies (memory/speed profiling).
- [Bonus] Experience with distributed training infrastructures (Kubernetes) and associated frameworks (Ray).
- [Bonus] Hands-on experience with the post-training phase of model training, with a strong emphasis on scalability and performance.
- [Bonus] Experience in ML, LLM and RL academic research.
This role is perfect for you if you:
- Have a deep passion for quality work.
- Enjoy tuning and optimizing large language models.
- Are comfortable working with people with different levels of software engineering skills, from beginner to more advanced.
- Are comfortable diving into complex ML codebases to identify and resolve issues, ensuring the smooth operation of our systems.
- Thrive in a fast-paced, technically challenging environment, where you can contribute your innovative ideas and solutions.
How and Where We Work:
- Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon.
- For those in the office: a daily lunch program, plenty of snacks, and regular community and social events.
- For those not near an office: a co-working benefit so you can work alongside others in your city.
If any of the above doesn't line up exactly with your experience, we still encourage you to apply.
We strive to create an inclusive work environment for all; we welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider.


