Senior ML Systems Engineer, Frameworks & Tooling
Cohere
Who are we? Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers. Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products. Join us on our mission and shape the future! We're looking for a senior engineer to help build, maintain and evolve the training framework that powers our frontier-scale language models. This role sits at the intersection of large-scale training, distributed systems, and HPC infrastructure. You will design and maintain the core components that enable fast, reliable, and scalable model training - and build the tooling that connects research ideas to thousands of GPUs. If you enjoy working across the full stack of ML systems, this role gives you the opportunity and autonomy to have massive impact. What You'll Work On
If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Full-Time Employees at Cohere enjoy these Perks: An open and inclusive culture and work environment Work closely with a team on the cutting edge of AI research Weekly lunch stipend, in-office lunches & snacks Full health and dental benefits, including a separate budget to take care of your mental health 100% Parental Leave top-up for up to 6 months Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend 6 weeks of vacation (30 working days!)
- Build and own the training framework responsible for large-scale LLM training.
- Design distributed training abstractions (data/tensor/pipeline parallelism, FSDP/ZeRO strategies, memory management, checkpointing).
- Improve training throughput and stability on multi-node clusters (e.g., GB200/300, AMD, H200/100).
- Develop and maintain tooling for monitoring, logging, debugging, and developer ergonomics.
- Collaborate closely with infra teams to ensure our cluster, container environments, and hardware configurations support high-performance training.
- Investigate and resolve performance bottlenecks across the ML systems stack.
- Build robust systems that ensure reproducible, debuggable, large-scale runs.
- Strong engineering experience in large-scale distributed training or HPC systems.
Deep familiarity with JAX internals, distributed training libraries, or custom kernels/fused ops. - Experience with multi-node cluster orchestration (Slurm, Ray, Kubernetes, or similar).
- Comfort debugging performance issues across CUDA/NCCL, networking, IO, and data pipelines.
- Experience working with containerized environments (Docker, Singularity/Apptainer).
- A track record of building tools that increase developer velocity for ML teams.
- Excellent judgment around trade-offs: performance vs complexity, research velocity vs maintainability.
- Strong collaboration skills - you'll work closely with infra, research, and deployment teams.
- Experience with training LLMs or other large transformer architectures.
- Contributions to ML frameworks (PyTorch, JAX, DeepSpeed, Megatron, xFormers, etc.).
- Familiarity with evaluation and serving frameworks (vLLM, TensorRT-LLM, custom KV caches).
- Experience with data pipeline optimization, sharded datasets, or caching strategies.
- Background in performance engineering, profiling, or low-level systems.
- You'll work on some of the most challenging and consequential ML systems problems today.
- You'll collaborate with a world-class team working fast and at scale.
- You'll have end-to-end ownership over critical components of the training stack.
- You'll shape the next generation of infrastructure for frontier-scale models.
- You'll build tools and systems that directly accelerate research and model quality.
- Build a high-performance data loading and caching pipeline.
- Implement performance profiling across the ML systems stack
- Develop internal metrics and monitoring for training runs.
- Build reproducibility and regression testing infrastructure.
- Develop a performant fault-tolerant distributed checkpointing system.
If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Full-Time Employees at Cohere enjoy these Perks: An open and inclusive culture and work environment Work closely with a team on the cutting edge of AI research Weekly lunch stipend, in-office lunches & snacks Full health and dental benefits, including a separate budget to take care of your mental health 100% Parental Leave top-up for up to 6 months Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend 6 weeks of vacation (30 working days!)
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior ML Systems Engineer, Frameworks & Tooling in Paris, NY vacancy
$115.7k - $150.5k
...is looking for an Information Systems Security Manager (ISSM) to... ...expertise in the Risk Management Framework (RMF), defense cybersecurity... ...ISSM will work closely with engineering and program leadership to ensure... ...package submission tools (e.g., eMASS) leading to successful...SuggestedTemporary workFor contractorsWork experience placementCasual workLocal areaRelocation package- ...Senior Java Developer Location: Madison Ave, NY or San Francisco, CA (Hybrid... ...above) Expertise in Spring Boot framework Strong proficiency in AWS... ...Experience with version control systems such as Git/GitLab Build tools: Maven and/or Gradle Preferred...Senior
$62.35k - $75k
...Job Description Title: School Systems Manager Supervisor: Director of School Services... ..., and school-based mental health frameworks. Key Responsibilities 1. Collaborate... ...guide decision-making 7. Apply fidelity tools and progress-monitoring strategies to ensure...Suggested$65.12k - $97.68k
...and manage consolidation sales activity using CRM and calendaring tools, including engagement status, next steps, and outreach cadence,... ...preferred, with the ability to learn and effectively use CRM systems to manage opportunities, document activity, and support sales outcomes...SeniorTemporary workWork at officeFlexible hours3 days per week- ...help build intelligent systems that improve data quality... ..., and better tools so they can focus their... ...Statistics, Mathematics, Engineering, Information Systems, or... ...predictive models or analytical frameworks that influenced... ...production-grade analytics, ML, or AI-assisted systems...Suggested
$23.51 - $28.17 per hour
...NY is seeking an experienced Senior Assembler to join the Manufacturing... ...Work from wire diagrams, engineering drawings (blueprints),... ...products. Use hand and power tools as required to complete assemblies... ...advanced technology and systems, supporting the U.S. Armed Forces...SeniorTemporary workFor contractorsWork experience placementCasual workLocal area- ...Senior Master Technician We are seeking a Senior Master Technician who is Ford-certified... ...complex mechanical issues using diagnostic tools and equipment. Provide accurate and... ...preferred. Thorough knowledge of automotive systems, mechanics, and components. Strong...SeniorLocal areaFlexible hours
- ...Description The Senior DSP will provide oversight to staff working within a 12 person or less ICF or IRA and is responsible for: Serve as a mentor for new employees and assist in on-site orientation and training. Participate in agency activities under the supervision...SeniorFlexible hoursShift work
$82.08k - $193.44k
...sustainable, more inclusive world. • Lead Agile delivery for data engineering teams involving GCP Big Query & Teradata, facilitating Scrum... ...Certifications and licenses, Relevant experience and skills, Seniority and performance, Market and business consideration, Internal...SeniorFull timeLocal area$123.3k - $221.95k
...Intelligence (AI) Security Engineer The Principal... ...securing machine learning (ML), generative artificial... ...(GenAI), and agentic systems in production, with emphasis... ...(RAG) pipelines, agent frameworks, application... ..., agent orchestration, tool calling, and multi-model...Work from homeHome office- ...experience in Desktop Support Engineer. 4-7 years of experience... ...hardware, software, operating systems, directory services, printing... ...effective management of Service Desk Tool Experience of working... ...stakeholders including users, senior management, IT teams, project...SeniorLocal areaImmediate startRemote work
- ...seeking a highly skilled and experienced Senior Accountant to join our dynamic manufacturing... .... 4. Proficient in SAP or similar ERP systems. 5. In-depth knowledge of GAAP and... ...Regarding Automated Employment Decision Tools which are available at jobot.com/legal....SeniorPermanent employmentLocal area
- ...Jack & Jill is seeking a Senior GTM Recruiter to lead a high-velocity recruitment effort amid a growth phase. The role involves owning recruitment strategy to double the Go-To-Market team across New York and San Mateo. The ideal candidate will have extensive experience...Senior
$150k - $200k
The MedElite Group is seeking a compassionate Travel Nurse Practitioner for Endocrinology to join our dedicated healthcare team. This role involves providing comprehensive care to patients with endocrine disorders, including diabetes and thyroid conditions. Candidates ...Senior$62.5k - $80k
...What’s Upstate is seeking a Technical Support Engineer II to provide advanced technical support to customers at Indium Corporation. This role demands expertise in soldering processes and requires a strong technical background along with excellent problem-solving skills...Senior- Job Title Job Description: Experience in detailed requirement gathering and creation of Business Requirement Document and Functional Requirement Document. Experience in working as an Integration Lead (Techno Functional). Experience in working on Duck Creek Product Suite...Senior
- ...A global engineering consultancy is looking for a Construction Manager to oversee construction activities on fast-paced EPC projects in the manufacturing sector. The ideal candidate will have at least 7 years of experience managing industrial engineering projects, a Bachelor...Senior
$110k - $130k
PAR Technology in New Hartford, NY, is seeking a Senior Business Analyst for Salesforce to bridge the gap between business needs and technical solutions. The role involves collaborating with product teams, gathering and documenting business requirements, analyzing data...SeniorRemote work$80.9k - $101.1k
...Statements of Work and driving the SOW through final review and Engineer Technical Review (ETR) release. Issuing RFI and RFQ packages... ...records in accordance with US Government Certified Purchasing System Requirements. Supporting proposal costing efforts for subcontract...SeniorContract workTemporary workFor contractorsWork experience placementFor subcontractorCasual workLocal area- A community service organization is looking for a Care Manager (Level 3) in Utica, NY. The role involves conducting assessments, managing care plans, and engaging with patients and their families to ensure optimum healthcare. Candidates are required to have a Bachelor'...SeniorPart time
- ...Job Description We are currently representing an exceptional Senior Tax Manager who is confidentially exploring the next step in their career. This individual brings a strong mix of technical expertise, leadership ability, and hands-on experience across both tax...SeniorImmediate start
$20 - $20.25 per hour
Hourly rate ranges from $20.00 - $20.25 per hour and is dependent upon qualifications and experience. Benefits include: Company Paid Sick Time, Paid Vacation Time, Paid Holidays, Bereavement Pay, Jury Duty Pay, Contest Prize Awards, 401K Plan with Company Match, Medical...SeniorHourly payLocal area$34.62 - $43.27 per hour
...Job Description: Saab, Inc. is seeking a Senior Depot Specialist to join our team supporting defense products and systems deployed worldwide. The Senior Depot Specialist will provide responsive support for hardware and information needs to customers and Saab colleagues...SeniorHourly payTemporary workFor contractorsWork experience placementCasual workLocal areaRemote workWorldwide$22 - $36 per hour
...Senior Quality Engineering Technician Resonetics is a global leader in advanced engineering, prototyping, product development, and micro manufacturing... ...processes, with increased ownership of quality systems, data-driven decision-making, and cross-functional support...SeniorContract workWork at office$18.65 per hour
JOB OVERVIEW: Are you looking for a career with flexibility? Are you dependable and caring? Then CareGivers is looking for someone just like you! Previous housekeeping experience preferred but not required! NOTE: Must have a reliable vehicle & valid drivers license...SeniorLocal areaFlexible hoursShift workDay shiftWeekday work$77k - $105k
...Senior Property Claims Specialist - Commercial (Inside Desk) At Utica National Insurance Group, 1,400 employees countrywide take our corporate promise to heart every day: To make people feel secure, appreciated, and respected. Utica National Insurance Group is an...SeniorFull timeWork experience placementWork at officeHome officeFlexible hours$78k - $139k
Project Superintendent Pike Construction Services is currently seeking experienced building construction Project Superintendents to join our growing team. We believe our people are the most important asset and we are committed to creating a dynamic and challenging work...SeniorFor subcontractorWork at office- ...energetic, experienced, and results-oriented Vice President, Senior Director for our Response and Recovery Programs (R&R) business... ...Experience: ~ Bachelor’s Degree in Emergency Management, Planning, Engineering/Architecture, Environmental, Finance or related degree. Master...SeniorLocal area
$89.5k - $166.9k
...Company: Marsh McLennan Agency Description: Senior Client Advisor Our not-so-secret sauce. Award-winning, inclusive, Top Workplace culture doesn't happen overnight. It's a result of hard work by extraordinary people. The industry's brightest talent...SeniorMinimum wageLocal areaNight shift- ...play as you manage the construction activities for face paced Engineer-Procure-Construct (EPC) projects in the manufacturing industry.... ...administrators on project related activities. Actively works with the Senior Construction Manager in the management of the overall project...SeniorFull timeContract workFor subcontractorWork at officeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior ML Systems Engineer, Frameworks & Tooling. Be the first to apply!
Related searches
- entry level machine learning engineer
- senior ml engineer
- data scientist machine learning engineer
- machine learning ai engineer
- lead machine learning engineer
- google ml engineer
- junior machine learning engineer
- staff machine learning engineer
- junior machine learning research engineer
- computer vision machine learning engineer



