Staff Research Engineer, Model Efficiency
Cohere
Staff Research Engineer Large Language Models (LLMs) continue to push the boundaries of what AI systems can do — but inference is still the bottleneck. The Model Efficiency team is responsible for pushing the limits of LLM inference efficiency across our foundation models. We explore and ship breakthroughs across the model execution stack, including: Model architecture and MoE routing optimization Decoding and inference-time algorithm improvements Software/hardware co-design for GPU acceleration Performance optimization without compromising model quality As a Staff Research Engineer, you will develop, prototype, and deploy techniques that materially improve how fast and efficiently our models run in production. You may be a good fit for the model efficiency team if you: Have a PhD in Machine Learning or a related field Understand LLM architecture, and how to optimize LLM inference given resource constraints Have significant experience with one or more techniques that enhance model efficiency Strong software engineering skills An appetite to work in a fast-paced high-ambiguity start-up environment Publications at top-tier conferences and venues (ICLR, ACL, NeurIPS) Passion to mentor others Full-Time Employees at Cohere Enjoy These Perks A weekly lunch stipend of $75/£75 or equivalent in your local currency for lunch. Full health and dental benefits, including a separate budget for mental health. RRSP matching, 401K, Pension Scheme. 100% Parental Leave top-up for up to 6 months, for either parent. Annual enrichment benefits: Arts & culture, fitness/wellness, quality time, and a workspace improvement credit. Education & learning stipend for conferences, courses, and coaching. 6 weeks of paid vacation (30 working days!) Budget for traveling to other offices if you are remote, plus an annual company offsite. How and Where We Work Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon. For those in the office: a daily lunch program, plenty of snacks, and regular community and social events. For those not near an office: a co-working benefit so you can work alongside others in your city. Everyone receives a $500 home office stipend to set up your workspace properly. If any of the above doesn't line up exactly with your experience, we still encourage you to apply. We strive to create an inclusive work environment for all; we welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs. We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Cohere
- ...Job We are looking for an experienced AI Model Engineer with deep expertise in kernel... ...advanced quantization techniques to improve efficiency and memory usage. Debug and optimize GPU... ...desktop and mobile devices. Collaborate with research and engineering teams to prototype,...SuggestedRemote job
- ...We’re training and deploying frontier models for developers and enterprises who are... ...for our customers. Cohere is a team of researchers, engineers, designers, and more, who are... ...pushing the boundaries of LLM inference efficiency. We develop techniques that improve how...SuggestedFull timeWork at officeRemote workFlexible hours
- Astera is seeking a Research Engineer to focus on world models and quantitative perception systems. This role entails the development of architectures for dynamic real-world systems and building models that extract actionable signals from data. The ideal candidate should...Suggested
- Twilio is seeking a Senior/Staff Applied Research Software Engineer, offering remote opportunities across the United States. This role focuses on developing innovative solutions and collaborating with engineering teams to enhance Twilio's offerings in a fast-paced environment...SuggestedRemote job
- Reddit, Inc. is seeking a Staff Research Engineer for Post-Training & Evaluation to work remotely in the United States. This role involves setting evaluation standards for machine learning models and ensuring their quality and reliability. You will work with a dedicated...SuggestedRemote work
$220.8k - $331.2k
...We are looking for an experienced AI Research Engineer to join our Duolingo Monetization team.... ...learning techniques including large language models, multi-armed bandits, and more. They... ...of high-quality, scalable, and efficient machine learning solutions. ✅ You have...Full timeWork experience placement$264.8k - $331k
...enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI... ...all of our enterprise clients. As a Staff Agent Post-Training MLRE, you will build... ...to training foundation healthtech search models. If you are excited about shaping the future...Full time- Madrona Venture Labs in New York City is looking for a Research Engineer to lead robotics efforts. The role involves designing robot learning... ...approach is essential as you partner with teams to integrate model capabilities into robotics applications. #J-18808-Ljbffr...
- Runway in New York is hiring a Research Engineer to lead robotics initiatives linked to world model development. The position entails full-stack engagement in robot learning, from data collection to physical evaluation of learned policies. Applicants should have hands-...
$211.37k - $253.64k
...trustworthy Internet. Come join us. Staff Software Engineer - Edge Systems The Edge Systems Team develops... ...The focus is on enhancing the overall efficiency of our edge platform. We are looking... ...currently embraces a largely hybrid model for most roles, allowing employees...Work at officeLocal areaRemote workFlexible hoursNight shiftWeekend work$230k - $322k
Reddit, Inc. is seeking a Staff Research Engineer for Post-training & Evaluation Science to lead the evaluation of AI models. This fully remote role demands solid experience in machine learning and model evaluation techniques. The ideal candidate will define rigorous quality...Remote work- ...world through merging art and science. We believe that world models are at the frontier of progress in artificial intelligence.... ...learned representations and real-world action. We’re looking for a Research Engineer to own the robotics vertical of our world models: taking our...Work at officeRelocation
- ...in New York is looking for a candidate who will help improve AI models through designing tasks, running experiments, and building... ...rewarding systems, and is comfortable working with Python and PyTorch in a research-focused environment. #J-18808-Ljbffr Chakra Data Warehouse
- Cohere is seeking an engineering professional in New York to develop and optimize audio machine... ...cross-functional teams to improve audio model metrics, addressing latency and... ...a vibrant workplace with a focus on AI research, full health benefits, six weeks of vacation...Remote job
$190k - $250k
...The Research Engineering team is dedicated to accelerating the velocity of machine learning research... ...for testing ideas rapidly and efficiently. Research at PDT requires significant... ...differentiating factor for PDT's business. Optimize models for inference and use in real-time...Work at officeWork visa3 days per week$190.58k
...You're an experienced engineer who combines deep technical... ...across Murmuration's research and data science... ...minutes? How do we scale model scoring from millions to... ...with an eye toward cost-efficient architectural choices;... ...care to ensure that our staff are best equipped to lead...Full timeRemote workHome officeFlexible hours$174k - $252k
...degree in Computer Science, Engineering, Computer Information Systems... ...experience in the job offered or in a Research Engineer-related occupation.... ..., and iterate on deep neural models and reinforcement learning... ...and, in order to facilitate efficient collaboration and...Full timeWork at office- ...Basis is a nonprofit applied AI research organization with two... ...first. About the Role Research Engineers in Operations at Basis build... ...productivity and operational efficiency. You will create tools that save... ...automation , including using language models (OpenAI API, Anthropic Claude...Full timeContract workWork at office
$315k
As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the models we launch to the public via Claude.AI and our... ...finetuning pipelines to efficiently train production‑scale language... ...Currently, we expect all staff to be in one of our...Work at officeHome officeVisa sponsorshipRelocation package- ...and distribution. Higharc is hiring a Research Engineer to join our Special Projects team. In this... ...residential construction, developing models that understand architectural structure... ...foundation models and parameter-efficient fine-tuning approaches (LoRA, QLoRA, adapter...Full timeTemporary workRemote workHome officeFlexible hours
$27 - $43 per hour
...talents. The intern will work in the Research Engineering (RE) group, inside the larger Informatics... ...generative AI, large language models (LLMs), and agentic frameworks to solve... ...have direct impact on improving research efficiency, data‑driven discovery, and automation...Hourly payFull timeTemporary workWork experience placementInternshipSummer internship- ## Part-Time - Graduate Research Assistant - Mechanical EngineeringApplylocations: Penn State... ...CURRENT PENN STATE EMPLOYEE (faculty, staff, technical service, or student), please... ...**The Department of Mechanical Engineering is seeking a Graduate Research Assistant...Part timeRemote work
$200k
...Optiver is a seeking a Machine Learning Research Engineer to join our team, focusing on a pivotal... ...Expertise in building deep-learning models in PyTorch, JAX, or TensorFlow Experience... ...in the safeguarding of healthy and efficient markets for everyone who participates....Work at office$15k
...market makers on the street. As a Senior Research Engineer embedded within our first quant... ...that encourages creative thinking and efficient implementation. We embrace experimentation... ...training and inference for machine learning models (Pytorch, Jax) Deploy machine learning...Local areaNight shift$315k - $340k
[Expression of Interest] Research Scientist/Engineer, Honesty About Anthropic Anthropic... ...truthfulness in language models. Your work will focus on... ...tools to help human evaluators efficiently assess model outputs for... ...: Currently, we expect all staff to be in one of our offices...Full timeWork at officeVisa sponsorshipFlexible hours$350k
...growing group of committed researchers, engineers, policy experts, and business... ...of large language models. In this role, you will work... ...infrastructure to improve efficiency and reliability Develop and... ...: Currently, we expect all staff to be in one of our offices...Work at officeVisa sponsorshipFlexible hours$16 - $22 per hour
## Part-Time Research Assistant- Industrial EngineeringApplylocations: Penn State University... ...CURRENT PENN STATE EMPLOYEE (faculty, staff, technical service, or student), please... ...**Penn State University Park Industrial Engineering professor is seeking Penn State students...Hourly payPart timeRemote work$16 - $22 per hour
Penn State University is seeking part-time Research Assistants in Industrial Engineering. You will gather research materials, process data from human subjects, conduct literature reviews, and write technical reports. This position is only for current Penn State students...Hourly payPart time$320k
...growing group of committed researchers, engineers, policy experts, and business... ...will span safety evaluations, model improvement, institutional... ...AI to support civic life and efficient and accountable government.... ...policy: Currently, we expect all staff to be in one of our offices...Full timeTemporary workWork experience placementWork at officeVisa sponsorshipFlexible hours- ...from the ground up to unlock performance and efficiency that conventional architectures can't reach. As Silicon Research Engineering Lead, You will shepherd these ideas through... ...new ideas coming out of research through modeling, simulation, prototype design, and targeted...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Research Engineer, Model Efficiency. Be the first to apply!
- research assistant engineering New York, NY
- staff security engineer New York, NY
- staff engineer New York, NY
- assistant chief engineer New York, NY
- senior staff systems engineer New York, NY
- assistant engineering manager New York, NY
- project engineer assistant project manager New York, NY
- staff automation engineer New York, NY
- engineering aide New York, NY
- software engineer staff New York, NY

