Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Research Engineer, Model Efficiency

Cohere

Staff Research Engineer Large Language Models (LLMs) continue to push the boundaries of what AI systems can do — but inference is still the bottleneck. The Model Efficiency team is responsible for pushing the limits of LLM inference efficiency across our foundation models. We explore and ship breakthroughs across the model execution stack, including: Model architecture and MoE routing optimization Decoding and inference-time algorithm improvements Software/hardware co-design for GPU acceleration Performance optimization without compromising model quality As a Staff Research Engineer, you will develop, prototype, and deploy techniques that materially improve how fast and efficiently our models run in production. You may be a good fit for the model efficiency team if you: Have a PhD in Machine Learning or a related field Understand LLM architecture, and how to optimize LLM inference given resource constraints Have significant experience with one or more techniques that enhance model efficiency Strong software engineering skills An appetite to work in a fast-paced high-ambiguity start-up environment Publications at top-tier conferences and venues (ICLR, ACL, NeurIPS) Passion to mentor others Full-Time Employees at Cohere Enjoy These Perks A weekly lunch stipend of $75/£75 or equivalent in your local currency for lunch. Full health and dental benefits, including a separate budget for mental health. RRSP matching, 401K, Pension Scheme. 100% Parental Leave top-up for up to 6 months, for either parent. Annual enrichment benefits: Arts & culture, fitness/wellness, quality time, and a workspace improvement credit. Education & learning stipend for conferences, courses, and coaching. 6 weeks of paid vacation (30 working days!) Budget for traveling to other offices if you are remote, plus an annual company offsite. How and Where We Work Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon. For those in the office: a daily lunch program, plenty of snacks, and regular community and social events. For those not near an office: a co-working benefit so you can work alongside others in your city. Everyone receives a $500 home office stipend to set up your workspace properly. If any of the above doesn't line up exactly with your experience, we still encourage you to apply. We strive to create an inclusive work environment for all; we welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs. We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Cohere

Vacancy posted 21 hours ago
Similar jobs that could be interesting for youBased on the Staff Research Engineer, Model Efficiency in New York, NY vacancy
  •  ...Job We are looking for an experienced AI Model Engineer with deep expertise in kernel...  ...advanced quantization techniques to improve efficiency and memory usage. Debug and optimize GPU...  ...desktop and mobile devices. Collaborate with research and engineering teams to prototype,... 
    Suggested
    Remote job

    Framework Ventures

    New York, NY
    5 days ago
  •  ...We’re training and deploying frontier models for developers and enterprises who are...  ...for our customers. Cohere is a team of researchers, engineers, designers, and more, who are...  ...pushing the boundaries of LLM inference efficiency. We develop techniques that improve how... 
    Suggested
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    3 days ago
  • Astera is seeking a Research Engineer to focus on world models and quantitative perception systems. This role entails the development of architectures for dynamic real-world systems and building models that extract actionable signals from data. The ideal candidate should... 
    Suggested

    Austin Community College

    New York, NY
    2 days ago
  • Twilio is seeking a Senior/Staff Applied Research Software Engineer, offering remote opportunities across the United States. This role focuses on developing innovative solutions and collaborating with engineering teams to enhance Twilio's offerings in a fast-paced environment... 
    Suggested
    Remote job

    Twilio

    New York, NY
    5 days ago
  • Reddit, Inc. is seeking a Staff Research Engineer for Post-Training & Evaluation to work remotely in the United States. This role involves setting evaluation standards for machine learning models and ensuring their quality and reliability. You will work with a dedicated... 
    Suggested
    Remote work

    Reddit, Inc.

    New York, NY
    4 days ago
  • $220.8k - $331.2k

     ...We are looking for an experienced AI Research Engineer to join our Duolingo Monetization team....  ...learning techniques including large language models, multi-armed bandits, and more. They...  ...of high-quality, scalable, and efficient machine learning solutions. ✅ You have... 
    Full time
    Work experience placement

    Duolingo

    New York, NY
    2 days ago
  • $264.8k - $331k

     ...enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI...  ...all of our enterprise clients. As a Staff Agent Post-Training MLRE, you will build...  ...to training foundation healthtech search models. If you are excited about shaping the future... 
    Full time

    Scale AI

    New York, NY
    4 days ago
  • Madrona Venture Labs in New York City is looking for a Research Engineer to lead robotics efforts. The role involves designing robot learning...  ...approach is essential as you partner with teams to integrate model capabilities into robotics applications. #J-18808-Ljbffr... 

    Madrona Venture Labs

    New York, NY
    2 days ago
  • Runway in New York is hiring a Research Engineer to lead robotics initiatives linked to world model development. The position entails full-stack engagement in robot learning, from data collection to physical evaluation of learned policies. Applicants should have hands-... 

    runwayml.com

    New York, NY
    4 days ago
  • $211.37k - $253.64k

     ...trustworthy Internet. Come join us. Staff Software Engineer - Edge Systems The Edge Systems Team develops...  ...The focus is on enhancing the overall efficiency of our edge platform. We are looking...  ...currently embraces a largely hybrid model for most roles, allowing employees... 
    Work at office
    Local area
    Remote work
    Flexible hours
    Night shift
    Weekend work

    I did my part and supported the Regular Toilet

    New York, NY
    5 days ago
  • $230k - $322k

    Reddit, Inc. is seeking a Staff Research Engineer for Post-training & Evaluation Science to lead the evaluation of AI models. This fully remote role demands solid experience in machine learning and model evaluation techniques. The ideal candidate will define rigorous quality... 
    Remote work

    Reddit, Inc.

    New York, NY
    5 days ago
  •  ...world through merging art and science. We believe that world models are at the frontier of progress in artificial intelligence....  ...learned representations and real-world action. We’re looking for a Research Engineer to own the robotics vertical of our world models: taking our... 
    Work at office
    Relocation

    Madrona Venture Labs

    New York, NY
    4 days ago
  •  ...in New York is looking for a candidate who will help improve AI models through designing tasks, running experiments, and building...  ...rewarding systems, and is comfortable working with Python and PyTorch in a research-focused environment. #J-18808-Ljbffr Chakra Data Warehouse

    Chakra Data Warehouse

    New York, NY
    4 days ago
  • Cohere is seeking an engineering professional in New York to develop and optimize audio machine...  ...cross-functional teams to improve audio model metrics, addressing latency and...  ...a vibrant workplace with a focus on AI research, full health benefits, six weeks of vacation... 
    Remote job

    Cohere

    New York, NY
    3 days ago
  • $190k - $250k

     ...The Research Engineering team is dedicated to accelerating the velocity of machine learning research...  ...for testing ideas rapidly and efficiently. Research at PDT requires significant...  ...differentiating factor for PDT's business. Optimize models for inference and use in real-time... 
    Work at office
    Work visa
    3 days per week

    PDT Partners

    New York, NY
    1 day ago
  • $190.58k

     ...You're an experienced engineer who combines deep technical...  ...across Murmuration's research and data science...  ...minutes? How do we scale model scoring from millions to...  ...with an eye toward cost-efficient architectural choices;...  ...care to ensure that our staff are best equipped to lead... 
    Full time
    Remote work
    Home office
    Flexible hours

    murmuration

    New York, NY
    5 days ago
  • $174k - $252k

     ...degree in Computer Science, Engineering, Computer Information Systems...  ...experience in the job offered or in a Research Engineer-related occupation....  ..., and iterate on deep neural models and reinforcement learning...  ...and, in order to facilitate efficient collaboration and... 
    Full time
    Work at office

    Google Inc.

    New York, NY
    3 days ago
  •  ...Basis is a nonprofit applied AI research organization with two...  ...first. About the Role Research Engineers in Operations at Basis build...  ...productivity and operational efficiency. You will create tools that save...  ...automation , including using language models (OpenAI API, Anthropic Claude... 
    Full time
    Contract work
    Work at office

    Basis Research Institute

    New York, NY
    2 days ago
  • $315k

    As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the models we launch to the public via Claude.AI and our...  ...finetuning pipelines to efficiently train production‑scale language...  ...Currently, we expect all staff to be in one of our... 
    Work at office
    Home office
    Visa sponsorship
    Relocation package

    Anthropic

    New York, NY
    3 days ago
  •  ...and distribution. Higharc is hiring a Research Engineer to join our Special Projects team. In this...  ...residential construction, developing models that understand architectural structure...  ...foundation models and parameter-efficient fine-tuning approaches (LoRA, QLoRA, adapter... 
    Full time
    Temporary work
    Remote work
    Home office
    Flexible hours

    Higharc Inc.

    New York, NY
    5 days ago
  • $27 - $43 per hour

     ...talents. The intern will work in the Research Engineering (RE) group, inside the larger Informatics...  ...generative AI, large language models (LLMs), and agentic frameworks to solve...  ...have direct impact on improving research efficiency, data‑driven discovery, and automation... 
    Hourly pay
    Full time
    Temporary work
    Work experience placement
    Internship
    Summer internship

    Feedinkoo

    New York, NY
    3 days ago
  • ## Part-Time - Graduate Research Assistant - Mechanical EngineeringApplylocations: Penn State...  ...CURRENT PENN STATE EMPLOYEE (faculty, staff, technical service, or student), please...  ...**The Department of Mechanical Engineering is seeking a Graduate Research Assistant... 
    Part time
    Remote work

    Penn State University

    New York, NY
    5 days ago
  • $200k

     ...Optiver is a seeking a Machine Learning Research Engineer to join our team, focusing on a pivotal...  ...Expertise in building deep-learning models in PyTorch, JAX, or TensorFlow Experience...  ...in the safeguarding of healthy and efficient markets for everyone who participates.... 
    Work at office

    Optiver

    New York, NY
    2 days ago
  • $15k

     ...market makers on the street. As a Senior Research Engineer embedded within our first quant...  ...that encourages creative thinking and efficient implementation. We embrace experimentation...  ...training and inference for machine learning models (Pytorch, Jax) Deploy machine learning... 
    Local area
    Night shift

    The-Voleon-Group

    New York, NY
    5 days ago
  • $315k - $340k

    [Expression of Interest] Research Scientist/Engineer, Honesty About Anthropic Anthropic...  ...truthfulness in language models. Your work will focus on...  ...tools to help human evaluators efficiently assess model outputs for...  ...: Currently, we expect all staff to be in one of our offices... 
    Full time
    Work at office
    Visa sponsorship
    Flexible hours

    Aisafety

    New York, NY
    1 day ago
  • $350k

     ...growing group of committed researchers, engineers, policy experts, and business...  ...of large language models. In this role, you will work...  ...infrastructure to improve efficiency and reliability Develop and...  ...: Currently, we expect all staff to be in one of our offices... 
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    New York, NY
    1 day ago
  • $16 - $22 per hour

    ## Part-Time Research Assistant- Industrial EngineeringApplylocations: Penn State University...  ...CURRENT PENN STATE EMPLOYEE (faculty, staff, technical service, or student), please...  ...**Penn State University Park Industrial Engineering professor is seeking Penn State students... 
    Hourly pay
    Part time
    Remote work

    Penn State University

    New York, NY
    1 day ago
  • $16 - $22 per hour

    Penn State University is seeking part-time Research Assistants in Industrial Engineering. You will gather research materials, process data from human subjects, conduct literature reviews, and write technical reports. This position is only for current Penn State students... 
    Hourly pay
    Part time

    Penn State University

    New York, NY
    5 days ago
  • $320k

     ...growing group of committed researchers, engineers, policy experts, and business...  ...will span safety evaluations, model improvement, institutional...  ...AI to support civic life and efficient and accountable government....  ...policy: Currently, we expect all staff to be in one of our offices... 
    Full time
    Temporary work
    Work experience placement
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    New York, NY
    4 days ago
  •  ...from the ground up to unlock performance and efficiency that conventional architectures can't reach. As Silicon Research Engineering Lead, You will shepherd these ideas through...  ...new ideas coming out of research through modeling, simulation, prototype design, and targeted... 

    Normal Computing Corporation

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Research Engineer, Model Efficiency. Be the first to apply!