Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Research Engineer: AI Model Efficiency & Speed

Cohere

A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold a PhD in Machine Learning and have experience with model architecture and inference optimization. Join a diverse team committed to innovation within a collaborative and remote-friendly work culture, complete with generous benefits and vacation time. #J-18808-Ljbffr

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Staff Research Engineer: AI Model Efficiency & Speed in San Francisco, CA vacancy
  •  ...California. The Role: As a Research Engineer - Model Architectures , you will be...  ...contributor to Zyphra’s AI Architecture Research Team....  ...hardware requirements to design efficient architectures for GPU...  ...codebases and rapidly get up to speed and become productive Previously... 
    Suggested
    Full time
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    a month ago
  • Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model execution. You'll work with advanced performance techniques such as GPU/CUDA optimizations... 
    Suggested
    Remote job

    Jaide Health

    San Francisco, CA
    1 day ago
  • $220.8k - $298.8k

     ...and money by making efficiency a priority. Our...  ...results. Move at Drata Speed (Precision &...  ...thoughtful hybrid model because we believe...  ...path to redefine how AI and General AI...  ...seeking an Applied AI Engineer to drive the...  ...through rigorous research, experimentation,... 
    Suggested
    Full time
    Work at office
    Immediate start
    Worldwide
    Monday to Friday
    Flexible hours

    Drata

    San Francisco, CA
    5 hours ago
  •  ...Francisco, California. The Role: As a Research Engineer - Language Model Pre-Training , you'll shape our...  ...codebases and rapidly get up to speed Published machine learning research...  ...enjoy what we do and love discussing AI Benefits and Perks: Comprehensive... 
    Suggested
    Full time
    Work at office
    Relocation package

    Zyphra

    San Francisco, CA
    a month ago
  • $315k

     ...interpretable, and steerable AI systems. We want AI to...  ...group of committed researchers, engineers, policy experts, and...  ...'s production models undergo sophisticated...  ...build, and run robust, efficient pipelines for model fine...  ...Currently, we expect all staff to be in one of our offices... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    3 days ago
  • Perplexity is seeking a Research Engineering Manager to lead the team of all-star AI researchers and engineers responsible for developing the models that drive our products. Our team has developed...  ...lie on the Pareto frontier of speed and quality. Work closely with engineering... 

    Perplexity AI Inc.

    San Francisco, CA
    13 hours ago
  •  ...Member of Technical Staff, Model EfficiencyWho are we?Our mission...  ...enterprises who are building AI systems to power magical experiences...  ....Cohere is a team of researchers, engineers, designers, and more, who...  ...of LLM inference efficiency. We develop techniques that... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    3 days ago
  • Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology. You will build benchmarks that labs use for evaluating coding abilities and computer-use capability. Your role will require expertise in reinforcement learning... 
    Full time

    Refresh AI

    San Francisco, CA
    1 day ago
  • A leading AI research firm in San Francisco is seeking a Research Engineer specializing in Model Architectures. You will design and rigorously test innovative model architectures, improving core modeling capabilities and collaborating closely with pre-training teams. Candidates... 

    Zyphra

    San Francisco, CA
    2 days ago
  •  ...Francisco Bay Area Work model: On-site (some team...  ...on-site) Industry: AI infrastructure /...  ...that values ownership, speed, and craftsmanship....  ...partner is hiring a Research Engineer to help scale the quality...  ...to improve quality and efficiency Partner with external... 
    Remote work

    talentpluto

    San Francisco, CA
    8 hours ago
  • $155k - $269k

     ...Description Waabi, founded by AI visionary Raquel Urtasun, is the...  ...realistic, scalable, controllable, and efficient simulation. As a Research Engineer in the World Models team, you will develop...  ...training and inference for efficiency, speed, and reliability on large-scale... 
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi

    San Francisco, CA
    8 hours ago
  •  ...Francisco is seeking talented individuals for AI research roles focused on open superintelligence...  ...to the development of foundational models. The ideal candidate will hold a...  ...related field, possess solid software engineering skills, and have experience with large-... 

    B Capital

    San Francisco, CA
    4 days ago
  • $300k

     ...is the leading conversational AI platform empowering every...  ...as a team. About the Team The Research team at Decagon innovates on...  ...looking for people with strong engineering skills, writing bug-free machine...  ...this role, you will Develop models for customer support tasks... 
    Work at office

    Decagon

    San Francisco, CA
    4 days ago
  •  ...Us At Liquid, we’re not just building AI models—we’re redefining the architecture of...  ...out of MIT, our mission is to build efficient AI systems at every scale. Our Liquid...  ...Liquid team is a community of world-class engineers, researchers, and builders creating the next... 

    Liquid AI

    San Francisco, CA
    3 days ago
  •  ...Senior/Staff ML Research Engineer We're assisting a profitable Enterprise AI Customer Support startup with their search for senior/staff ML research engineers. The...  ...their SF office. What You'll Do: Develop models for customer support tasks that exceed the... 
    Work at office

    DRH Search

    San Francisco, CA
    3 days ago
  •  ...on the frontier of generative AI to join us in building the core...  ...takes the newest open-source models (image, video, 3D, audio, multimodal...  ...You've ported models from research into production and gotten the...  ...natively in the ComfyUI core engine Design and build the native... 

    ComfyUI

    San Francisco, CA
    13 hours ago
  • $250k - $350k

     ...AI is becoming vitally important in every function of our society...  .... The Enterprise ML Research Lab works on the front lines...  ...our enterprise clients. As a Staff Agent Post-Training MLRE, you...  ...foundation healthtech search models. If you are excited about shaping... 
    Full time

    Scale AI

    San Francisco, CA
    20 hours ago
  • Arcada is seeking an ML Research Engineer to improve evaluation methods and understanding of AI capabilities. You will design large-scale experiments to analyze performance...  ...candidate has deep expertise in evaluating models, statistical analysis, and transforming real-... 

    Arcada

    San Francisco, CA
    1 day ago
  •  ...that powers breakthrough AI models at leading research labs and enterprises. Since...  ...intelligence. Innovation at Speed : We celebrate those who...  ...Overview As an Applied Research Engineer, you will be at the...  ...reliable AI training. Increase efficiency and effectiveness in AI-... 
    Flexible hours

    HRB

    San Francisco, CA
    13 hours ago
  •  ...building state-of-the-art AI systems that can write...  .... We operate across research, engineering, product, and...  ...reliability of AI coding models through a combination...  ...ensuring these systems are efficient, cost-effective, and production...  ..., balancing speed, cost, and user experience... 
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    2 days ago
  • A tech-focused company in San Francisco seeks candidates with expertise in AI simulation development. The role emphasizes optimizing training efficiency, enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient in methodologies for... 

    Embedding VC

    San Francisco, CA
    1 day ago
  • A leading conversational AI platform in San Francisco seeks an AI/ML Engineer to build advanced systems for unprecedented performance. The ideal candidate will...  .... You'll design state-of-the-art methods, develop models for customer support, and tackle complex challenges... 

    Decagon

    San Francisco, CA
    4 days ago
  • $350k

     ...interpretable, and steerable AI systems. We want...  ...of committed researchers, engineers, policy experts,...  ...researchers to efficiently run experiments...  ...help improve our models and products...  ...comfortable getting up-to-speed quickly on...  ...Currently, we expect all staff to be in one of... 
    Full time
    Contract work
    For contractors
    For subcontractor
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    3 days ago
  •  ...We are an applied AI lab building end-to...  ...first AI software engineer, and Windsurf, an...  ...former founders, and researchers from the frontier...  ...moves at the speed of the infrastructure...  ...utilization, and compute efficiency. Implement...  ...systems for large models; comfortable owning... 

    Cognition

    San Francisco, CA
    4 days ago
  • $280k

     ...interpretable, and steerable AI systems. We want...  ...of committed researchers, engineers, policy experts,...  ...highly capable models helpful and...  ...system that can speed up & improve alignment...  ...Build tooling to efficiently evaluate the...  ...Currently, we expect all staff to be in one of... 
    Contract work
    For contractors
    For subcontractor
    Work at office
    Relocation
    Visa sponsorship
    Work visa
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    13 hours ago
  •  ...the world’s leading research accelerator for frontier AI labs and a trusted...  ...improve our customers' models on complex, long-...  ...for Software Engineering / coding agents UI...  ...to scale production efficiently. In short, you’ll...  ...We work at Start‑Up Speed : We move fast, stay... 
    For contractors
    Flexible hours

    Cerebras

    San Francisco, CA
    2 days ago
  • $160k - $300k

    Hebbia is the AI platform for investors and...  ...-leading accuracy, speed, and transparency...  ...deep, multi-source research. We’ve built our own...  ...LLM inference engine - a distributed, asynchronous...  ...with foundation model providers -...  ...Ensure that systems are efficient, maintainable and well... 
    Contract work
    For contractors
    For subcontractor
    Work at office

    Hebbia

    San Francisco, CA
    2 days ago
  • A leading data and AI company is seeking a Staff Engineer to design and implement core systems for their Foundation Model Serving. The position focuses on large-scale distributed systems, optimizing GPU workloads, and collaborating across teams. Applicants should have... 

    Menlo Ventures

    San Francisco, CA
    1 day ago
  • $168k - $255k

     ...using computer vision models. Today, over 1M+...  ...accelerate cancer research, improving...  ...only product and engineering), so Roboflow employs...  ...diverse data, run efficiently on different hardware...  ...Masters / PhD in AI, machine learning,...  ...productive, like high-speed internet at home... 
    Remote work
    Work from home
    Home office
    Relocation package

    Roboflow, Inc.

    San Francisco, CA
    2 days ago
  •  ...team of scientists, ML researchers, and engineers working together to make...  ...exploring the next frontier of model architectures for AI x chemistry. Operate at...  ...that rewards rigor, speed, execution, and an...  ...obsessive eye on fidelity, efficiency, and scale. Map strategy... 
    Full time
    Temporary work

    Achira

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Research Engineer: AI Model Efficiency & Speed. Be the first to apply!