Staff Research Engineer: AI Model Efficiency & Speed

Cohere

A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold a PhD in Machine Learning and have experience with model architecture and inference optimization. Join a diverse team committed to innovation within a collaborative and remote-friendly work culture, complete with generous benefits and vacation time. #J-18808-Ljbffr

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Staff Research Engineer: AI Model Efficiency & Speed in San Francisco, CA vacancy

Research Engineer - Model Architectures
...California. The Role: As a Research Engineer - Model Architectures , you will be... ...contributor to Zyphra’s AI Architecture Research Team.... ...hardware requirements to design efficient architectures for GPU... ...codebases and rapidly get up to speed and become productive Previously...
Suggested
Full time
Work at office
Relocation package
Zyphra
San Francisco, CA
a month ago
Staff ML Inference Engineer — Model Efficiency (Remote)
Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model execution. You'll work with advanced performance techniques such as GPU/CUDA optimizations...
Suggested
Remote job
Jaide Health
San Francisco, CA
1 day ago
Staff Applied Research Engineer
$220.8k - $298.8k
...and money by making efficiency a priority. Our... ...results. Move at Drata Speed (Precision &... ...thoughtful hybrid model because we believe... ...path to redefine how AI and General AI... ...seeking an Applied AI Engineer to drive the... ...through rigorous research, experimentation,...
Suggested
Full time
Work at office
Immediate start
Worldwide
Monday to Friday
Flexible hours
Drata
San Francisco, CA
5 hours ago
Research Engineer - Language Model Pre-Training
...Francisco, California. The Role: As a Research Engineer - Language Model Pre-Training , you'll shape our... ...codebases and rapidly get up to speed Published machine learning research... ...enjoy what we do and love discussing AI Benefits and Perks: Comprehensive...
Suggested
Full time
Work at office
Relocation package
Zyphra
San Francisco, CA
a month ago
Research Engineer, Production Model Post-Training
$315k
...interpretable, and steerable AI systems. We want AI to... ...group of committed researchers, engineers, policy experts, and... ...'s production models undergo sophisticated... ...build, and run robust, efficient pipelines for model fine... ...Currently, we expect all staff to be in one of our offices...
Suggested
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
3 days ago
Research Engineering Manager - Model Training
Perplexity is seeking a Research Engineering Manager to lead the team of all-star AI researchers and engineers responsible for developing the models that drive our products. Our team has developed... ...lie on the Pareto frontier of speed and quality. Work closely with engineering...
Perplexity AI Inc.
San Francisco, CA
13 hours ago
Member of Technical Staff, Model Efficiency
...Member of Technical Staff, Model EfficiencyWho are we?Our mission... ...enterprises who are building AI systems to power magical experiences... ....Cohere is a team of researchers, engineers, designers, and more, who... ...of LLM inference efficiency. We develop techniques that...
Full time
Work at office
Remote work
Flexible hours
Cohere
San Francisco, CA
3 days ago
Benchmarking Research Engineer: Frontier Model Evaluations
Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology. You will build benchmarks that labs use for evaluating coding abilities and computer-use capability. Your role will require expertise in reinforcement learning...
Full time
Refresh AI
San Francisco, CA
1 day ago
Model Architecture Research Engineer — Pioneering AI
A leading AI research firm in San Francisco is seeking a Research Engineer specializing in Model Architectures. You will design and rigorously test innovative model architectures, improving core modeling capabilities and collaborating closely with pre-training teams. Candidates...
Zyphra
San Francisco, CA
2 days ago
Research Engineer
...Francisco Bay Area Work model: On-site (some team... ...on-site) Industry: AI infrastructure /... ...that values ownership, speed, and craftsmanship.... ...partner is hiring a Research Engineer to help scale the quality... ...to improve quality and efficiency Partner with external...
Remote work
talentpluto
San Francisco, CA
8 hours ago
Research Engineer, World Models
$155k - $269k
...Description Waabi, founded by AI visionary Raquel Urtasun, is the... ...realistic, scalable, controllable, and efficient simulation. As a Research Engineer in the World Models team, you will develop... ...training and inference for efficiency, speed, and reliability on large-scale...
Full time
Work at office
Work from home
Flexible hours
Waabi
San Francisco, CA
8 hours ago
Staff Research Engineer - Pre-Training for Open LLMs
...Francisco is seeking talented individuals for AI research roles focused on open superintelligence... ...to the development of foundational models. The ideal candidate will hold a... ...related field, possess solid software engineering skills, and have experience with large-...
B Capital
San Francisco, CA
4 days ago
Staff Research Engineer
$300k
...is the leading conversational AI platform empowering every... ...as a team. About the Team The Research team at Decagon innovates on... ...looking for people with strong engineering skills, writing bug-free machine... ...this role, you will Develop models for customer support tasks...
Work at office
Decagon
San Francisco, CA
4 days ago
Member of Technical Staff - ML Research Engineer, Data
...Us At Liquid, we’re not just building AI models—we’re redefining the architecture of... ...out of MIT, our mission is to build efficient AI systems at every scale. Our Liquid... ...Liquid team is a community of world-class engineers, researchers, and builders creating the next...
Liquid AI
San Francisco, CA
3 days ago
Senior/Staff ML Research Engineer
...Senior/Staff ML Research Engineer We're assisting a profitable Enterprise AI Customer Support startup with their search for senior/staff ML research engineers. The... ...their SF office. What You'll Do: Develop models for customer support tasks that exceed the...
Work at office
DRH Search
San Francisco, CA
3 days ago
Senior/Staff ML Engineer, Model Integration
...on the frontier of generative AI to join us in building the core... ...takes the newest open-source models (image, video, 3D, audio, multimodal... ...You've ported models from research into production and gotten the... ...natively in the ComfyUI core engine Design and build the native...
ComfyUI
San Francisco, CA
13 hours ago
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
$250k - $350k
...AI is becoming vitally important in every function of our society... .... The Enterprise ML Research Lab works on the front lines... ...our enterprise clients. As a Staff Agent Post-Training MLRE, you... ...foundation healthtech search models. If you are excited about shaping...
Full time
Scale AI
San Francisco, CA
20 hours ago
ML Research Engineer: Real-World Model Evaluation
Arcada is seeking an ML Research Engineer to improve evaluation methods and understanding of AI capabilities. You will design large-scale experiments to analyze performance... ...candidate has deep expertise in evaluating models, statistical analysis, and transforming real-...
Arcada
San Francisco, CA
1 day ago
Applied Research Engineer
...that powers breakthrough AI models at leading research labs and enterprises. Since... ...intelligence. Innovation at Speed : We celebrate those who... ...Overview As an Applied Research Engineer, you will be at the... ...reliable AI training. Increase efficiency and effectiveness in AI-...
Flexible hours
HRB
San Francisco, CA
13 hours ago
Research Engineer, Codex
...building state-of-the-art AI systems that can write... .... We operate across research, engineering, product, and... ...reliability of AI coding models through a combination... ...ensuring these systems are efficient, cost-effective, and production... ..., balancing speed, cost, and user experience...
Work at office
Relocation package
OpenAI
San Francisco, CA
2 days ago
Staff ML Engineer: Efficient ML & Low-Latency AI
A tech-focused company in San Francisco seeks candidates with expertise in AI simulation development. The role emphasizes optimizing training efficiency, enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient in methodologies for...
Embedding VC
San Francisco, CA
1 day ago
Staff Research Engineer, AI/ML Systems & Production
A leading conversational AI platform in San Francisco seeks an AI/ML Engineer to build advanced systems for unprecedented performance. The ideal candidate will... .... You'll design state-of-the-art methods, develop models for customer support, and tackle complex challenges...
Decagon
San Francisco, CA
4 days ago
Research Engineer / Scientist, Societal Impacts
$350k
...interpretable, and steerable AI systems. We want... ...of committed researchers, engineers, policy experts,... ...researchers to efficiently run experiments... ...help improve our models and products... ...comfortable getting up-to-speed quickly on... ...Currently, we expect all staff to be in one of...
Full time
Contract work
For contractors
For subcontractor
Work at office
Visa sponsorship
Flexible hours
Menlo Ventures
San Francisco, CA
3 days ago
Research Engineer, Infrastructure
...We are an applied AI lab building end-to... ...first AI software engineer, and Windsurf, an... ...former founders, and researchers from the frontier... ...moves at the speed of the infrastructure... ...utilization, and compute efficiency. Implement... ...systems for large models; comfortable owning...
Cognition
San Francisco, CA
4 days ago
Research Engineer / Scientist, Alignment Science
$280k
...interpretable, and steerable AI systems. We want... ...of committed researchers, engineers, policy experts,... ...highly capable models helpful and... ...system that can speed up & improve alignment... ...Build tooling to efficiently evaluate the... ...Currently, we expect all staff to be in one of...
Contract work
For contractors
For subcontractor
Work at office
Relocation
Visa sponsorship
Work visa
Flexible hours
Menlo Ventures
San Francisco, CA
13 hours ago
Principal Research Engineer - Code
...the world’s leading research accelerator for frontier AI labs and a trusted... ...improve our customers' models on complex, long-... ...for Software Engineering / coding agents UI... ...to scale production efficiently. In short, you’ll... ...We work at Start‑Up Speed : We move fast, stay...
For contractors
Flexible hours
Cerebras
San Francisco, CA
2 days ago
Applied Research Engineer (Agents)
$160k - $300k
Hebbia is the AI platform for investors and... ...-leading accuracy, speed, and transparency... ...deep, multi-source research. We’ve built our own... ...LLM inference engine - a distributed, asynchronous... ...with foundation model providers -... ...Ensure that systems are efficient, maintainable and well...
Contract work
For contractors
For subcontractor
Work at office
Hebbia
San Francisco, CA
2 days ago
Staff Backend Engineer, Foundation Model Serving
A leading data and AI company is seeking a Staff Engineer to design and implement core systems for their Foundation Model Serving. The position focuses on large-scale distributed systems, optimizing GPU workloads, and collaborating across teams. Applicants should have...
Menlo Ventures
San Francisco, CA
1 day ago
Machine Learning Research Engineer
$168k - $255k
...using computer vision models. Today, over 1M+... ...accelerate cancer research, improving... ...only product and engineering), so Roboflow employs... ...diverse data, run efficiently on different hardware... ...Masters / PhD in AI, machine learning,... ...productive, like high-speed internet at home...
Remote work
Work from home
Home office
Relocation package
Roboflow, Inc.
San Francisco, CA
2 days ago
Machine Learning Research Engineer (MLRE) - GPUs
...team of scientists, ML researchers, and engineers working together to make... ...exploring the next frontier of model architectures for AI x chemistry. Operate at... ...that rewards rigor, speed, execution, and an... ...obsessive eye on fidelity, efficiency, and scale. Map strategy...
Full time
Temporary work
Achira
San Francisco, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Research Engineer: AI Model Efficiency & Speed. Be the first to apply!