Staff Research Engineer: AI Model Efficiency & Speed
Cohere
A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold a PhD in Machine Learning and have experience with model architecture and inference optimization. Join a diverse team committed to innovation within a collaborative and remote-friendly work culture, complete with generous benefits and vacation time. #J-18808-Ljbffr
- ...California. The Role: As a Research Engineer - Model Architectures , you will be... ...contributor to Zyphra’s AI Architecture Research Team.... ...hardware requirements to design efficient architectures for GPU... ...codebases and rapidly get up to speed and become productive Previously...SuggestedFull timeWork at officeRelocation package
- Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model execution. You'll work with advanced performance techniques such as GPU/CUDA optimizations...SuggestedRemote job
$220.8k - $298.8k
...and money by making efficiency a priority. Our... ...results. Move at Drata Speed (Precision &... ...thoughtful hybrid model because we believe... ...path to redefine how AI and General AI... ...seeking an Applied AI Engineer to drive the... ...through rigorous research, experimentation,...SuggestedFull timeWork at officeImmediate startWorldwideMonday to FridayFlexible hours- ...Francisco, California. The Role: As a Research Engineer - Language Model Pre-Training , you'll shape our... ...codebases and rapidly get up to speed Published machine learning research... ...enjoy what we do and love discussing AI Benefits and Perks: Comprehensive...SuggestedFull timeWork at officeRelocation package
$315k
...interpretable, and steerable AI systems. We want AI to... ...group of committed researchers, engineers, policy experts, and... ...'s production models undergo sophisticated... ...build, and run robust, efficient pipelines for model fine... ...Currently, we expect all staff to be in one of our offices...SuggestedWork at officeVisa sponsorshipFlexible hours- Perplexity is seeking a Research Engineering Manager to lead the team of all-star AI researchers and engineers responsible for developing the models that drive our products. Our team has developed... ...lie on the Pareto frontier of speed and quality. Work closely with engineering...
- ...Member of Technical Staff, Model EfficiencyWho are we?Our mission... ...enterprises who are building AI systems to power magical experiences... ....Cohere is a team of researchers, engineers, designers, and more, who... ...of LLM inference efficiency. We develop techniques that...Full timeWork at officeRemote workFlexible hours
- Refresh AI is seeking a Research Engineer in San Francisco to push the boundaries of benchmarking technology. You will build benchmarks that labs use for evaluating coding abilities and computer-use capability. Your role will require expertise in reinforcement learning...Full time
- A leading AI research firm in San Francisco is seeking a Research Engineer specializing in Model Architectures. You will design and rigorously test innovative model architectures, improving core modeling capabilities and collaborating closely with pre-training teams. Candidates...
- ...Francisco Bay Area Work model: On-site (some team... ...on-site) Industry: AI infrastructure /... ...that values ownership, speed, and craftsmanship.... ...partner is hiring a Research Engineer to help scale the quality... ...to improve quality and efficiency Partner with external...Remote work
$155k - $269k
...Description Waabi, founded by AI visionary Raquel Urtasun, is the... ...realistic, scalable, controllable, and efficient simulation. As a Research Engineer in the World Models team, you will develop... ...training and inference for efficiency, speed, and reliability on large-scale...Full timeWork at officeWork from homeFlexible hours- ...Francisco is seeking talented individuals for AI research roles focused on open superintelligence... ...to the development of foundational models. The ideal candidate will hold a... ...related field, possess solid software engineering skills, and have experience with large-...
$300k
...is the leading conversational AI platform empowering every... ...as a team. About the Team The Research team at Decagon innovates on... ...looking for people with strong engineering skills, writing bug-free machine... ...this role, you will Develop models for customer support tasks...Work at office- ...Us At Liquid, we’re not just building AI models—we’re redefining the architecture of... ...out of MIT, our mission is to build efficient AI systems at every scale. Our Liquid... ...Liquid team is a community of world-class engineers, researchers, and builders creating the next...
- ...Senior/Staff ML Research Engineer We're assisting a profitable Enterprise AI Customer Support startup with their search for senior/staff ML research engineers. The... ...their SF office. What You'll Do: Develop models for customer support tasks that exceed the...Work at office
- ...on the frontier of generative AI to join us in building the core... ...takes the newest open-source models (image, video, 3D, audio, multimodal... ...You've ported models from research into production and gotten the... ...natively in the ComfyUI core engine Design and build the native...
$250k - $350k
...AI is becoming vitally important in every function of our society... .... The Enterprise ML Research Lab works on the front lines... ...our enterprise clients. As a Staff Agent Post-Training MLRE, you... ...foundation healthtech search models. If you are excited about shaping...Full time- Arcada is seeking an ML Research Engineer to improve evaluation methods and understanding of AI capabilities. You will design large-scale experiments to analyze performance... ...candidate has deep expertise in evaluating models, statistical analysis, and transforming real-...
- ...that powers breakthrough AI models at leading research labs and enterprises. Since... ...intelligence. Innovation at Speed : We celebrate those who... ...Overview As an Applied Research Engineer, you will be at the... ...reliable AI training. Increase efficiency and effectiveness in AI-...Flexible hours
- ...building state-of-the-art AI systems that can write... .... We operate across research, engineering, product, and... ...reliability of AI coding models through a combination... ...ensuring these systems are efficient, cost-effective, and production... ..., balancing speed, cost, and user experience...Work at officeRelocation package
- A tech-focused company in San Francisco seeks candidates with expertise in AI simulation development. The role emphasizes optimizing training efficiency, enhancing GPU performance, and ensuring low-latency inference. Applicants should be proficient in methodologies for...
- A leading conversational AI platform in San Francisco seeks an AI/ML Engineer to build advanced systems for unprecedented performance. The ideal candidate will... .... You'll design state-of-the-art methods, develop models for customer support, and tackle complex challenges...
$350k
...interpretable, and steerable AI systems. We want... ...of committed researchers, engineers, policy experts,... ...researchers to efficiently run experiments... ...help improve our models and products... ...comfortable getting up-to-speed quickly on... ...Currently, we expect all staff to be in one of...Full timeContract workFor contractorsFor subcontractorWork at officeVisa sponsorshipFlexible hours- ...We are an applied AI lab building end-to... ...first AI software engineer, and Windsurf, an... ...former founders, and researchers from the frontier... ...moves at the speed of the infrastructure... ...utilization, and compute efficiency. Implement... ...systems for large models; comfortable owning...
$280k
...interpretable, and steerable AI systems. We want... ...of committed researchers, engineers, policy experts,... ...highly capable models helpful and... ...system that can speed up & improve alignment... ...Build tooling to efficiently evaluate the... ...Currently, we expect all staff to be in one of...Contract workFor contractorsFor subcontractorWork at officeRelocationVisa sponsorshipWork visaFlexible hours- ...the world’s leading research accelerator for frontier AI labs and a trusted... ...improve our customers' models on complex, long-... ...for Software Engineering / coding agents UI... ...to scale production efficiently. In short, you’ll... ...We work at Start‑Up Speed : We move fast, stay...For contractorsFlexible hours
$160k - $300k
Hebbia is the AI platform for investors and... ...-leading accuracy, speed, and transparency... ...deep, multi-source research. We’ve built our own... ...LLM inference engine - a distributed, asynchronous... ...with foundation model providers -... ...Ensure that systems are efficient, maintainable and well...Contract workFor contractorsFor subcontractorWork at office- A leading data and AI company is seeking a Staff Engineer to design and implement core systems for their Foundation Model Serving. The position focuses on large-scale distributed systems, optimizing GPU workloads, and collaborating across teams. Applicants should have...
$168k - $255k
...using computer vision models. Today, over 1M+... ...accelerate cancer research, improving... ...only product and engineering), so Roboflow employs... ...diverse data, run efficiently on different hardware... ...Masters / PhD in AI, machine learning,... ...productive, like high-speed internet at home...Remote workWork from homeHome officeRelocation package- ...team of scientists, ML researchers, and engineers working together to make... ...exploring the next frontier of model architectures for AI x chemistry. Operate at... ...that rewards rigor, speed, execution, and an... ...obsessive eye on fidelity, efficiency, and scale. Map strategy...Full timeTemporary work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Research Engineer: AI Model Efficiency & Speed. Be the first to apply!
- staff security engineer San Francisco, CA
- assistant engineer San Francisco, CA
- engineering aide San Francisco, CA
- assistant chief engineer San Francisco, CA
- staff engineer San Francisco, CA
- technology administrator San Francisco, CA
- senior staff systems engineer San Francisco, CA
- assistant mechanical engineer San Francisco, CA
- staff data engineer San Francisco, CA
- software engineer staff San Francisco, CA


