Staff ML Inference Engineer — Model Efficiency (Remote)

Jaide Health

Remote job

Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model execution. You'll work with advanced performance techniques such as GPU/CUDA optimizations and collaborate closely with modeling and systems teams. Ideal candidates will have over 5 years of experience in high-performance coding, plus strong skills in C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and inclusive work culture is celebrated. #J-18808-Ljbffr Jaide Health

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Staff ML Inference Engineer — Model Efficiency (Remote) in San Francisco, CA vacancy

Staff ML Engineer, Generative Model Performance & Efficiency
$251k - $310k
...machine learning (ML) engineers, software engineers... ...ML algorithms, to model the real world,... ...report to a Senior Staff Engineering Manager... ...bottlenecks in training and inference performance (e.g.,... ...distillation, and efficient attention... ...can be performed remote, the specific salary...
Remote work
Full time
Waymo
Remote
2 days ago
Audio Inference Engineer — Model Efficiency (Remote-friendly)
Cohere is seeking an engineering professional in New York to develop and optimize audio machine... ...with cross-functional teams to improve audio model metrics, addressing latency and throughput while ensuring real-time audio inference integration. The ideal candidate will...
Remote job
Cohere
New York, NY
3 days ago
ML Engineer - Inference & Model Deployment
...100x better job search engine: fast, comprehensive, honest... ...looking for a founding ML engineer who can help... ...powerful AI and ML models into fast, reliable production... ...models, optimizing inference latency and throughput,... ...sure our models run efficiently in production. This is...
Suggested
Relocation package
HiringCafe
Cupertino, CA
2 days ago
Senior ML Engineer - Model Compression
$128.7k - $261.3k
...repeatable, high‑velocity model deployments through... ..., and infrastructure engineers to ship numerically robust... ..., Data Science/ML, or a closely related... ...model compression, or efficient inference. Strong proficiency in... ...performance. Location: Remote. If you live within a...
Remote work
Local area
Flexible hours
General Motors
Sunnyvale, CA
2 days ago
Member of Technical Staff, Model Efficiency
...and deploying frontier models for developers and... ...team of researchers, engineers, designers, and more,... ...on building reliable ML systems and pushing the boundaries of LLM inference efficiency. We develop techniques... ...London. We embrace a remote‑friendly environment,...
Remote work
Full time
Work at office
Flexible hours
Cohere
New York, NY
3 days ago
Senior ML Engineer - Model Efficiency & Optimization
$50k - $60k
Apex Systems is hiring a Principal Machine Learning Engineer for Model Efficiency & Optimization in Austin, Texas. This senior individual contributor role involves overseeing model optimization strategy and ensuring high-performing, production-ready models for document...
Apex Systems
Austin, TX
1 day ago
Senior Machine Learning Engineer - Model Inference
$184.7k - $324.8k
...world. In this role on ML Platform, you will help... ...learning and large language models into high-volume, low-... ...As a Software Engineer on the Apple Maps team,... ...scale, high-performance inference services that support a... ...with a strong focus on efficiency, reliability, and scalability...
Relocation
Apple
Cupertino, CA
4 days ago
ML Infrastructure Engineer - Model Inference & Scale
A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional...
Abridge
San Francisco, CA
5 days ago
Senior ML Inference Platform Engineer — Scale & Serve
...automotive company seeks a Senior ML Infrastructure Engineer in Austin, Texas, to... ...backend software for ML inference workflows. The engineer will... ...ML engineers to ensure efficient model serving and lead technical... ...compensation and benefits, with a remote work option. #J-18808-...
Remote work
General Motors
Austin, TX
5 days ago
Real-Time ML Inference Engineer for Scalable Serving
...company is seeking a Machine Learning Engineer to design and optimize systems for bringing their models to life. The role involves ensuring ML models are efficient and reliable, requiring experience... ...ML systems. This position can be remote or hybrid from several hubs. #J-18...
Remote work
Yobi
New York, NY
2 days ago
Senior ML Inference Engineer - Real-Time On-Vehicle Deployment
$128.7k - $261.3k
...professional to design and operate their ML deployment platform. The... ...be responsible for managing model deployments within the... ...of industry experience. This remote role demands extensive collaboration across teams to build efficient developer tools. The role offers...
Remote work
Dormont Manufacturing Company
Sunnyvale, CA
3 days ago
Senior ML Infrastructure Engineer, Inference Platform
$155.42k - $395.9k
...Description About the Team: The ML Inference Platform is part of... ..., reliable, and cost-efficient platform that powers... ...) machine learning models for experimental,... ...Senior ML Infrastructure engineer to help build and scale... ...relocation benefits. Remote/Hybrid: This role is based...
Remote work
Local area
Relocation
Relocation package
Flexible hours
Israelvcforum
Mountain View, CA
5 days ago
Staff ML Inference Engineer - Scalable LLM Serving (Remote)
$150k - $300k
...systems as part of a hybrid team. This role focuses on developing efficient architecture for serving LLMs and optimizing performance using... ...infrastructure tools. Ideal candidates will have significant experience with ML systems, ensuring robust performance and scalability. The...
Remote job
Prime-Intellect
San Francisco, CA
5 days ago
Remote ML Engineer - Model Compression for Low-Latency AI
Dormont Manufacturing Co is looking for a talented engineer to join their Compression and Parity team in the AV Organization. This role... ...field, along with at least three years of industry experience in model optimization. You will collaborate closely with cross-...
Remote job
Dormont Manufacturing Company
Sunnyvale, CA
1 day ago
Remote ML Deployment Engineer for AV Inference
General Motors is seeking a skilled developer to enhance our ML deployment platform for autonomous vehicles. You will... ...and operate a system that automates the transition from model training to vehicle inference, collaborating with various teams to drive high-value model...
Remote job
General Motors
Sunnyvale, CA
3 days ago
Remote Foundation Model Evaluation ML Engineer
$170k - $216k
Waymo is seeking an ML Engineer for Foundation Model Evaluation to enhance the quality and performance of AI agents. In this remote position, you will leverage cutting-edge robotics and machine learning research, collaborating across teams to integrate innovative technology...
Remote job
Full time
kozmetickesluzby.vecnakraska.sk - Jobboard
California, MO
4 days ago
ML Model Serving Engineer — High-Throughput Inference
$175k - $280k
Sesame, located in Bellevue, Washington, is seeking a talented engineer to join our team focused on revolutionizing the way computers interact... ...with humans. The role involves optimizing machine learning models for a new consumer product category, working with state-of-the-...
Sesame
Bellevue, WA
16 days ago
ML Model Serving Engineer - High-Performance Inference
$175k - $280k
...New York is seeking an expert in optimizing machine learning models to turbocharge their serving layer, integrating LLM, speech, and... ...significant experience in systems programming and performance engineering, aiming to improve high-throughput, low-latency serving. Join...
Sesame
New York, NY
15 days ago
Remote ML Engineer: AI Coding Agent & Model Evaluation Pro
$85 per hour
ChatGPT Jobs is looking for an ML Engineer (Coding Agent Experience) in Chicago, IL. The position focuses on using frontier AI coding agents for complex machine learning tasks. Candidates should have at least 2 years of experience and be familiar with various AI tools....
Remote job
Hourly pay
ChatGPT Jobs
Chicago, IL
2 days ago
Lead ML Inference Engineer, Advertising
...our Machine Learning and Inference Platform that powers... ...hardware, software, and models. We’re looking for a strong... ...deep experience in ML serving, high‑performance... ...excited to mentor engineers, innovate at scale, and... ...Fridays are flexible for remote work except for employees...
Remote work
Work at office
Local area
Monday to Thursday
Flexible hours
Roku, Inc.
Austin, TX
1 day ago
Principal Machine Learning Engineer - Model Efficiency & Optimization
# Principal Machine Learning Engineer - Model Efficiency & OptimizationApply**Job#: 3036752****Job Description:**Principal Machine Learning Engineer - Model Efficiency & Optimization**Location:** Austin, Texas (Onsite)Role OverviewWe are seeking a Principal Machine Learning...
Full time
Apex Systems
Austin, TX
1 day ago
Machine Learning Engineer for AI Model Training
$70 per hour
...dynamic network of Machine Learning Engineers and connect with top AI labs... ...Train and evaluate AI models within the field of Machine Learning... ...proficiency in Python and ML frameworks such as PyTorch or... ...Ability to work independently in a remote environment. Work Terms...
Remote work
Hourly pay
Contract work
SaidGig
United States
7 days ago
Remote ML Engineer: Data Pipelines & Model Validation
...leading technology company is seeking a skilled ML Engineer responsible for developing and maintaining data pipelines for model training and evaluation. Candidates should... ...competitive compensation, the opportunity to work remotely from anywhere in the world, and access to...
Remote job
Eqvilent
New York, NY
5 days ago
Remote ML Platform Engineer for Scalable Inference
$100k - $150k
...Vision Technologies is seeking a skilled ML Platform Engineer to design and build high-performance inference platforms for machine learning models. This role focuses on systems... ...performance engineering. This is a full-time remote position with a competitive salary between...
Remote job
Full time
Bright Vision Technologies
Norcross, GA
5 days ago
Senior ML Inference Platform Engineer (Remote)
Dormont Manufacturing Co is looking for a Senior ML Infrastructure Engineer to help build and scale robust platforms for ML inference workflows. You will collaborate with ML engineers and researchers while shaping the future of AI infrastructure at GM. The ideal candidate...
Remote job
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
Remote ML Platform Engineer — Scalable Inference
$100k - $150k
...Technologies is looking for a skilled ML Platform Engineer to design and operate high-performance inference platforms for machine learning. This is a remote, full-time position with a salary... ...collaborating with ML teams on new model releases. #J-18808-Ljbffr Bright Vision...
Remote job
Full time
Bright Vision Technologies
Plainsboro, NJ
5 days ago
Senior Staff Machine Learning Engineer, LLM/VLM Model Architecture & Optimization
$298k - $368k
...set of sensors, enabling engineers like you to: ~Develop methods for efficiently and continuously... ...world data. ~Develop models and model training at scale... ...low-latency on-device inference techniques and a deep understanding... ...role can be performed remote, the specific salary...
Remote work
Full time
Waymo
Remote
7 hours ago
Senior ML Inference Platform Engineer (Remote)
Israelvcforum is looking for a Senior ML Infrastructure Engineer in Mountain View,... ...robust platforms for ML inference workflows supporting GM’s AI... ...and researchers to implement model serving strategies and... ...skills. The role offers a remote work setup with required visits...
Remote job
Israelvcforum
Mountain View, CA
5 days ago
Senior ML Engineer
$152k - $228k
...Senior ML Engineer About Invoca Invoca is an... ...lifecycle at Invoca, from model training and fine-tuning through inference optimization and production... ...: Apply parameter-efficient fine-tuning methods (LoRA... ...Location This is a remote-first role. We are currently...
Remote work
Currently hiring
Flexible hours
Invoca
United States
2 days ago
Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model
$171.6k - $302.2k
...AI Description As a Senior/Staff Engineer on the Foundation Model Compute Infrastructure team,... ...accelerators and enable reliable, efficient execution of large-scale training and inference jobs. This role spans... ...orchestration systems for distributed ML workloads running on...
Relocation
Apple Inc.
Seattle, WA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff ML Inference Engineer — Model Efficiency (Remote). Be the first to apply!