Staff ML Inference Engineer — Model Efficiency (Remote)
Jaide Health
- Remote job
Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model execution. You'll work with advanced performance techniques such as GPU/CUDA optimizations and collaborate closely with modeling and systems teams. Ideal candidates will have over 5 years of experience in high-performance coding, plus strong skills in C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and inclusive work culture is celebrated. #J-18808-Ljbffr Jaide Health
- ...training and deploying frontier models for developers and... ...is a team of researchers, engineers, designers, and more, who are... ...systems and optimize audio inference serving efficiency using innovative techniques... ...Seoul and London. We embrace a remote-friendly environment, and...Remote workFull timeWork at officeFlexible hours
- ...ML Infrastructure Engineer, Model Inference As an ML Infrastructure Engineer, Model Inference at Abridge, you'll play a pivotal role in building and... ...will be instrumental in enhancing the scalability, efficiency, and performance of our AI-driven solutions. You will...Remote workHourly payFull timeFlexible hours
- Jaide Health is seeking an engineer specializing in audio machine learning... ...involves enhancing audio model serving metrics such as... ...significant experience in audio inference systems and be proficient in... ...audio. The company provides a remote-friendly work environment with...Remote job
- Cohere is seeking an engineering professional in New York to develop and optimize audio machine... ...with cross-functional teams to improve audio model metrics, addressing latency and throughput while ensuring real-time audio inference integration. The ideal candidate will...Remote job
$181.1k - $318.4k
...Staff/Sr. AI Infra Performance Engineer Scaling machine learning workloads across... ...that powers large-scale ML training and inference workloads, bringing together... ...in the ML Compute Efficiency team, you'll tackle ambiguous... .... Familiarity with model architectures and...SuggestedRelocation- ...Member of Technical Staff, Model EfficiencyWho are we?... ...team of researchers, engineers, designers, and more,... ...on building reliable ML systems and pushing the boundaries of LLM inference efficiency. We develop techniques... ..., Seoul, and London. Remote-friendly environment,...Remote workFull timeWork at officeFlexible hours
$170k - $216k
...Machine Learning Engineer, Model Optimization Waymo is an... ...) develop methods for efficiently and continuously learning... ...training and model inference through model architecture... ...~ Experience with ML frameworks like PyTorch... ...role can be performed remote, the specific salary...Remote workFull time$155.42k - $205.9k
...About the Team: The ML Inference Platform is part of... ...agnostic, reliable, and cost-efficient platform that powers... ...) machine learning models for experimental, online... ...ML Infrastructure engineer to help build and scale... ...relocation benefits. Remote/Hybrid: This role is...Remote workLocal areaWork from homeRelocationRelocation packageFlexible hours$242k - $290k
...multi-modality foundation model to drive the next... ...Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-... .... You will optimize the ML models, write custom CUDA... ...build highly concurrent inference code to ensure real-time...Remote workTemporary workRelocation package- ...automotive company seeks a Senior ML Infrastructure Engineer in Austin, Texas, to... ...backend software for ML inference workflows. The engineer will... ...ML engineers to ensure efficient model serving and lead technical... ...compensation and benefits, with a remote work option. #J-18808-...Remote work
- A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional...
$148.5k - $266.2k
...a Machine Learning Engineering Manager on the Model Delivery team within... ...will lead production ML engineering across deployment... ...person, hybrid, and remote work.... ...cost improvements for inference and serving, including... ...performance, scaling, and efficiency tradeoffs)...Remote workFor contractors- ...company is seeking a Machine Learning Engineer focused on inference and serving. In this role, you will... ...optimize systems to operationalize AI models. The ideal candidate has deep expertise... .... You will work in either a fully remote or hybrid environment. Competitive salary...Remote work
- ...Machine Learning Engineer - Inference / Serving Join to apply... ...Yobi builds foundation models of human behavior grounded... ...Databricks. Fully remote or hybrid from hubs in... ...This is an applied ML systems role—equal parts... ..., caching, and efficient feature retrieval....Remote workFull time
- ...company is seeking a Machine Learning Engineer to design and optimize systems for bringing their models to life. The role involves ensuring ML models are efficient and reliable, requiring experience... ...ML systems. This position can be remote or hybrid from several hubs. #J-18...Remote work
- PICTOR LABS INC is seeking a Senior ML Inference Engineer based in the United States to optimize and deploy production virtual staining models. This role demands deep expertise in ML inference optimization, proficiency in Python, and experience with PyTorch and NVIDIA...Remote job
$114.6k - $252.1k
...Job Title: Principal AI/ML Engineer (Large Language Model) Job Category: Science Time Type: Full time... ...to a variety of applications within remote sensing such as tasking collections,... ...~ Prompt engineering techniques / Inference time techniques (e.g. chain of thought...Remote workFull timeContract workWork experience placementLocal areaFlexible hours- ...Principal AI/ML Engineer ARKA Group L.P. ("ARKA") is an advanced technologies... ...learning, and large language models. We offer generous relocation... ...of applications within remote sensing such as tasking... ...Prompt engineering techniques / Inference time techniques (e.g. chain...Remote workTemporary workWork at officeLocal areaVisa sponsorshipRelocation packageFlexible hours
$180k - $210k
...Overview: The Principal AI/ML Engineer will support the development... ...learning, and large language models. We offer generous... ...variety of applications within remote sensing such as tasking collections... ...engineering techniques / Inference time techniques (e.g. chain of...Remote workTemporary workWork at officeLocal areaVisa sponsorshipRelocation packageFlexible hours- ...seeking a Senior Machine Learning Engineer to spearhead core machine learning models and manage data pipelines. The ideal... ...strong technical skills in ML methods, including deep learning,... ...concepts for various stakeholders. A remote work option is available. #J-18808...Remote work
$150k - $300k
...systems as part of a hybrid team. This role focuses on developing efficient architecture for serving LLMs and optimizing performance using... ...infrastructure tools. Ideal candidates will have significant experience with ML systems, ensuring robust performance and scalability. The...Remote job- A multi-model AI startup is seeking a Machine Learning Researcher to define and execute... ...environment. You'll contribute to novel ML research and help develop essential features... ...have 6+ years of experience in ML engineering, strong Python skills, and a passion for...Remote work
- Bright Vision Technologies is seeking a Model Serving Engineer to design and operate highly reliable inference platforms for large machine learning models. This remote full-time position requires strong expertise in distributed systems and performance engineering, offering...Remote jobFull time
$253.3k - $354.6k
...Ladders is seeking a Staff Machine Learning Engineer to drive AI initiatives in the Media space. This fully remote position focuses on scalable technology... ..., requiring 7+ years of ML Engineering experience.... ...AI solutions, and ensuring model performance. The role offers...Remote work$180k - $210k
...Position Overview The Principal AI/ML Engineer will support the development... ...learning, and large language models. We offer generous... ...variety of applications within remote sensing such as tasking collections... ...engineering techniques / Inference time techniques (e.g. chain of...Remote workFull timeTemporary workWork at officeLocal areaVisa sponsorshipRelocation packageFlexible hours- ...our Machine Learning and Inference Platform that powers... ...hardware, software, and models. We're looking for a strong... ...deep experience in ML serving, high-performance... ...excited to mentor engineers, innovate at scale, and... ...Fridays are flexible for remote work except for employees...Remote workWork at officeLocal areaMonday to ThursdayFlexible hours
- ...leading technology company is seeking a skilled ML Engineer responsible for developing and maintaining data pipelines for model training and evaluation. Candidates should... ...competitive compensation, the opportunity to work remotely from anywhere in the world, and access to...Remote work
$181.1k - $318.4k
...Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model Work Locations (2) Submit Resume Apple is where individual... ...and enable reliable, efficient execution of large-scale training and inference jobs. This role spans scheduling...Relocation- ...Description As a Senior/Staff Engineer on the Foundation Model Compute Infrastructure team,... ...accelerators and enable reliable, efficient execution of large-scale training and inference jobs. This role spans... ...Experience with distributed ML training or inference systems...
$100k
...this innovation. It offers ML/AI practitioners across Netflix... ...their machine learning models. As part of our mission... ...for a Machine Learning Engineer to join our team to... ...machine learning models for efficient and scalable inference. -Develop and maintain online...Hourly payFull timeImmediate startFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff ML Inference Engineer — Model Efficiency (Remote). Be the first to apply!
- staff security engineer San Francisco, CA
- assistant engineer San Francisco, CA
- engineering aide San Francisco, CA
- assistant chief engineer San Francisco, CA
- staff engineer San Francisco, CA
- technology administrator San Francisco, CA
- senior staff systems engineer San Francisco, CA
- assistant mechanical engineer San Francisco, CA
- staff data engineer San Francisco, CA
- software engineer staff San Francisco, CA

