Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff ML Inference Engineer — Model Efficiency (Remote)

Jaide Health

Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model execution. You'll work with advanced performance techniques such as GPU/CUDA optimizations and collaborate closely with modeling and systems teams. Ideal candidates will have over 5 years of experience in high-performance coding, plus strong skills in C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and inclusive work culture is celebrated. #J-18808-Ljbffr Jaide Health

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Staff ML Inference Engineer — Model Efficiency (Remote) in San Francisco, CA vacancy
  •  ...training and deploying frontier models for developers and...  ...is a team of researchers, engineers, designers, and more, who are...  ...systems and optimize audio inference serving efficiency using innovative techniques...  ...Seoul and London. We embrace a remote-friendly environment, and... 
    Remote work
    Full time
    Work at office
    Flexible hours

    Cohere

    United States
    2 days ago
  •  ...ML Infrastructure Engineer, Model Inference As an ML Infrastructure Engineer, Model Inference at Abridge, you'll play a pivotal role in building and...  ...will be instrumental in enhancing the scalability, efficiency, and performance of our AI-driven solutions. You will... 
    Remote work
    Hourly pay
    Full time
    Flexible hours

    Abridge

    United States
    4 days ago
  • Jaide Health is seeking an engineer specializing in audio machine learning...  ...involves enhancing audio model serving metrics such as...  ...significant experience in audio inference systems and be proficient in...  ...audio. The company provides a remote-friendly work environment with... 
    Remote job

    Jaide Health

    San Francisco, CA
    4 days ago
  • Cohere is seeking an engineering professional in New York to develop and optimize audio machine...  ...with cross-functional teams to improve audio model metrics, addressing latency and throughput while ensuring real-time audio inference integration. The ideal candidate will... 
    Remote job

    Cohere

    New York, NY
    1 day ago
  • $181.1k - $318.4k

     ...Staff/Sr. AI Infra Performance Engineer Scaling machine learning workloads across...  ...that powers large-scale ML training and inference workloads, bringing together...  ...in the ML Compute Efficiency team, you'll tackle ambiguous...  .... Familiarity with model architectures and... 
    Suggested
    Relocation

    Apple

    Santa Clara, CA
    1 day ago
  •  ...Member of Technical Staff, Model EfficiencyWho are we?...  ...team of researchers, engineers, designers, and more,...  ...on building reliable ML systems and pushing the boundaries of LLM inference efficiency. We develop techniques...  ..., Seoul, and London. Remote-friendly environment,... 
    Remote work
    Full time
    Work at office
    Flexible hours

    Cohere

    San Francisco, CA
    12 hours ago
  • $170k - $216k

     ...Machine Learning Engineer, Model Optimization Waymo is an...  ...) develop methods for efficiently and continuously learning...  ...training and model inference through model architecture...  ...~ Experience with ML frameworks like PyTorch...  ...role can be performed remote, the specific salary... 
    Remote work
    Full time

    Waymo

    San Francisco, CA
    3 days ago
  • $155.42k - $205.9k

     ...About the Team: The ML Inference Platform is part of...  ...agnostic, reliable, and cost-efficient platform that powers...  ...) machine learning models for experimental, online...  ...ML Infrastructure engineer to help build and scale...  ...relocation benefits. Remote/Hybrid: This role is... 
    Remote work
    Local area
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Warren, MI
    12 hours ago
  • $242k - $290k

     ...multi-modality foundation model to drive the next...  ...Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-...  .... You will optimize the ML models, write custom CUDA...  ...build highly concurrent inference code to ensure real-time... 
    Remote work
    Temporary work
    Relocation package

    Zoox

    San Diego, CA
    2 days ago
  •  ...automotive company seeks a Senior ML Infrastructure Engineer in Austin, Texas, to...  ...backend software for ML inference workflows. The engineer will...  ...ML engineers to ensure efficient model serving and lead technical...  ...compensation and benefits, with a remote work option. #J-18808-... 
    Remote work

    General Motors

    Austin, TX
    12 hours ago
  • A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional... 

    Abridge

    San Francisco, CA
    3 days ago
  • $148.5k - $266.2k

     ...a Machine Learning Engineering Manager on the Model Delivery team within...  ...will lead production ML engineering across deployment...  ...person, hybrid, and remote work....  ...cost improvements for inference and serving, including...  ...performance, scaling, and efficiency tradeoffs)... 
    Remote work
    For contractors

    Autodesk

    Portland, OR
    3 days ago
  •  ...company is seeking a Machine Learning Engineer focused on inference and serving. In this role, you will...  ...optimize systems to operationalize AI models. The ideal candidate has deep expertise...  .... You will work in either a fully remote or hybrid environment. Competitive salary... 
    Remote work

    Yobi AI

    New York, NY
    12 hours ago
  •  ...Machine Learning Engineer - Inference / Serving Join to apply...  ...Yobi builds foundation models of human behavior grounded...  ...Databricks. Fully remote or hybrid from hubs in...  ...This is an applied ML systems role—equal parts...  ..., caching, and efficient feature retrieval.... 
    Remote work
    Full time

    Yobi AI

    New York, NY
    12 hours ago
  •  ...company is seeking a Machine Learning Engineer to design and optimize systems for bringing their models to life. The role involves ensuring ML models are efficient and reliable, requiring experience...  ...ML systems. This position can be remote or hybrid from several hubs. #J-18... 
    Remote work

    Yobi

    New York, NY
    12 hours ago
  • PICTOR LABS INC is seeking a Senior ML Inference Engineer based in the United States to optimize and deploy production virtual staining models. This role demands deep expertise in ML inference optimization, proficiency in Python, and experience with PyTorch and NVIDIA... 
    Remote job

    PICTOR LABS INC

    California, MO
    1 day ago
  • $114.6k - $252.1k

     ...Job Title: Principal AI/ML Engineer (Large Language Model) Job Category: Science Time Type: Full time...  ...to a variety of applications within remote sensing such as tasking collections,...  ...~ Prompt engineering techniques / Inference time techniques (e.g. chain of thought... 
    Remote work
    Full time
    Contract work
    Work experience placement
    Local area
    Flexible hours

    CACI International

    Aurora, CO
    4 days ago
  •  ...Principal AI/ML Engineer ARKA Group L.P. ("ARKA") is an advanced technologies...  ...learning, and large language models. We offer generous relocation...  ...of applications within remote sensing such as tasking...  ...Prompt engineering techniques / Inference time techniques (e.g. chain... 
    Remote work
    Temporary work
    Work at office
    Local area
    Visa sponsorship
    Relocation package
    Flexible hours

    Navstar

    King of Prussia, PA
    2 days ago
  • $180k - $210k

     ...Overview: The Principal AI/ML Engineer will support the development...  ...learning, and large language models. We offer generous...  ...variety of applications within remote sensing such as tasking collections...  ...engineering techniques / Inference time techniques (e.g. chain of... 
    Remote work
    Temporary work
    Work at office
    Local area
    Visa sponsorship
    Relocation package
    Flexible hours

    ARKA Group

    Aurora, CO
    1 day ago
  •  ...seeking a Senior Machine Learning Engineer to spearhead core machine learning models and manage data pipelines. The ideal...  ...strong technical skills in ML methods, including deep learning,...  ...concepts for various stakeholders. A remote work option is available. #J-18808... 
    Remote work

    Thatgamecompany

    Los Angeles, CA
    12 hours ago
  • $150k - $300k

     ...systems as part of a hybrid team. This role focuses on developing efficient architecture for serving LLMs and optimizing performance using...  ...infrastructure tools. Ideal candidates will have significant experience with ML systems, ensuring robust performance and scalability. The... 
    Remote job

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • A multi-model AI startup is seeking a Machine Learning Researcher to define and execute...  ...environment. You'll contribute to novel ML research and help develop essential features...  ...have 6+ years of experience in ML engineering, strong Python skills, and a passion for... 
    Remote work

    Myriad Venture Partners, LLC

    New York, NY
    12 hours ago
  • Bright Vision Technologies is seeking a Model Serving Engineer to design and operate highly reliable inference platforms for large machine learning models. This remote full-time position requires strong expertise in distributed systems and performance engineering, offering... 
    Remote job
    Full time

    Bright Vision Technologies

    Bellevue, WA
    2 days ago
  • $253.3k - $354.6k

     ...Ladders is seeking a Staff Machine Learning Engineer to drive AI initiatives in the Media space. This fully remote position focuses on scalable technology...  ..., requiring 7+ years of ML Engineering experience....  ...AI solutions, and ensuring model performance. The role offers... 
    Remote work

    Ladders

    New York, NY
    3 days ago
  • $180k - $210k

     ...Position Overview The Principal AI/ML Engineer will support the development...  ...learning, and large language models. We offer generous...  ...variety of applications within remote sensing such as tasking collections...  ...engineering techniques / Inference time techniques (e.g. chain of... 
    Remote work
    Full time
    Temporary work
    Work at office
    Local area
    Visa sponsorship
    Relocation package
    Flexible hours

    TSG

    Aurora, CO
    4 days ago
  •  ...our Machine Learning and Inference Platform that powers...  ...hardware, software, and models. We're looking for a strong...  ...deep experience in ML serving, high-performance...  ...excited to mentor engineers, innovate at scale, and...  ...Fridays are flexible for remote work except for employees... 
    Remote work
    Work at office
    Local area
    Monday to Thursday
    Flexible hours

    Roku

    Austin, TX
    2 days ago
  •  ...leading technology company is seeking a skilled ML Engineer responsible for developing and maintaining data pipelines for model training and evaluation. Candidates should...  ...competitive compensation, the opportunity to work remotely from anywhere in the world, and access to... 
    Remote work

    Eqvilent

    New York, NY
    3 days ago
  • $181.1k - $318.4k

     ...Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model Work Locations (2) Submit Resume Apple is where individual...  ...and enable reliable, efficient execution of large-scale training and inference jobs. This role spans scheduling... 
    Relocation

    Apple

    Santa Clara, CA
    12 hours ago
  •  ...Description As a Senior/Staff Engineer on the Foundation Model Compute Infrastructure team,...  ...accelerators and enable reliable, efficient execution of large-scale training and inference jobs. This role spans...  ...Experience with distributed ML training or inference systems... 

    Apple

    Seattle, WA
    4 days ago
  • $100k

     ...this innovation. It offers ML/AI practitioners across Netflix...  ...their machine learning models. As part of our mission...  ...for a Machine Learning Engineer to join our team to...  ...machine learning models for efficient and scalable inference. -Develop and maintain online... 
    Hourly pay
    Full time
    Immediate start
    Flexible hours

    Netflix

    Los Gatos, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff ML Inference Engineer — Model Efficiency (Remote). Be the first to apply!