Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Foundation Model Services ML Engineer for Scale Inference

$181.1k - $318.4k

Apple Oakbrook

Apple Inc. in Santa Clara, California, is looking for an experienced Machine Learning engineer to optimize and build production-grade solutions serving millions in real time. You will work closely with product teams and utilize advanced machine learning technologies, contributing directly to optimizing language and vision models. Applicants should have at least 5 years of industry experience in machine learning, be proficient in cloud applications, and possess a bachelor's degree in a relevant field. A competitive compensation package including a salary range of $181,100 to $318,400 awaits the right candidate. #J-18808-Ljbffr

Vacancy posted 14 hours ago
Similar jobs that could be interesting for youBased on the Foundation Model Services ML Engineer for Scale Inference in Santa Clara, CA vacancy
  • $181.1k - $318.4k

     ...new product we build, service we create, or Apple...  ...Description As a Senior/Staff Engineer on the Foundation Model Compute...  ...orchestration systems for large‑scale TPU workloads across...  ...‑scale training and inference jobs. This role spans...  ...for distributed ML workloads running on... 
    Foundation
    Relocation

    Apple

    Santa Clara, CA
    14 hours ago
  • $181.1k - $318.4k

     ...bring smile to people’s face”. Foundation Model Services team, within Machine...  ...technologies and make it run at scale of Apple. Description Work closely...  ...to prototype and develop inference for cutting‑edge model...  ...year+ industry experience in ML technologies (LLMs, Machine... 
    Foundation
    Relocation

    Apple

    Santa Clara, CA
    15 hours ago
  •  ...Inference Optimization MLE At Rhoda AI, we're building...  ...and state-of-the-art foundation world models that control our...  ...development, and manufacturing scale-up to make generalist...  ...with research engineers to translate model...  ...inference optimization, ML systems, or a closely... 
    Foundation

    Rhoda ai

    Palo Alto, CA
    3 days ago
  •  ...building the full-stack foundation for the next...  ...the foundational models and video world...  ...intersection of large-scale learning,...  ...re looking for an ML Infrastructure Engineer to help build and operate the inference systems that power...  ...Design and scale services to serve various... 
    Foundation

    Rhoda AI

    Palo Alto, CA
    3 days ago
  •  ...building a 100x better job search engine: fast, comprehensive,...  ...are looking for a founding ML engineer who can help us turn powerful AI and ML models into fast, reliable...  ...deploying models, optimizing inference latency and throughput, scaling serving systems, and making... 
    Suggested
    Full time
    Relocation package

    HiringCafe

    Cupertino, CA
    3 days ago
  • $189k - $300k

     ...of transportation on a global scale. The Data Scaling team owns the Data Flywheel for AV Foundation model development and successive fine...  ...works on and delivers ML models to the product that successively...  ..., high‑impact team of AI/ML engineers, data scientists and engineers... 
    Foundation
    Local area
    Remote work
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    14 hours ago
  • $153.2k - $234.1k

     ...transportation on a global scale. Role Overview:...  ...the Embodied AI Infra Foundation team at General Motors,...  ...every machine learning engineer working on our cutting-...  ...edge Autonomous Driving models. From foundational models...  .... As a Senior ML Infra Engineer, you will... 
    Foundation
    Work at office
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    5 days ago
  • $150k

     ...About the Institute of Foundation Models We are a dedicated...  ...data scientists, and engineers, tackling the most fundamental...  ...Engineer focused on ML infrastructure and...  ...experimental work can scale reliably when needed....  ...preferably AWS) and core services for compute, storage,... 
    Foundation
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $124k

     ...re not just training models, we're building the foundation models that power...  ...systems at enterprise scale. In addition, we deploy...  ...for quantized inference, if you excel at making...  ...compiler, inference engine, and silicon teams to...  ..., theft & legal services, and pet insurance... 
    Foundation
    Hourly pay
    Full time
    Temporary work
    Immediate start
    Flexible hours

    Tesla

    Palo Alto, CA
    3 days ago
  • $244.14k - $413.16k

     ...Senior Staff Machine Learning Engineer - Foundation Model Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation...  ...infrastructure experts to design, train, and deploy large-scale multi-modal models that unify vision, language, and control... 
    Foundation
    Full time

    XPENG

    Santa Clara, CA
    3 days ago
  • $175k - $296k

     ...full-time Machine Learning Engineer, with deep knowledge and...  ...a state-of-art ML infrastructure for training very large foundation model and accelerating model training/inference. Our mission is to solve...  ...Experience in training large scale vision or language models... 
    Foundation
    Full time

    XPeng Motors

    Santa Clara, CA
    more than 2 months ago
  •  ...ML Infrastructure Service Reliability Engineer- Apple Services Engineering At Apple, we don’t just build products...  ...these challenges through a strong foundation in cloud object storage, data analysis...  ..., resilient, and efficient at scale. Description We are seeking an experienced... 
    Foundation

    Apple

    Cupertino, CA
    15 hours ago
  •  ...re reimagining the foundations of computing to...  ...remove the limits of scale, hardware, and...  ...like one seamless engine. Developers can write...  ...looking for a Senior ML Performance...  ...of large language models — including Llama...  ...for evaluating LLM inference workloads across GPU... 
    Foundation

    Lemurian Labs

    Santa Clara, CA
    18 days ago
  •  ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated...  ...state‑of‑the‑art generative AI models on our custom hardware. You...  ...enabling key ML features at scale, maintaining our speed...  ...minute. Scale our inference service by implementing detailed observability... 

    Dormont Manufacturing Company

    Sunnyvale, CA
    1 day ago
  • $124k - $195.5k

     ...Learning Applications and Compiler Engineer for New College Grad 2026 in...  ...on developing algorithms for inference and compiler stack...  ...intersection of deep learning and large-scale systems. Ideal candidates should...  ..., and experience with ML frameworks like TensorFlow and... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $156k - $316.8k

     ...Research Scientist — Privacy-Preserving Large-Scale Model Training & Architecture Optimization...  ...) tailored to long-running, multi-stage foundation model training. Build fault-tolerant,...  ...Qualifications: Experience with privacy-preserving ML, sensitive data training, or regulated... 
    Foundation
    Temporary work
    Local area

    Ellis Technologies, Inc.

    San Jose, CA
    15 hours ago
  • $150k

     ...About the Institute of Foundation Models We are a dedicated...  ...data scientists, and engineers, tackling the most fundamental...  ...development of large-scale VLM systems, spanning...  ...model modularity, and inference optimization. Build...  ...techniques. Experience with ML infrastructure,... 
    Foundation

    Institute of Foundation Models

    Sunnyvale, CA
    15 hours ago
  •  ...company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms...  ...to ensure efficient model serving, leading technical...  ...-making, and driving large-scale initiatives across GM's ML...  ...or C++, and expertise in ML inference. The position offers a hybrid... 

    General Motors

    Sunnyvale, CA
    15 hours ago
  •  ...shaping the future of transportation on a global scale. The Data Scaling team owns the Data Flywheel for AV Foundation model development and successive fine tuning. It...  ...loop. The team directly works on and delivers ML models to the product that successively go up the... 
    Foundation
    Local area
    Remote work
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    14 hours ago
  • $254k - $349.25k

     ...seeking a Principal ML Architect to lead...  ...deep expertise in model architecture,...  ...capable of operating at scale across high-volume...  ...Optimize inference systems for low latency...  ..., etc.) Systems & Engineering Experience designing...  ...define the technical foundation for secure,... 
    Foundation
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    14 hours ago
  • $254k - $349.25k

     ...seeking a Principal ML Architect to lead...  ...deep expertise in model architecture,...  ...capable of operating at scale across high-...  ...environments Optimize inference systems for low...  ...etc.) Systems & Engineering Experience...  ...define the technical foundation for secure,... 
    Foundation
    Flexible hours

    Proofpoint

    Sunnyvale, CA
    2 days ago
  •  ...is hiring a Machine Learning Systems Engineer in Cupertino, California. You will collaborate with Siri modeling teams to optimize model training and inference on Apple's custom Silicon. The ideal candidate has strong experience in ML models, with proficiency in Python... 

    Apple

    Cupertino, CA
    15 hours ago
  • $184k - $287.5k

     ...accelerating it. The TensorRT inference platform is the backbone of...  ...of cutting-edge deep learning models on every NVIDIA GPU. With demand...  ...a highly skilled and driven Engineering Manager to take the lead in...  .... Proven ability to lead and scale high-performing engineering teams... 

    NVIDIA

    Santa Clara, CA
    14 hours ago
  • $220k - $320k

     ...About the Institute of Foundation Models We are a dedicated...  ...data scientists, and engineers, tackling the most fundamental...  ...the rigor required to scale PAN-v2, translating...  ...familiarity with the ML training lifecycle....  ...between pre-training and inference, know what a... 
    Foundation
    Visa sponsorship
    Flexible hours

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  • $128.7k - $261.3k

     ...enables repeatable, high-velocity model deployments through principled...  ...and deployment and infra engineers to ship numerically robust, low...  ..., Mathematics, Data Science / ML, or a closely related quantitative...  ...model compression / efficient inference or relevant experience ~... 
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  • $172.43k - $230.95k

     ...Senior Software Engineer For The Ai Model Lifecycle Team Crusoe...  ...who believe in the scale of our ambition and...  ...construction, and cloud services. If you want to...  ...tuning systems for large foundation models (SFT, PEFT,...  ...on GPU systems and inference frameworks.... 
    Foundation
    Temporary work

    Crusoe

    Sunnyvale, CA
    4 days ago
  • $150k - $350k

     ...About the Institute of Foundation Models We are a dedicated...  ...data scientists, and engineers, tackling the most fundamental...  ...of our large‑scale GPU resources. You track...  ...specifically within AI/ML or HPC environments. Foundation...  ...pre‑training and inference, and are familiar with... 
    Foundation
    Live in
    Immediate start

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  •  ...when it's needed. This model supports real-time...  ...As a Senior IT AI/ML Engineer , you will be responsible...  ...deployment and real-time inference systems. System...  ...Design and optimize large-scale AI/ML systems for...  ...Technical Expertise : Strong foundation in machine learning (... 
    Foundation
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $150k

     ...About the Institute of Foundation Models We are a dedicated...  ...data scientists, and engineers, tackling the most fundamental...  ...The Distributed ML Engineer will play a...  ...especially at training and inference, and support the team...  ..., and large‑scale machine learning experience... 
    Foundation
    Work experience placement
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    14 hours ago
  •  ...General Motors is seeking a Senior ML Infrastructure Engineer to build and scale a robust platform for machine learning inference workflows. You will design backend software components, collaborate with ML engineers, and lead initiatives across GM's ML ecosystem. With... 
    Remote work

    General Motors

    Sunnyvale, CA
    15 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Foundation Model Services ML Engineer for Scale Inference. Be the first to apply!