ML Framework Engineer (MetalLM) for GPU Inference

Apple Inc.

Apple Inc. in Cupertino, California, is seeking an experienced ML Framework Engineer to join its Server ML Frameworks team. This role focuses on enabling Apple Intelligence through high-performance ML applications and involves working on custom-built server hardware for distributed inference. The ideal candidate will have a strong programming background and expertise in GPU compute, with responsibilities including optimizing ML frameworks and collaborating on GPU architecture design. Competitive benefits and pay are offered.

Vacancy posted 5 days ago
Similar jobs that could be interesting for you
Based on the ML Framework Engineer (MetalLM) for GPU Inference in Cupertino, CA vacancy
  • $147.4k - $272.1k

    ML Framework (MetalLM) Engineer, Graphics, Game and ML. Cupertino, California, United States. Apple’s Server ML Frameworks team in GPU, Graphics and Machine Learning works on enabling Apple Intelligence...  ...high-performance, distributed inference of GenAI applications (such as... 
    Suggested
    Relocation package

    Apple

    Cupertino, CA
    5 days ago
  •  ...100x better job search engine: fast, comprehensive, honest...  ...looking for a founding ML engineer who can help...  ...models, optimizing inference latency and throughput,...  ...of model performance, GPU utilization, inference...  ...worked with inference frameworks or serving stacks such... 
    Suggested
    Relocation package

    HiringCafe

    Cupertino, CA
    2 days ago
  •  ...We're looking for an Inference Optimization MLE to help...  ...closely with research engineers to translate model innovations...  ...optimization, ML systems, or a closely related...  ...with inference serving frameworks (e.g., Triton, TensorRT...  ...(But Not Required) GPU kernel or compiler‑level... 
    Suggested

    Rhoda AI

    Mountain View, CA
    7 days ago
  • $272k - $431.25k

     ...We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA to join our Hardware...  ...developments in AI/ML technologies, frameworks, and successful strategies, and...  ...data processing, model training, and inference pipelines. ~ Proficiency in programming... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...export, kernel development, and performance engineering so that every cycle on our accelerators...  ...The AI Kernels team builds high‑performance GPU kernels and custom libraries that sit at the heart of our on‑vehicle ML inference for ADAS and autonomous driving. We make core... 
    Suggested
    Local area
    Relocation package
    Flexible hours

    Dormont Manufacturing Co

    Sunnyvale, CA
    2 days ago
  • $155.42k - $205.9k

    Job Description Senior ML Infrastructure Engineer (ML Inference Platform). About the Team The ML Inference Platform...  ..., and scalability while maximizing GPU utilization across platforms (B200,...  ...state‑of‑the‑art model serving frameworks, hardware accelerators and... 
    Local area
    Remote work
    Relocation
    Relocation package
    Flexible hours

    Dormont Manufacturing Co

    Sunnyvale, CA
    2 days ago
  •  ...hiring a Machine Learning Systems Engineer in Cupertino, California. You...  ...optimize model training and inference on Apple's custom Silicon....  ...has strong experience in ML models, with proficiency in Python...  ...and knowledge of various ML frameworks. The role offers competitive... 

    Apple

    Cupertino, CA
    2 days ago
  • $159.05k - $199.3k

     ...looking for a software engineer with deep experience in optimizing ML models and deploying them...  ...work across the entire ML framework stack (e.g. PyTorch, JAX...  ...and latency of model inference for compute boards selected...  ...with ML accelerators, GPU, CPU, SoC architecture and... 
    Full time
    For contractors
    For subcontractor

    Decisive Point

    Sunnyvale, CA
    6 days ago
  •  ...industry‑leading training and inference speeds and empowers...  ...run large‑scale ML applications, without the...  ...over 10 times faster than GPU‑based hyperscale cloud...  ...versatile and experienced engineer to join our SOTA...  ...Experience with deep learning frameworks (e.g., PyTorch,... 
    Internship

    Cerebras

    Sunnyvale, CA
    2 days ago
  • $128.7k - $261.3k

     ...development, and performance engineering so that every cycle on...  ...into fast, reliable inference across GPUs powering GM...  ..., systems, and GPU engineers who enjoy working...  ...reliable, and effortless for ML engineers across the AV...  ...compilers Experience with ML frameworks (e.g., PyTorch,... 
    Local area
    Work from home
    Flexible hours

    Dormont Manufacturing Co

    Sunnyvale, CA
    2 days ago
  • $147.4k - $272.1k

     ...features. We are looking for an exceptional ML Engineer to help us build the next generation of...  ...data pipelines, and automated frameworks that ensure our health features are mathematically...  .... Experience building data pipelines, inference frameworks, and automated evaluation... 
    Relocation

    Apple

    Cupertino, CA
    3 days ago
  • $128.7k - $261.3k

     ...The Model Deployment & Inference Solutions team in GM AV...  ...learning models from training frameworks (e.g. PyTorch) onto...  ...is two‑fold: build the ML deployment platform...  ...performed manually by engineers. Build the developer experience...  ...with the NVIDIA GPU stack at the integration... 
    Local area
    Remote work
    Flexible hours
    Shift work

    General Motors

    Mountain View, CA
    4 days ago
  • $152k - $287.5k

     ...Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role...  ...involves developing algorithms for their LPX inference and compiler stack, optimizing the...  ...skills, and familiarity with deep learning frameworks. The position offers a competitive... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • ScOp Venture Capital is looking for an ML Systems Engineer to optimize LLM inference systems crucial for their AI platform. The role focuses on enhancing performance...  ...candidate will have a strong background in ML systems, GPU optimization, and programming skills in Python and C++.... 

    ScOp Venture Capital

    Santa Clara, CA
    5 days ago
  • A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in...  ...strong coding skills in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and competitive... 

    General Motors

    Sunnyvale, CA
    6 days ago
  • $278.1k - $347.6k

     ...Principal Machine Learning Engineer, you will be the...  ...role. You will define the inference strategy, drive architectural...  ...across the full mobile ML stack, and mentor a...  ...kernel tuning on NPU, GPU, and CPU. Architecture...  ...‑source ML inference frameworks or mobile ML research publications... 
    Work at office
    Worldwide
    Relocation package

    Dormont Manufacturing Co

    Mountain View, CA
    2 days ago
  •  ...industry-leading training and inference speeds; over 10 times faster than GPU-based hyperscale cloud inference...  ...We are looking for a Software Engineer to join the ML Integration and Quality team at...  ...building automation tools, testing frameworks, or internal developer tooling.... 
    Work at office
    Remote work

    Cerebras Systems, Inc.

    Sunnyvale, CA
    3 days ago
  • Dormont Manufacturing Co is seeking a Software Engineer in Sunnyvale, CA to enhance their platform's performance and usability. The successful...  ...design critical backend services and integrate with advanced GPU hardware systems, working on innovative solutions alongside a... 
    Relocation package

    Dormont Manufacturing Co

    Sunnyvale, CA
    2 days ago
  •  ...and deploy production‑grade ML systems with end‑to‑end ownership...  ...model training, deployment, inference, and monitoring in production...  ...experience in ML engineering. Strong programming skills in...  ...Hands‑on experience with ML frameworks such as PyTorch or TensorFlow... 
    Full time

    Catalyst Labs, LLC

    Cupertino, CA
    3 days ago
  • $129k - $198.4k

    General Motors is seeking an AI/ML Engineer for the Metrics Frameworks team in Sunnyvale, California. The successful candidate will focus on developing analytics frameworks and tools to accelerate autonomous vehicle development and testing. Candidates should have a BS... 

    General Motors

    Sunnyvale, CA
    5 days ago
  • Decisive Point is seeking a Software Engineer in Sunnyvale, California, with expertise in optimizing...  ...compute platforms, collaborating with ML engineers, and requires strong software...  ...in ML accelerators and deep learning frameworks such as PyTorch and JAX. The position offers... 

    Decisive Point

    Sunnyvale, CA
    3 days ago
  • $165k - $242k

    Dormont Manufacturing Co is seeking a Senior Engineer to lead designs and improve engineering standards. The role focuses on evolving our Kubernetes-native inference platform and ensuring reliability across multiple services. Qualified candidates should have 5-8 years... 

    Dormont Manufacturing Co

    Sunnyvale, CA
    2 days ago
  • $129k - $198.4k

    Job Description As an AI/ML Engineer on the Metrics Frameworks team, part of the Simulation, Evaluation, and Data organization, you will be an individual contributor focused on developing and optimizing infrastructure to accelerate autonomous vehicle development, testing... 
    Local area

    General Motors

    Sunnyvale, CA
    5 days ago
  • $296.3k

     ...seeking a Principal AI Engineer to lead the design and...  ...scale training and cloud inference. This includes...  ..., and optimize core AI/ML platform infrastructure...  ...Python, with proficiency in frameworks such as PyTorch (preferred...  ...with distributed systems, GPU computing, and cloud... 
    Remote work
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  • HiringCafe is seeking a Founding ML Engineer in Cupertino to transform AI and ML models into reliable production systems. You'll be responsible for deploying models, optimizing their performance, and ensuring they run efficiently in production. Success in this role requires... 

    HiringCafe

    Cupertino, CA
    5 days ago
  •  ...feel like one seamless engine. Developers can write once...  ...looking for a Senior ML Performance Engineer to...  ...optimization on modern GPU architectures. This is...  ...for evaluating LLM inference workloads across GPU clusters...  ...Experience with ML frameworks (PyTorch, TensorFlow, ONNX... 

    Lemurian Labs

    Santa Clara, CA
    22 days ago
  • Dormont Manufacturing Co is looking for a Senior ML Infrastructure Engineer to help build and scale robust platforms for ML inference workflows. You will collaborate with ML engineers and researchers while shaping the future of AI infrastructure at GM. The ideal candidate... 
    Remote job

    Dormont Manufacturing Co

    Sunnyvale, CA
    2 days ago
  •  ...is seeking a Machine Learning Engineer to build and optimize the infrastructure...  ...designing and deploying ML models for multimodal data understanding, optimizing inference pipelines, and collaborating...  ..., and experience with ML frameworks are required. Knowledge of NLP... 

    Corvic

    Mountain View, CA
    3 days ago
  • Apple Inc. is seeking an exceptional ML Engineer to join the Health Sensing Machine Learning Interpretability & Analytics team in Cupertino, California. The role involves developing scalable evaluation tools and ensuring model performance and safety for health sensing features... 

    Apple

    Cupertino, CA
    3 days ago
  •  ...generative AI to assist engineers in RTL design,...  ...Overview We are seeking an ML Systems Engineer to optimize...  ...large language model inference powering our agentic AI...  ...the systems level—from GPU kernel execution to memory...  ...and benchmarking frameworks that measure accuracy,... 

    ScOp Venture Capital

    Santa Clara, CA
    5 days ago
