ML Framework Engineer (MetalLM) for GPU Inference

Apple Inc.

Apple Inc. in Cupertino, California, is seeking an experienced ML Framework Engineer to join their Server ML Frameworks team. This role focuses on enabling Apple Intelligence through high-performance ML applications and involves working on custom-built server hardware for distributed inference. The ideal candidate will have a strong programming background and expertise in GPU compute, with responsibilities including optimizing ML frameworks and collaborating on GPU architecture design. Competitive benefits and pay are offered. #J-18808-Ljbffr Apple Inc.

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the ML Framework Engineer (MetalLM) for GPU Inference in Cupertino, CA vacancy

ML Framework (MetalLM) Engineer, Graphics, Game and ML
$147.4k - $272.1k
ML Framework (MetalLM) Engineer, Graphics, Game and ML Cupertino, California, United States Apple’s Server ML Frameworks team in GPU, Graphics and Machine Learning works on enabling Apple Intelligence... ...high-performance, distributed inference of GenAI applications (such as...
Suggested
Relocation package
Apple
Cupertino, CA
5 days ago
ML Engineer - Inference & Model Deployment
...100x better job search engine: fast, comprehensive, honest... ...looking for a founding ML engineer who can help... ...models, optimizing inference latency and throughput,... ...of model performance, GPU utilization, inference... ...worked with inference frameworks or serving stacks such...
Suggested
Relocation package
HiringCafe
Cupertino, CA
2 days ago
Inference Optimization ML Engineer
...We're looking for an Inference Optimization MLE to help... ...closely with research engineers to translate model innovations... ...optimization, ML systems, or a closely related... ...with inference serving frameworks (e.g., Triton, TensorRT... ...(But Not Required) GPU kernel or compiler‑level...
Suggested
Rhoda AI
Mountain View, CA
7 days ago
Principal AI and ML Infra Software Engineer, GPU Clusters
$272k - $431.25k
...We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA to join our Hardware... ...developments in AI/ML technologies, frameworks, and successful strategies, and... ...data processing, model training, and inference pipelines. ~ Proficiency in programming...
Suggested
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior ML Accelerator Engineer - GPU
...export, kernel development, and performance engineering so that every cycle on our accelerators... ...The AI Kernels team builds high‑performance GPU kernels and custom libraries that sit at the heart of our on‑vehicle ML inference for ADAS and autonomous driving. We make core...
Suggested
Local area
Relocation package
Flexible hours
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
Senior ML Infrastructure Engineer, Inference Platform
$155.42k - $205.9k
Job Description Senior ML Infrastructure Engineer (ML Inference Platform). About the Team The ML Inference Platform... ..., and scalability while maximizing GPU utilization across platforms (B200,... ...state‑of‑the‑art model serving frameworks, hardware accelerators and...
Local area
Remote work
Relocation
Relocation package
Flexible hours
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
ML Systems Engineer: Scale Training & Inference on Custom Silicon
...hiring a Machine Learning Systems Engineer in Cupertino, California. You... ...optimize model training and inference on Apple's custom Silicon.... ...has strong experience in ML models, with proficiency in Python... ...and knowledge of various ML frameworks. The role offers competitive...
Apple
Cupertino, CA
2 days ago
ML Runtime Optimization Engineer
$159.05k - $199.3k
...looking for a software engineer with deep experience in optimizing ML models and deploying them... ...work across the entire ML framework stack (e.g. PyTorch, JAX... ...and latency of model inference for compute boards selected... ...with ML accelerators, GPU, CPU, SoC architecture and...
Full time
For contractors
For subcontractor
Decisive Point
Sunnyvale, CA
6 days ago
Senior ML Systems Engineer
...industry‑leading training and inference speeds and empowers... ...run large‑scale ML applications, without the... ...over 10 times faster than GPU‑based hyperscale cloud... ...versatile and experienced engineer to join our SOTA... ...Experience with deep learning frameworks (e.g., PyTorch,...
Internship
Cerebras
Sunnyvale, CA
2 days ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...development, and performance engineering so that every cycle on... ...into fast, reliable inference across GPUs powering GM... ..., systems, and GPU engineerswho enjoyworking... ...reliable, andeffortlessfor ML engineers across the AV... ...compilers Experience with ML frameworks (e.g.,PyTorch,...
Local area
Work from home
Flexible hours
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
Machine Learning Engineer - AI & ML Evaluation Frameworks
$147.4k - $272.1k
...features. We are looking for an exceptional ML Engineer to help us build the next generation of... ...data pipelines, and automated frameworks that ensure our health features are mathematically... .... Experience building data pipelines, inference frameworks, and automated evaluation...
Relocation
Apple
Cupertino, CA
3 days ago
Senior ML Inference Engineer - Platform
$128.7k - $261.3k
...The Model Deployment & Inference Solutions team in GM AV... ...learning models from training frameworks (e.g. PyTorch) onto... ...is two‑fold: build the ML deployment platform... ...performed manually by engineers. Build the developer experience... ...with the NVIDIA GPU stack at the integration...
Local area
Remote work
Flexible hours
Shift work
General Motors
Mountain View, CA
4 days ago
Senior ML Compiler & Inference Systems Engineer
$152k - $287.5k
...Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role... ...involves developing algorithms for their LPX inference and compiler stack, optimizing the... ...skills, and familiarity with deep learning frameworks. The position offers a competitive...
NVIDIA Gruppe
Santa Clara, CA
5 days ago
ML Systems Engineer: Production-Scale LLM Inference
ScOp Venture Capital is looking for an ML Systems Engineer to optimize LLM inference systems crucial for their AI platform. The role focuses on enhancing performance... ...candidate will have a strong background in ML systems, GPU optimization, and programming skills in Python and C++....
ScOp Venture Capital
Santa Clara, CA
5 days ago
Staff ML Infra Engineer: Scalable Inference Platform (Hybrid)
A leading automotive company is seeking a Staff ML Infrastructure Engineer to build robust compute platforms for machine learning workflows in... ...strong coding skills in Go, Python or C++, and expertise in ML inference. The position offers a hybrid work model and competitive...
General Motors
Sunnyvale, CA
6 days ago
Principal Machine Learning Engineer, Mobile AI Inference Optimization
$278.1k - $347.6k
...Principal Machine Learning Engineer, you will be the... ...role. You will define the inference strategy, drive architectural... ...across the full mobile ML stack, and mentor a... ...kernel tuning on NPU, GPU, and CPU. Architecture... ...‑source ML inference frameworks or mobile ML research publications...
Work at office
Worldwide
Relocation package
Dormont Manufacturing Co
Mountain View, CA
2 days ago
Senior ML Software Engineer - Integration & Quality
...industry-leading training and inference speeds; over 10 times faster than GPU-based hyperscale cloud inference... ...We are looking for a Software Engineer to join the ML Integration and Quality team at... ...building automation tools, testing frameworks, or internal developer tooling....
Work at office
Remote work
Cerebras Systems, Inc.
Sunnyvale, CA
3 days ago
ML Infrastructure Engineer - Scale Backend & GPU
Dormont Manufacturing Co is seeking a Software Engineer in Sunnyvale, CA to enhance their platform's performance and usability. The successful... ...design critical backend services and integrate with advanced GPU hardware systems, working on innovative solutions alongside a...
Relocation package
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
ML Engineer
...and deploy production‑grade ML systems with end‑to‑end ownership... ...model training, deployment, inference, and monitoring in production... ...experience in ML engineering. Strong programming skills in... ...Hands‑on experience with ML frameworks such as PyTorch or TensorFlow...
Full time
Catalyst Labs, LLC
Cupertino, CA
3 days ago
Senior AI/ML C++ Metrics Frameworks Engineer
$129k - $198.4k
General Motors is seeking an AI/ML Engineer for the Metrics Frameworks team in Sunnyvale, California. The successful candidate will focus on developing analytics frameworks and tools to accelerate autonomous vehicle development and testing. Candidates should have a BS...
General Motors
Sunnyvale, CA
5 days ago
Embedded ML Inference Optimization Engineer
Decisive Point is seeking a Software Engineer in Sunnyvale, California, with expertise in optimizing... ...compute platforms, collaborating with ML engineers, and requires strong software... ...in ML accelerators and deep learning frameworks such as PyTorch and JAX. The position offers...
Decisive Point
Sunnyvale, CA
3 days ago
Senior AI/ML Systems Engineer - Scalable Inference
$165k - $242k
Dormont Manufacturing Co is seeking a Senior Engineer to lead designs and improve engineering standards. The role focuses on evolving our Kubernetes-native inference platform and ensuring reliability across multiple services. Qualified candidates should have 5-8 years...
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
ML / AI Software Engineer - C++ Metrics Frameworks
$129k - $198.4k
Job Description As an AI/ML Engineer on the Metrics Frameworks team, part of the Simulation, Evaluation, and Data organization, you will be an individual contributor focused on developing and optimizing infrastructure to accelerate autonomous vehicle development, testing...
Local area
General Motors
Sunnyvale, CA
5 days ago
Principal Machine Learning Engineer
$296.3k
...seeking a Principal AI Engineer to lead the design and... ...scale training and cloud inference. This includes... ..., and optimize core AI/ML platform infrastructure... ...Python, with proficiency in frameworks such as PyTorch (preferred... ...with distributed systems, GPU computing, and cloud...
Remote work
Flexible hours
General Motors
Sunnyvale, CA
2 days ago
Founding ML Engineer: Production Inference & Deployment
HiringCafe is seeking a Founding ML Engineer in Cupertino to transform AI and ML models into reliable production systems. You'll be responsible for deploying models, optimizing their performance, and ensuring they run efficiently in production. Success in this role requires...
HiringCafe
Cupertino, CA
5 days ago
Senior ML Performance Engineer
...feel like one seamless engine. Developers can write once... ...looking for a Senior ML Performance Engineer to... ...optimization on modern GPU architectures. This is... ...for evaluating LLM inference workloads across GPU clusters... ...Experience with ML frameworks (PyTorch, TensorFlow, ONNX...
Lemurian Labs
Santa Clara, CA
22 days ago
Senior ML Inference Platform Engineer (Remote)
Dormont Manufacturing Co is looking for a Senior ML Infrastructure Engineer to help build and scale robust platforms for ML inference workflows. You will collaborate with ML engineers and researchers while shaping the future of AI infrastructure at GM. The ideal candidate...
Remote job
Dormont Manufacturing Co
Sunnyvale, CA
2 days ago
ML Engineer — AI Platform & Multimodal Inference
...is seeking a Machine Learning Engineer to build and optimize the infrastructure... ...designing and deploying ML models for multimodal data understanding, optimizing inference pipelines, and collaborating... ..., and experience with ML frameworks are required. Knowledge of NLP...
Corvic
Mountain View, CA
3 days ago
ML Engineer Health AI Evaluation & Safety Frameworks
Apple Inc. is seeking an exceptional ML Engineer to join the Health Sensing Machine Learning Interpretability & Analytics team in Cupertino, California. The role involves developing scalable evaluation tools and ensuring model performance and safety for health sensing features...
Apple
Cupertino, CA
3 days ago
ML Systems Engineer
...generative AI to assist engineers in RTL design,... ...Overview We are seeking an ML Systems Engineer to optimize... ...large language model inference powering our agentic AI... ...the systems level—from GPU kernel execution to memory... ...and benchmarking frameworks that measure accuracy,...
ScOp Venture Capital
Santa Clara, CA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Framework Engineer (MetalLM) for GPU Inference. Be the first to apply!