ML Engineer: LLMs & Generative AI Inference

$147.4k - $272.1k

Apple Inc.

A leading technology company is searching for a Machine Learning Engineer in Cupertino, California. The role involves working with Large Language Models and Generative AI to enhance user experiences across Apple's platforms. Candidates should have extensive experience in Machine Learning, ideally with published research, and an advanced degree in a related field is preferred. The position offers a competitive salary range of $147,400 to $272,100, along with comprehensive benefits and opportunities for stock ownership. #J-18808-Ljbffr Apple Inc.

Apply

Vacancy posted 16 hours ago

Similar jobs that could be interesting for youBased on the ML Engineer: LLMs & Generative AI Inference in Cupertino, CA vacancy

Machine Learning Engineer, Proactive - Large Language Models & Generative AI Inference
$147.4k - $272.1k
...Machine Learning Engineer, Proactive - Large Language Models & Generative AI Inference The Intelligence Platform team empowers clients across Apple's operating systems... ...a particular emphasis on Large Language Models (LLMs) and Generative AI. Published research in the...
Suggested
Relocation
Apple
Cupertino, CA
11 hours ago
Staff Inference ML Runtime Engineer
...the world's largest AI chip, 56 times... ...leading training and inference speeds and... ...effortlessly run large-scale ML applications,... ...offers the fastest Generative AI inference solution... ...The Inference ML Engineering team at Cerebras... ...inference systems for LLMs or multimodal...
Suggested
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
1 day ago
ML Engineer L4, Consumer Inference
$100k
...innovation. It offers ML/AI practitioners across Netflix... ...a Machine Learning Engineer to join our team to... ...efficient and scalable inference. -Develop and... ...large language models (LLMs) for efficient, scalable... ...large-scale inference for generative models and large...
Suggested
Hourly pay
Full time
Immediate start
Flexible hours
Netflix
Los Gatos, CA
16 hours ago
Inference Optimization ML Engineer
...Inference Optimization MLE At Rhoda AI, we're building the next generation of generalist intelligent robots. We own the full robotics... ...Collaborate closely with research engineers to translate model... ...experience in inference optimization, ML systems, or a closely related...
Suggested
Rhoda ai
Palo Alto, CA
3 days ago
Senior Machine Learning Engineer (Generative AI)
...are looking for a Machine Learning Engineer or scientist who will be... ...evaluating, and shipping different AI/ML technologies to improve data quality... ...or more of the following ML areas: generative AI models (e.g. Transformers, LLMs, VLMs, MLLMs), computer vision or...
Suggested
Apple
Cupertino, CA
16 hours ago
Data Scientist + ML Engineer - Gen AI
...Job Title : Data Scientist + ML Engineer (Gen AI) Location : Cupertino,... ...Data Scientist + ML Engineer (Generative AI) to join our team. In... ...models, large language models (LLMs), and other state-of-the-art... ...training, evaluation, and inference workflows Conduct exploratory...
Pride Global
Cupertino, CA
16 hours ago
Generative AI - ML System Engineering
...Machine Learning Systems Engineer We are looking for Machine Learning... ...help us build our end to end ML framework dedicated for 3D,... ...are the market leader in 3D generative AI, recognized as the No.1 in... ...challenges in both training and inference. Your next challenge at Meshy...
Part time
Remote work
Meshy
Sunnyvale, CA
3 days ago
Senior ML Infrastructure Engineer, Proactive
$181.1k - $318.4k
...Senior ML Infrastructure Engineer, Proactive The Intelligence Platform... ...user-centric knowledge and inferences that enable next generation user experiences. We're... ...technologies like generative AI, graph machine learning,... ...Experience building on LLMs or other generative...
Worldwide
Relocation
Apple
Cupertino, CA
11 hours ago
ML Inference Engineer
At Rhoda AI, we're building the full-stack foundation for the next generation of humanoid robots — from high-performance, software-defined... ...a reality. We're looking for an ML Infrastructure Engineer to help build and operate the inference systems that power our automation...
Rhoda AI
Palo Alto, CA
3 days ago
Senior AI/ML Platform Engineer (LLM/SLM Inference)
$199.7k - $254.6k
...Team Join Cisco’s CX AI Incubation Team as a Senior AI/ML DevOps Engineer and help productionize... ...AI services, optimizing inference performance from CPU... ...end-to-end AI DevOps for LLMs/SLMs, including on-prem... ...serving architectures for generative AI (multi-tenant,...
Full time
Temporary work
Local area
Flexible hours
Cisco
San Jose, CA
3 days ago
Senior Health AI ML Engineer: Generative Models & LLMs
$212k - $386.3k
Apple Inc. in Cupertino is seeking a Senior Engineer for the Health AI team to design innovative machine learning solutions that impact millions. The ideal candidate will have over 10 years of software development experience, expertise in machine learning, and a strong...
Apple Inc.
Cupertino, CA
2 days ago
Machine Learning Research Engineer, Generative AI
$147.4k - $272.1k
...Machine Learning Research Engineer, Generative AI Apple is where individual imaginations gather together, committing to... ...experience in building product features based on ML including generative models or multimodal LLMs Experience with image processing, computer vision...
Relocation
Apple
Cupertino, CA
3 days ago
Multimodal ML Research Engineer (LLMs & AI Agents)
$181.1k - $318.4k
...machine learning. The role involves innovative research on Multimodal LLMs and AI Agents, collaboration with experts, and the possibility of publishing results. A PhD or MS in Computer Science or Engineering is required, alongside strong expertise in machine learning....
Apple Inc.
Cupertino, CA
2 days ago
Generative AI / Machine Learning Engineer
...Job Title: Generative AI / Machine Learning Engineer Work Location with ZIP: Sunnyvale, CA 94085 (Hybrid)... ...optimization, and deployment. Knowledge of LLMs and Generative AI, including fine... .... Ability to build scalable ML/AI pipelines and APIs, integrating models...
eTeam
Sunnyvale, CA
2 days ago
Distinguished, Software Engineer -AI/ML Engineer- Walmart Connect
$169k - $338k
...Overview: As a Distinguished, Software Engineer - AI/ML Engineer- Walmart Connect , you... ...ML strategy and delivery of custom LLMs, multimodal generative models, RAG pipelines, and... ...omnichannel measurement. Drive inference optimization (quantization,vLLM,TensorRT...
Full time
Temporary work
Part time
Local area
Walmart
Sunnyvale, CA
1 day ago
Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs
$193.3k - $261.5k
...performance for AWS's custom ML accelerators. Working... ...software boundary, our engineers craft high-performance... ...of what's possible in AI acceleration. The... ...enabling unparalleled ML inference and training... ...performance across multiple generations of Neuron hardware Conduct...
Internship
Local area
Work from home
Flexible hours
Amazon
Cupertino, CA
2 days ago
Senior ML Evaluation Engineer - Autonomous Vehicles
$184k - $287.5k
...building the next generation of driving behavior... ...evaluation using LLMs, VLMs, and agentic... ...systems that bridge ML research and... ...that chain model inference, retrieval, and structured... ...Science, Computer Engineering, or a related... ...Knowledge of agentic AI frameworks (...
Remote work
NVIDIA
Santa Clara, CA
1 day ago
Senior Machine Learning Engineer - Physical AI and Synthetic Data Generation
$224k - $356.5k
...outstanding Machine Learning Engineers to join our Physical AI teams! As the pioneers of... ...developing sophisticated generative pipelines to build high-... ...to the full lifecycle of ML software, including performance... ...the performance during inference/training. Familiarity with...
NVIDIA
Santa Clara, CA
2 days ago
ML Modeling Engineer, AI Hardware
$176k - $420k
...Expect The Tesla AI Hardware team is at... ...brilliant engineers and visionaries, the... ...develops advanced AI inference chips tailored to accelerate... ...team, the AI/ML Modeling Engineer will... ...of next-generation tensor compute hardware... ...Large Language Models (LLMs), transformer architectures...
Hourly pay
Full time
Temporary work
Flexible hours
Tesla
Palo Alto, CA
16 hours ago
Machine Learning Engineer
...job Sonilo is a VC-backed AI music generation company building the next generation... ...As a Machine Learning Engineer, you will combine hands-on... ...edge large language models (LLMs) and diffusion models.... ...scalable model architectures and inference pipelines for multimodal generation...
Sonilo
Sunnyvale, CA
3 days ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...accessible mobility. For the AI Kernels & Compilers team,... ..., and performance engineering so that every cycle on our... ...into fast, reliable inference across GPUs powering GM's next-generation autonomous and assisted driving... ..., and effortless for ML engineers across the AV organization...
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
3 days ago
Senior ML Performance Engineer
...computing to make AI accessible to everyone... ...like one seamless engine. Developers can... ...looking for a Senior ML Performance Engineer... ...for evaluating LLM inference workloads across GPU... ...-based models and LLMs ~ Hands-on experience... ...hardware and next-generation LLMs....
Lemurian Labs
Santa Clara, CA
3 days ago
Senior Computer Vision and Machine Learning Engineer, Creator Studio
$181.1k - $318.4k
...Vision and Machine Learning Engineer, Creator Studio Work... ...help shaping the next generation of creative editing... ...the field of Generative AI. The ideal candidate... ...validation to efficient inference at scale Design data... ...particularly multimodal LLMs, Mixture of Experts, PEFT...
Relocation
Apple
Cupertino, CA
1 day ago
ML Engineer - Creator Studio
...ML Engineer - Creator Studio Apple is where individual imaginations gather together... ...new and innovative on-device ML and AI tools in the creative space. You... ...Experience delivering products in Multimodal-LLMs/Foundation models, Generative AI, Machine Learning or related...
Apple
Cupertino, CA
2 days ago
Machine Learning Engineer - Inference
$156k - $387.6k
...Machine Learning Engineer - Inference Location: San Jose Team: Technology Employment Type... ...of our AML team is to push the next-generation AI infrastructure and recommendation platform... ...-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/RDMA)...
Temporary work
Local area
ByteDance
San Jose, CA
1 day ago
Senior ML Engineer, LLM / VLM Distillation
$213k - $263k
...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous... ...The mission of the Waymo AI Foundations team is to... ...learning from demonstration, generative modeling, Bayesian inference, hierarchical learning,... ...or deploying multi-modal LLMs. Distillation related experience...
Full time
Remote work
Waymo
Mountain View, CA
16 hours ago
Senior ML Infrastructure Engineer, Inference Platform
$155.42k - $205.9k
...Job Description About the Team: The ML Inference Platform is part of the AV ML... ...cost-efficient platform that powers GM's AI efforts. We're proud to serve teams developing... ...are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms...
Local area
Remote work
Work from home
Relocation
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
1 day ago
Senior, Data Scientist (Machine Learning Engineer)
..., Deep Learning, and Engineering. We tackle complex problems... ...of-the-art GenAI and ML models to identify... ...in building the next generation of our compliance... ...batch and real-time inference pipelines using frameworks... ...with Generative AI technologies: LLMs, multimodal models, RAG...
Walmart
Sunnyvale, CA
11 days ago
Machine Learning/Generative AI Engineering Manager - Maps Search Query Understanding
$228.1k - $393.8k
...Machine Learning/Generative AI Engineering Manager - Maps Search Query Understanding Apple Maps are... ...of innovative engineers while driving ML and Generative AI solutions at scale in... ...search features including Generative AI and LLMs, large-scale machine learning models,...
Relocation
Apple
Cupertino, CA
16 hours ago
Staff ML Engineer, Inference Platform
$185.5k - $270k
...eligible for relocation assistance. About the Team: The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure... ...the Role: We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms for ML...
Local area
Work from home
Relocation package
Flexible hours
General Motors
Sunnyvale, CA
16 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Engineer: LLMs & Generative AI Inference. Be the first to apply!