Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Engineer: LLMs & Generative AI Inference

$147.4k - $272.1k

Apple Inc.

A leading technology company is searching for a Machine Learning Engineer in Cupertino, California. The role involves working with Large Language Models and Generative AI to enhance user experiences across Apple's platforms. Candidates should have extensive experience in Machine Learning, ideally with published research, and an advanced degree in a related field is preferred. The position offers a competitive salary range of $147,400 to $272,100, along with comprehensive benefits and opportunities for stock ownership. #J-18808-Ljbffr Apple Inc.

Vacancy posted 16 hours ago
Similar jobs that could be interesting for youBased on the ML Engineer: LLMs & Generative AI Inference in Cupertino, CA vacancy
  • $147.4k - $272.1k

     ...Machine Learning Engineer, Proactive - Large Language Models & Generative AI Inference The Intelligence Platform team empowers clients across Apple's operating systems...  ...a particular emphasis on Large Language Models (LLMs) and Generative AI. Published research in the... 
    Suggested
    Relocation

    Apple

    Cupertino, CA
    11 hours ago
  •  ...the world's largest AI chip, 56 times...  ...leading training and inference speeds and...  ...effortlessly run large-scale ML applications,...  ...offers the fastest Generative AI inference solution...  ...The Inference ML Engineering team at Cerebras...  ...inference systems for LLMs or multimodal... 
    Suggested

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    1 day ago
  • $100k

     ...innovation. It offers ML/AI practitioners across Netflix...  ...a Machine Learning Engineer to join our team to...  ...efficient and scalable inference. -Develop and...  ...large language models (LLMs) for efficient, scalable...  ...large-scale inference for generative models and large... 
    Suggested
    Hourly pay
    Full time
    Immediate start
    Flexible hours

    Netflix

    Los Gatos, CA
    16 hours ago
  •  ...Inference Optimization MLE At Rhoda AI, we're building the next generation of generalist intelligent robots. We own the full robotics...  ...Collaborate closely with research engineers to translate model...  ...experience in inference optimization, ML systems, or a closely related... 
    Suggested

    Rhoda ai

    Palo Alto, CA
    3 days ago
  •  ...are looking for a Machine Learning Engineer or scientist who will be...  ...evaluating, and shipping different AI/ML technologies to improve data quality...  ...or more of the following ML areas: generative AI models (e.g. Transformers, LLMs, VLMs, MLLMs), computer vision or... 
    Suggested

    Apple

    Cupertino, CA
    16 hours ago
  •  ...Job Title : Data Scientist + ML Engineer (Gen AI) Location : Cupertino,...  ...Data Scientist + ML Engineer (Generative AI) to join our team. In...  ...models, large language models (LLMs), and other state-of-the-art...  ...training, evaluation, and inference workflows Conduct exploratory... 

    Pride Global

    Cupertino, CA
    16 hours ago
  •  ...Machine Learning Systems Engineer We are looking for Machine Learning...  ...help us build our end to end ML framework dedicated for 3D,...  ...are the market leader in 3D generative AI, recognized as the No.1 in...  ...challenges in both training and inference. Your next challenge at Meshy... 
    Part time
    Remote work

    Meshy

    Sunnyvale, CA
    3 days ago
  • $181.1k - $318.4k

     ...Senior ML Infrastructure Engineer, Proactive The Intelligence Platform...  ...user-centric knowledge and inferences that enable next generation user experiences. We're...  ...technologies like generative AI, graph machine learning,...  ...Experience building on LLMs or other generative... 
    Worldwide
    Relocation

    Apple

    Cupertino, CA
    11 hours ago
  • At Rhoda AI, we're building the full-stack foundation for the next generation of humanoid robots — from high-performance, software-defined...  ...a reality. We're looking for an ML Infrastructure Engineer to help build and operate the inference systems that power our automation... 

    Rhoda AI

    Palo Alto, CA
    3 days ago
  • $199.7k - $254.6k

     ...Team Join Cisco’s CX AI Incubation Team as a Senior AI/ML DevOps Engineer and help productionize...  ...AI services, optimizing inference performance from CPU...  ...end-to-end AI DevOps for LLMs/SLMs, including on-prem...  ...serving architectures for generative AI (multi-tenant,... 
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco

    San Jose, CA
    3 days ago
  • $212k - $386.3k

    Apple Inc. in Cupertino is seeking a Senior Engineer for the Health AI team to design innovative machine learning solutions that impact millions. The ideal candidate will have over 10 years of software development experience, expertise in machine learning, and a strong... 

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $147.4k - $272.1k

     ...Machine Learning Research Engineer, Generative AI Apple is where individual imaginations gather together, committing to...  ...experience in building product features based on ML including generative models or multimodal LLMs Experience with image processing, computer vision... 
    Relocation

    Apple

    Cupertino, CA
    3 days ago
  • $181.1k - $318.4k

     ...machine learning. The role involves innovative research on Multimodal LLMs and AI Agents, collaboration with experts, and the possibility of publishing results. A PhD or MS in Computer Science or Engineering is required, alongside strong expertise in machine learning.... 

    Apple Inc.

    Cupertino, CA
    2 days ago
  •  ...Job Title: Generative AI / Machine Learning Engineer Work Location with ZIP: Sunnyvale, CA 94085 (Hybrid)...  ...optimization, and deployment. Knowledge of LLMs and Generative AI, including fine...  .... Ability to build scalable ML/AI pipelines and APIs, integrating models... 

    eTeam

    Sunnyvale, CA
    2 days ago
  • $169k - $338k

     ...Overview: As a Distinguished, Software Engineer - AI/ML Engineer- Walmart Connect , you...  ...ML strategy and delivery of custom LLMs, multimodal generative models, RAG pipelines, and...  ...omnichannel measurement. Drive inference optimization (quantization,vLLM,TensorRT... 
    Full time
    Temporary work
    Part time
    Local area

    Walmart

    Sunnyvale, CA
    1 day ago
  • $193.3k - $261.5k

     ...performance for AWS's custom ML accelerators. Working...  ...software boundary, our engineers craft high-performance...  ...of what's possible in AI acceleration. The...  ...enabling unparalleled ML inference and training...  ...performance across multiple generations of Neuron hardware Conduct... 
    Internship
    Local area
    Work from home
    Flexible hours

    Amazon

    Cupertino, CA
    2 days ago
  • $184k - $287.5k

     ...building the next generation of driving behavior...  ...evaluation using LLMs, VLMs, and agentic...  ...systems that bridge ML research and...  ...that chain model inference, retrieval, and structured...  ...Science, Computer Engineering, or a related...  ...Knowledge of agentic AI frameworks (... 
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

     ...outstanding Machine Learning Engineers to join our Physical AI teams! As the pioneers of...  ...developing sophisticated generative pipelines to build high-...  ...to the full lifecycle of ML software, including performance...  ...the performance during inference/training. Familiarity with... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $176k - $420k

     ...Expect The Tesla AI Hardware team is at...  ...brilliant engineers and visionaries, the...  ...develops advanced AI inference chips tailored to accelerate...  ...team, the AI/ML Modeling Engineer will...  ...of next-generation tensor compute hardware...  ...Large Language Models (LLMs), transformer architectures... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    16 hours ago
  •  ...job Sonilo is a VC-backed AI music generation company building the next generation...  ...As a Machine Learning Engineer, you will combine hands-on...  ...edge large language models (LLMs) and diffusion models....  ...scalable model architectures and inference pipelines for multimodal generation... 

    Sonilo

    Sunnyvale, CA
    3 days ago
  • $128.7k - $261.3k

     ...accessible mobility. For the AI Kernels & Compilers team,...  ..., and performance engineering so that every cycle on our...  ...into fast, reliable inference across GPUs powering GM's next-generation autonomous and assisted driving...  ..., and effortless for ML engineers across the AV organization... 
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    3 days ago
  •  ...computing to make AI accessible to everyone...  ...like one seamless engine. Developers can...  ...looking for a Senior ML Performance Engineer...  ...for evaluating LLM inference workloads across GPU...  ...-based models and LLMs ~ Hands-on experience...  ...hardware and next-generation LLMs.... 

    Lemurian Labs

    Santa Clara, CA
    3 days ago
  • $181.1k - $318.4k

     ...Vision and Machine Learning Engineer, Creator Studio Work...  ...help shaping the next generation of creative editing...  ...the field of Generative AI. The ideal candidate...  ...validation to efficient inference at scale Design data...  ...particularly multimodal LLMs, Mixture of Experts, PEFT... 
    Relocation

    Apple

    Cupertino, CA
    1 day ago
  •  ...ML Engineer - Creator Studio Apple is where individual imaginations gather together...  ...new and innovative on-device ML and AI tools in the creative space. You...  ...Experience delivering products in Multimodal-LLMs/Foundation models, Generative AI, Machine Learning or related... 

    Apple

    Cupertino, CA
    2 days ago
  • $156k - $387.6k

     ...Machine Learning Engineer - Inference Location: San Jose Team: Technology Employment Type...  ...of our AML team is to push the next-generation AI infrastructure and recommendation platform...  ...-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/RDMA)... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    1 day ago
  • $213k - $263k

     ...Senior ML Engineer, LLM / VLM Distillation Waymo is an autonomous...  ...The mission of the Waymo AI Foundations team is to...  ...learning from demonstration, generative modeling, Bayesian inference, hierarchical learning,...  ...or deploying multi-modal LLMs. Distillation related experience... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    16 hours ago
  • $155.42k - $205.9k

     ...Job Description About the Team: The ML Inference Platform is part of the AV ML...  ...cost-efficient platform that powers GM's AI efforts. We're proud to serve teams developing...  ...are seeking a Senior ML Infrastructure engineer to help build and scale robust platforms... 
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  •  ..., Deep Learning, and Engineering. We tackle complex problems...  ...of-the-art GenAI and ML models to identify...  ...in building the next generation of our compliance...  ...batch and real-time inference pipelines using frameworks...  ...with Generative AI technologies: LLMs, multimodal models, RAG... 

    Walmart

    Sunnyvale, CA
    11 days ago
  • $228.1k - $393.8k

     ...Machine Learning/Generative AI Engineering Manager - Maps Search Query Understanding Apple Maps are...  ...of innovative engineers while driving ML and Generative AI solutions at scale in...  ...search features including Generative AI and LLMs, large-scale machine learning models,... 
    Relocation

    Apple

    Cupertino, CA
    16 hours ago
  • $185.5k - $270k

     ...eligible for relocation assistance. About the Team: The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure...  ...the Role: We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms for ML... 
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    16 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Engineer: LLMs & Generative AI Inference. Be the first to apply!