Machine Learning Engineer - Large Language Models & Generative AI Inference
Apple Inc.
$147.4k - $272.1k
Cupertino, California, United States
Machine Learning and AI

The Intelligence Platform team empowers clients across Apple’s operating systems with a high-quality, user‑centric search and data platform, and with the primary inference platform that enables next-generation user experiences for Apple Intelligence. We are in search of an accomplished and driven Machine Learning Engineer who has a robust understanding of Large Language Models, Generative AI, and high-performance systems computing. Your primary role will involve working with our cross-functional teams that build foundation models, as well as with our client teams that want to run inference using these models, and continuing the development of our inference stack. By contributing to our team, you’ll play an integral part in developing Siri, Photos, Music, and various other services, leaving a significant footprint on the evolution of our AI platforms. Join us in our exciting journey as we push the frontiers of Machine Learning and Generative AI. Your expertise and contributions will be invaluable in shaping the future of our Intelligence Platform.

Description
As a Machine Learning Engineer on the Apple Intelligence Platform Inference Team, you’ll join a phenomenal team of top‑performing engineers and will be entrusted with a range of responsibilities. Your tasks will include:
- Leading the exploration and application of Large Language Models and Generative AI, venturing into new areas within these fields.
- Translating the latest research into high‑performing systems and a model serving stack that can be practically applied to enhance user experiences.
- Contributing to setting the team’s strategic direction, cultivating an environment that encourages innovation and professional growth.
- Collaborating with various teams to develop and implement the evolving requirements of our clients on the GenAI inference stack, ensuring performance optimization and alignment with broader business goals.
Minimum Qualifications
- In-depth experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs) and Generative AI.
- Published research in the field of Machine Learning or AI is highly desirable, indicating an ability to not only understand but also contribute to cutting‑edge research.
- Proven ability to comprehend, interpret, and apply cutting‑edge research in tangible applications.
- Proven problem‑solving and leadership abilities, with the capacity to steer the team’s research and practical applications in a collaborative and fast‑paced environment.

Preferred Qualifications
- An advanced degree (Master’s or Ph.D.) in Computer Science, Artificial Intelligence, Machine Learning, or a related field is required.
- Ongoing professional development in Machine Learning and Artificial Intelligence domains is expected.

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunities employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Apple accepts applications to this posting on an ongoing basis.

