Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer - Large Language Models & Generative AI Inference

$147.4k - $272.1k

Apple Oakbrook

Cupertino, California, United States Machine Learning and AI The Intelligence Platform team empowers clients across Apple’s operating systems with a high quality user‑centric search and data platform, and the primary inference platform that enable next generation user experiences for Apple Intelligence. We are in search of an accomplished and driven Machine Learning Engineer who has a robust understanding of Large Language Models, Generative AI and high-performance systems computing. Your primary role will involve working with our cross-functional teams that build foundation models, as well as our client teams that want to run inference using these models and continue the development of our inference stack. By contributing to our team, you’ll play an integral part in developing Siri, Photos, Music, and various other services, leaving a significant footprint on the evolution of our AI platforms. Join us in our exciting journey as we push the frontiers of Machine Learning and Generative AI. Your expertise and contributions will be invaluable in shaping the future of our Intelligence Platform. Description As a Machine Learning Engineer on the Apple Intelligence Platform Inference Team, you’ll join a phenomenal team of top‑performing engineers and will be entrusted with a range of responsibilities. Your tasks will include: Leading the exploration and application of Large Language Models and Generative AI, venturing into new areas within these fields. Translating the latest research into high‑performing systems and a model serving stack that can be practically applied to enhance user experiences. Contribute to setting the team’s strategic direction, cultivating an environment that encourages innovation and professional growth. Collaborating with various teams to develop and implement evolving requirements of our clients on the GenAI inference stack, ensuring performance optimization and alignment with broader business goals. Minimum Qualifications In-depth experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs) and Generative AI. Published research in the field of Machine Learning or AI is highly desirable, indicating an ability to not only understand but also contribute to cutting‑edge research. Proven ability to comprehend, interpret, and apply cutting‑edge research into tangible applications. Proven problem‑solving and leadership abilities, with the capacity to steer the team’s research and practical applications in a collaborative and fast‑paced environment. Preferred Qualifications An advanced degree (Master’s or Ph.D.) in Computer Science, Artificial Intelligence, Machine Learning, or a related field is required. Ongoing professional development in Machine Learning and Artificial Intelligence domains is expected. At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location. Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits. Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program. Apple is an equal opportunities employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant. Apple accepts applications to this posting on an ongoing basis. #J-18808-Ljbffr Apple

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer - Large Language Models & Generative AI Inference in Cupertino, CA vacancy
  • $147.4k - $272.1k

    A leading technology company is searching for a Machine Learning Engineer in Cupertino, California. The role involves working with Large Language Models and Generative AI to enhance user experiences across Apple's platforms. Candidates should have extensive experience... 
    Language

    Apple

    Cupertino, CA
    9 hours ago
  • $147.4k - $272.1k

    Machine Learning Engineer - Speech & Multimodal Language Modeling Cupertino, California, United States | Machine Learning and AI Apple is where individual imaginations...  ...improving multi‑modal generative models to meet end‑...  ...Engineers to process large‑scale speech audio... 
    Language
    Relocation

    Apple

    Cupertino, CA
    3 days ago
  • $278.1k - $347.6k

     ...building the next generation of AI-driven game experiences...  ...running generative models on-device, right...  .... As our Principal Engineer for On-Device AI Inference & Systems, you will...  ...Proficiency in the core languages of a browser-native...  ...regression CI, and large device-farm... 
    Language
    Work at office
    Worldwide
    Relocation package

    Unity

    Mountain View, CA
    4 days ago
  • $112.7k - $169.1k

     ...opportunity Unity's Vector AI team builds the machine learning systems that...  ...'s leading game engine. Recommendation...  ...building the next generation of these systems....  ...frontier has shifted — large language models, reinforcement...  ...experiments using causal inference, A/B testing, and... 
    Language
    Internship
    Work at office
    Worldwide
    Relocation package
    Shift work

    Jobr

    Mountain View, CA
    5 days ago
  • $139.5k - $258.1k

    Machine Learning Research Engineer , Text Generation, Input Experience Seattle, Washington,...  ...Machine Learning and AI Apple is where...  ...interaction with generative models for text generation...  ...the Natural-Language framework. If you want...  ...optimizing large diffusion models or... 
    Language
    Relocation

    Apple

    Cupertino, CA
    2 days ago
  • At Rhoda AI, we’re building the next generation of generalist intelligent...  ...world models that control our...  ...looking for an Inference Optimization MLE...  ...performance out of large multimodal...  ...vision, video, language, proprioception...  ...with research engineers to translate model... 
    Language

    Rhoda AI

    Mountain View, CA
    1 day ago
  •  ...AI Models Team Member Splunk, a Cisco...  ..., multi-modal machine-generated data —...  ...Cisco's global engineering capabilities....  ...deployment for large-scale foundation...  ...areas: large language modeling for both...  ...unstructured data, deep learning-based time...  ..., and inference efficiency to... 
    Language
    Flexible hours

    Webex Events (formerly Socio)

    Sunnyvale, CA
    1 day ago
  • $126.8k - $220.9k

    Machine Learning Engineer, Apple Store Online Cupertino...  ...the next generation of algorithms...  ...including developing models for product...  ...), Generative AI and optimizing...  ...oriented programming languages such as Python...  ...pipelines, large‑scale machine...  ..., training, inference, deployment, monitoring... 
    Language
    Relocation package

    Apple

    Cupertino, CA
    4 days ago
  • $38 per hour

     ...hiring Creative Writing Generative AI Analysts in the...  ...efforts of multimedia and language data labeling and...  ...improve generative AI models. Project Details Job...  ...evaluation workflows for machine learning systems Identify...  ...generative AI systems, large language models, RLHF... 
    Language
    Full time
    Remote work

    Sonara Inc.

    Santa Clara, CA
    4 days ago
  •  ...and deployment of advanced AI agents and agentic systems....  ...capabilities. Develop and integrate large language models (LLMs) and other state‑of‑...  ...Knowledge and passion in machine learning algorithms, Gen AI, LLMs,...  ...(QLORA, DPO) and inference optimization (vLLM, TensorRT... 
    Language
    Work experience placement

    Nutanix

    Santa Clara, CA
    3 days ago
  • $175k - $296k

     ...advanced Internet, AI and autonomous...  ...a full-time Machine Learning Engineer, with deep knowledge...  ...training very large foundation model and...  ...model training/inference. Our mission...  ...enable the next-generation E2E solution of...  ...scale vision or language models Previous... 
    Language
    Full time

    XPeng Motors

    Santa Clara, CA
    more than 2 months ago
  • $199k - $278.5k

     ...Senior ML/Gen AI Engineer Introduction to...  ...by data and machine learning provides secure...  ...building a next‑generation B2B...  ...and scale ML models, from experimentation...  ...build, and own large‑scale, distributed...  ..., deployment, inference, and...  ...skills in modern languages such as Python... 
    Language

    PowerToFly

    San Jose, CA
    5 days ago
  • $133.95k - $245k

     ...inherited by younger generations in the next two...  ...Senior Machine Learning Engineer to help shape the...  ...cases, develop AI inference services for product...  ...training or finetune models for product use...  ..., and version large datasets....  ...fine‑tuning large language models using techniques... 
    Language
    Work at office
    Flexible hours
    Shift work
    3 days per week

    Robinhood

    Menlo Park, CA
    5 days ago
  • $133.95k - $245k

     ...inherited by younger generations in the next two...  ...Senior Machine Learning Engineer to help shape the...  ...cases, developing AI inference services for product...  ...or finetune models for product use...  ...validate, and version large datasets....  ...fine‑tuning large language models using techniques... 
    Language
    Work at office
    Remote work
    Flexible hours
    Shift work
    3 days per week

    Unchain Data

    Menlo Park, CA
    5 days ago
  • $181.1k - $318.4k

     ...Sr. Machine Learning Engineer, Siri Speech Join the team redefining...  ...most widely used AI assistants, powered by our next-generation of Apple...  ...the art in natural language processing, speech and audio modeling. Design and build...  ..., training/tuning large generative models... 
    Language
    Worldwide
    Relocation

    Apple

    Cupertino, CA
    3 days ago
  • $151.8k - $265.35k

     ...Adobe is seeking Senior Machine Learning Engineers who excel at turning...  ...research to advance Adobe’s AI capabilities in Natural Language, Image/Video generation, editing, and...  ...Language Processing, and large language models. Familiarity with inference optimization, performance... 
    Language

    Dormont Manufacturing Company

    San Jose, CA
    1 day ago
  •  ...seeking a highly skilled Machine Learning Engineer to design and build...  ...reliance on large language models. The role focuses on...  ...focus on sub-second inference, CPU-based...  ...reliance on hosted AI services. Design...  ...accuracy and evidence generation. Collaborate with... 
    Language
    Local area

    Sparktek

    San Jose, CA
    4 days ago
  • $150k

     ...of Foundation Models We are a dedicated...  ...the next generation of AI builders, and...  ..., and engineers, tackling the...  ...computing in deep learning, driving impactful...  ...Role As a Machine Learning Engineer...  ...of large-scale machine...  ...infrastructure, Natural Language Processing or... 
    Language
    Worldwide
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    5 days ago
  • $147.4k - $272.1k

     ...Machine Learning Compiler Engineer At Apple, we're on the cutting...  ...boundaries of AI and hardware optimization...  ...deep learning inference with a focus on...  ...or programming language design,...  ...and back-end code generation High-level proficiency...  ...working with large, complex... 
    Language
    Relocation

    Apple

    Sunnyvale, CA
    3 days ago
  • $184.5k

     ...flexible work model (with some pretty...  ...Senior ML/Gen AI Engineer...  ...powered by data and machine learning provides secure...  ...building the next-generation, scalable B2B...  ...build, and own large-scale, distributed...  ..., deployment, inference, and...  ...skills in modern languages such as Python... 
    Language
    Local area
    Flexible hours

    Expedia Group

    San Jose, CA
    11 days ago
  • $196k - $221k

     ...deploy cutting-edge AI technology to...  ...scientists and engineers. As a Machine Learning Engineer, you'll...  ...build, and evolve large-scale SID / ASR...  ...-training, and inference strategies for large language and speech models using PyTorch...  ...intelligence, Otter generates real-time... 
    Language
    Permanent employment

    Otter.ai

    Mountain View, CA
    4 days ago
  •  ...California is seeking a highly skilled Machine Learning Engineer to lead research and development...  ...on cutting-edge projects involving large language models (LLMs), and enhance search capabilities...  ...with a passion for innovative AI technologies. The role promises growth... 
    Language

    Apple

    Cupertino, CA
    3 days ago
  •  ...and monetize large audiences,...  ...latency. We use Machine Learning,...  ...Reinforcement Learning, AI, Control and...  ..., and Inference Platform that...  ...for seasoned engineers with a background...  ...statistical languages. What you...  ...the entire model lifecycle -...  ...and generate features that... 
    Language
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    2 days ago
  •  ...and monetize large audiences,...  ...latency. We use Machine Learning,...  ...Reinforcement Learning, AI, Control and...  ..., and Inference Platform that...  ...for seasoned engineers with a background...  ...statistical languages. What you’ll...  ...the entire model lifecycle -...  ...and generate features that... 
    Language
    Work at office
    Local area
    Remote work
    Monday to Thursday
    Flexible hours

    Roku

    San Jose, CA
    3 days ago
  • $195k - $298k

     ...About the Team The ML Inference Platform is part of the AI Compute Platforms...  ...of-the‑art (SOTA) machine learning models for experimental...  ...ML Infrastructure engineer to help build and...  ...techniques. Lead large‑scale technical initiatives...  ...relevant coding languages. Expertise in ML... 
    Language
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $150k

     ...Institute of Foundation Models We are a...  ...nurture the next generation of AI builders, and drive...  ...scientists, and engineers, tackling the most...  ...computing in deep learning, driving...  ...models to unlock machine intelligence beyond...  ...distributed systems for large‑scale data... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    5 days ago
  • $212k - $386.3k

     ...California, United States Machine Learning and AI The Health AI team...  ..., software engineers, and machine...  ...deep learning, and generative AI to design, implement...  ...machine learning models and systems. You will...  ...models, natural language processing (NLP), and large language models (... 
    Language
    Relocation

    Apple

    Cupertino, CA
    2 days ago
  • $212.3k - $275.8k

     ...Team The Cisco AI Software &...  ...and delivers Generative AI based solutions...  ...ML Operations Engineer to join our global...  ...of scalable machine learning systems . You...  ...and inference in multiple public...  ...frameworks Monitor models in production...  ...Experience deploying large language models (LLMs)... 
    Language
    Full time
    Temporary work
    Local area
    Flexible hours

    Cisco Systems, Inc.

    Milpitas, CA
    4 days ago
  • $195k - $298k

     ...About the Team The ML Inference Platform is part of the AI Compute Platforms...  ...state‑of‑the‑art machine learning models for experimental...  ...ML Infrastructure Engineer to build and scale...  ...techniques. Lead large‑scale technical initiatives...  ...relevant coding languages: Go, Python, C++.... 
    Language
    Local area
    Relocation package
    Flexible hours

    Israelvcforum

    Sunnyvale, CA
    1 day ago
  • $38 per hour

     ...for highly detail-oriented Generative AI Analysts to join our team onsite...  ...passionate about AI, language, data quality, and emerging...  ...quality issues in datasets and model outputs Support data categorization...  ...review workflows for machine learning systems Assist in the creation... 
    Language
    Full time

    Welo Global

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer - Large Language Models & Generative AI Inference. Be the first to apply!