Machine Learning Engineer - Speech & Multimodal Language Modeling
$147.4k - $272.1kApple Oakbrook
Machine Learning Engineer - Speech & Multimodal Language Modeling
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or experience we deliver is the result of us making each other's ideas stronger. The diversity of our people and their thinking inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you'll do more than join something — you'll add something.
The Special Projects team at Apple is developing novel user-facing features that leverage the multimodal capabilities of state-of-the-art foundation language models. We are looking for a highly skilled Machine Learning Engineer to build and evaluate these experiences, with a specific focus on Multimodal and Speech Language Models. A successful candidate is experienced in evaluating complex foundation model-driven systems end-to-end, translating subjective product requirements into objective criteria, has strong statistical analysis skills, and has worked with Speech Language Models.
Responsibilities
- Design and implement processes for evaluating and improving multi-modal generative models to meet end-to-end product requirements.
- Work with Data Engineers to process large scale speech audio data for foundation model training.
- Fine-tune Large Language Models (LLMs) and Speech Language Models (SpeechLMs) to improve performance for specific use cases.
- Work closely with other ML Researchers to define evaluation criteria and methodology to systematically evaluate foundation models.
- Experimental design for testing models/systems under test.
- Conduct robust statistical analysis to identify model deficiencies and failure states.
Minimum Qualifications
- Master's degree in Computer Science or Machine Learning
- 2+ years of hands-on experience building and evaluating generative AI models
- Proficiency in Python and ML frameworks (Pytorch or Tensorflow)
Preferred Qualifications
- PhD in Computer Science, Machine Learning, Statistics, or other STEM field
- 5+ years of hands-on experience with SpeechLMs or LLMs
- Experience with large-scale audio data processing on distributed systems
- Experience with prompt evaluation and optimization for generative AI models
- Proficiency in training, fine-tuning, and evaluation of foundation models and frameworks
- A track record of publications or technical presentations in Machine Learning journals or conferences
- Excellent communication skills and cross-functional collaboration
Pay & Benefits
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location. Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.
$181.1k - $318.4k
...Sr. Machine Learning Engineer, Siri Speech Are you excited about Generative AI and Large Language Models? Do you want to work on cutting-edge generative technologies that power... ...dialog generation, speech recognition, and multimodal interaction. We are looking for an...LanguageWorldwideRelocation$147.4k - $220.9k
...Machine Learning Engineer, Siri Speech Apple is where individual imaginations gather... ...large-scale systems, spoken language, big data, and artificial... ...production-quality models that power natural voice... ...speaker recognition and multimodal understanding, and collaborate...LanguageRelocation$181.1k - $318.4k
...AIML - Machine Learning Engineer, Foundation Models Apple is revolutionizing artificial intelligence by developing... ...models across text, image, speech, and multi-modal domains Collaborate... ...with foundation models and large language models Background in multi-...LanguageRelocation$150k - $387.6k
...Machine Learning Engineer (CV/NLP/Multimodal/LLM) -E-commerce Governance Location: San Jose Employment... ...alignment, and other work for large models in the e-commerce domain, aiming... ...large-scale multimodal (vision, speech, natural language, etc.) algorithms and their...LanguageTemporary workLocal areaOverseas$181.1k - $318.4k
...Sr. Machine Learning Engineer, ASR Infrastructure and Tools, Siri Speech Want to join the team pushing the boundaries of AI and... ...the best speech recognition models, we need to use the latest technology... ...speech recognition, natural language processing, and dialogue...LanguageWorldwideRelocation$181.1k - $318.4k
...Sr. Machine Learning Research Engineer, Siri Speech We are a group of engineers/researchers responsible for advancing... ...infrastructure, datasets, and models that empower Siri with powerful general capabilities across natural language understanding, dialog generation,...LanguageRelocation$158.4k - $237.6k
...Staff Software Engineer Join the Qualcomm... ...integrate machine learning into their products... ...machine learning models on edge and mobile... ...quantizing large language models (LLMs) and... ...vision, audio, and multimodal networks for deployment... ...models, speech, and multimodal architectures...LanguageWork experience placementImmediate startWork from home$147.4k - $272.1k
...Applied Machine Learning Research Engineer - Multimodal for Human Understanding We're starting to see the incredible potential of multimodal foundation and large language models, and many applications in the computer vision and machine learning domain that previously...LanguageWorldwideRelocation$156k - $387.6k
...recommendation, weakly-supervised learning, few-shot... ...multilingual learning, multimodal pretraining, and more.... ...breakthroughs. 3. Drive engineering deployment and implementation, ensuring model stability, scalability... ...expertise in large language models (LLMs) and familiarity...LanguageTemporary workLocal area$213k - $263k
...of massive foundation models directly onto the... ...onboard multi-task, multimodal perception model designed... ...functional collaboration to engineer robust, high-... ...in Computer Vision, Machine Learning, Robotics, or a related... ...Foundation Models or Vision-Language Models (VLMs)....LanguageFull timeRemote work$147.4k - $272.1k
...Machine Learning Engineer, Proactive - Large Language Models & Generative AI Inference The Intelligence Platform team empowers clients across Apple's operating systems with a high quality user-centric search and data platform, and the primary inference platform that...LanguageRelocation$150k - $387.6k
...Machine Learning Engineer(NLP/CV/Multimodal), TikTok E-commerce Knowledge Graph Location: San Jose Employment... ...: machine learning, NLP(Natural Language Processing), multimodal, and computer... ...algorithms, understand basic network model structure (DNN/LSTM/CNN, etc.) and...LanguageTemporary workWork experience placementLocal area- ...first post-transformer model that adapts and thinks... ...data processing engine on the market, Pathway... ...to apply Attention to speech and worked with Nobel... ...strong track record in machine learning models research .... ...with a track record in Language Models and/or RL (candidates...LanguagePermanent employmentFull timeContract workImmediate startRemote workFlexible hours
$181.1k - $318.4k
...Machine Learning Engineer, Foundation Model Services Work Locations (2) Submit Resume Do you feel you think differently, you are eager... ...chance to work on optimizing billions of parameter language and vision and speech models using state of the art technologies and...LanguageRelocation$215.28k - $364.32k
...Staff Machine Learning Engineer – Autonomous Driving Model Quantization & Deployment Santa Clara, CA XPENG is a leading... .... The challenge of Vision-Language-Action (VLA) models and... ...optimization roadmap for large-scale multimodal models (Transformers, VLMs)....LanguageFull time$147.4k - $272.1k
...Machine Learning Research Engineer, Generative AI Apple is where individual imaginations... ...in computer vision, speech recognition, deep... ...ML including generative models or multimodal LLMs Experience with... ...recognition, or natural language processing Proven track...LanguageRelocation$147.4k - $272.1k
...Machine Learning Systems Engineer, Siri Agent Modeling The Siri organization is looking for passionate Machine Learning Systems Engineers to join us in developing... ...models Proficiency in a compiled programming language (e.g. Swift, C/C++, Java) Pay & Benefits At...LanguageRelocation$147.4k - $272.1k
...Machine Learning Research Engineer, Siri Comprehension & Planning, Siri Agent Modeling The future of AI is on-device. On the Siri team, we're solving the defining challenge... ...Qualifications Knowledge of Natural Language Processing Experience with Small...LanguageWork from homeRelocation$26 - $28 per hour
...Data Labeling Analysts, supporting speech and voice AI systems. This is a... ...working with audio, speech, and language data - helping ensure models are trained on accurate, well-structured... ...solutions for NLP-enabled machine learning by blending technology and human intelligence...LanguageFull timeWork experience placementRemote workVisa sponsorship$147.4k - $272.1k
...Machine Learning Engineer: Multimodal Sensor Fusion At Apple, individual creativity converges around shared values that drive innovation. Our products... ...will drive the development of multimodal deep learning models optimized for edge deployment, leveraging sensor fusion...Relocation$244.8k
...community. Our team develops advanced machine learning solutions to moderate content... ...also a frontier domain for Large Language Models (LLMs) and multimodal foundation models. By joining our... ...content safety vertical - Lead ML engineers to deliver high-quality, production...LanguageTemporary workLocal area- ...the Ai Data Platform Applied Machine Learning team to pioneer enterprise... ...help design, build, and evolve models, tools and applications that... ..., efficient inference, and multimodal integration, while enabling... ...AI, computer vision, natural language processing, or general...Language
- ...modern creators. Role Description As a Machine Learning Engineer, you will combine hands-on engineering with... ...reasoning systems, tool orchestration, and multimodal integrations using cutting-edge large language models (LLMs) and diffusion models. What You'...Language
$181.1k - $318.4k
...On-Device Machine Learning Engineer We're starting to see the incredible potential of multimodal foundation and large language models, and many applications in the computer vision and machine learning domain that previously appeared infeasible are now within reach....LanguageRelocation- ...Machine Learning Engineer Location: Cupertino, CA ABOUT THIS FEATURED OPPORTUNITY... ...team to build intelligent, multimodal agentic systems focused on troubleshooting... ...experience working with Vision Language Models (VLMs) and multimodal AI systems that...LanguageLocal areaFlexible hours
$230k - $265k
...veteran scientists and engineers. As a Senior Machine Learning Engineer, you’ll bring... ...summarization, chat, and speech understanding across... ...inference strategies for large language and speech models using PyTorch and/or... ...in ASR, TTS, multimodal, or modern LLM/NLP systems...LanguagePermanent employment$172.2k - $258.4k
...The opportunity We are looking for a Staff Machine Learning Engineer to join our Vector Core Modeling team. In this role, you will design and build scalable... ...professional verbal and written exchanges in this language since the performance of the duties related to this...LanguageWork at officeWorldwideRelocation package$181.1k - $272.1k
...Machine Learning Engineer - LLM Imagine what you could do here. At Apple, new ideas have a way of becoming... ...algorithms, software engineering, and data mining models with an emphasis on large language models (LLM) or large multimodal models (LMM). ~ Masters in Machine...LanguageRelocation$181.1k - $318.4k
...Senior Computer Vision and Machine Learning Engineer, Creator Studio Work Locations... ...of machine learning models, from large distributed training... ...techniques, particularly multimodal LLMs, Mixture of Experts,... ...programming skills in high-level languages like Python and one of the...LanguageRelocation$181.1k - $318.4k
...AIML - Sr Machine Learning Engineer, Responsible AI Work Locations Submit Resume Would you... ...learning, particularly focused on large language models for text generation, diffusion... ...generation, and mixed model systems for multimodal applications. As a member of Apple's...LanguageRelocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer - Speech & Multimodal Language Modeling. Be the first to apply!
- machine learning engineer Cupertino, CA
- junior machine learning research engineer Cupertino, CA
- machine learning software engineer Cupertino, CA
- senior ml engineer Cupertino, CA
- computer vision machine learning engineer Cupertino, CA
- data scientist machine learning engineer Cupertino, CA
- language analyst Cupertino, CA
- speech language Cupertino, CA
- natural language processing Cupertino, CA
- language manager Cupertino, CA

