Sr Machine Learning Engineer, Tech Lead Autograder Systems, Evaluation
Apple Oakbrook
Role Number: 200628183-0836
Summary
We are looking for a Senior MLE Tech Lead to join a centralized evaluation organization and define the next generation of autograder quality across 20+ of Apple's most visible generative AI features. You will own the end-to-end technical vision for how we evaluate model outputs at scale — pioneering state-of-the-art methods, raising the technical bar, and leading a team of talented MLEs to build a robust autograder training and hillclimbing system from the ground up.
This is a high-impact, hands-on leadership role at the intersection of model evaluation, data quality, and ML systems engineering. You will work closely with model developers, data teams, and product partners to ensure our autograders are fast, accurate, and continuously improving — directly shaping the quality of AI experiences used by hundreds of millions of people.
Description
In this role you will focus on:
Technical Leadership
- Define and drive the technical roadmap for autograder quality — researching and introducing novel methods such as reward modeling, LLM-as-judge, preference learning, and calibration techniques to measurably improve evaluation accuracy.
- Architect and lead the build-out of a scalable autograder training pipeline encompassing data curation, model fine-tuning, evaluation harnesses, and versioning.
- Design and own the hillclimbing system that iteratively improves autograder performance through systematic prompt and model optimization loops.
- Establish quality benchmarks, confidence metrics, and failure analysis frameworks that enable the team to track, trust, and act on autograder outputs.
People & Collaboration
- Mentor and technically guide a team of MLEs through design reviews, modeling standards, and hands-on problem-solving — fostering a culture of rigor and continuous learning.
- Partner with data annotation teams to define labeling guidelines that feed autograder training.
- Collaborate with feature engineers to align autograder signals with broader training and product objectives.
- Translate complex technical trade-offs into clear narratives for engineering, product, and leadership audiences.
Minimum Qualifications
- Master's or PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related field.
- 5+ years of industry experience in machine learning, with a strong focus on LLM or VLM systems.
- Deep expertise in prompt-tuning and fine-tuning techniques (SFT, RLHF, DPO, or equivalent), with proven experience in model calibration and uncertainty estimation.
- Familiarity with data flywheel design: leveraging model outputs to continuously improve future training data.
- Proficiency in Python and ML frameworks (PyTorch preferred).
Preferred Qualifications
- Strong ML systems instincts: you care deeply about data quality, reproducibility, latency, and scale.
- Background in human-in-the-loop annotation pipelines and inter-annotator agreement analysis.
- Prior experience on an evaluation infrastructure or model quality team.


