Staff Machine Learning Platform Engineer, AI Evaluation
Apple
Role Number: 200659247-3337
Summary
Join Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking a staff machine learning platform engineer to lead the architectural design and development of the high availability services and internal tools powering self-service evaluation at scale. You will partner with researchers to operationalize their innovations, transforming complex workflows into intuitive, developer-first platforms. We are looking for builders who thrive in the ambiguity of new initiatives and are passionate about creating scalable infrastructure.
Description
We're building the evaluation platform that will serve all of Apple's generative AI and agent systems. This is early-stage work - some scrappy components exist, much is greenfield and we need a staff engineer who can take it from here to org-wide self-service scale.
This is not a "maintain the infra" role. You'll make consequential decisions about what to build, what to integrate, and what to say no to then ship it in Python with a small team.
Minimum Qualifications
8+ years of software engineering experience with a track record of owning platform-level technical direction.
0-to-1 builder who designs for scale. You've taken something from nothing to production, made deliberate tradeoffs about what to build now vs. later, and can articulate why.
ML depth : You're not building the models, but you can read research code and assess: is this a software problem or an infrastructure problem? Do we need a rewrite or do we need GPUs? You speak the language of research engineers fluently.
AI/Agent evaluation experience that goes beyond traces. You understand the hard problems: non-deterministic outputs, multi-step agent reasoning, judge model reliability, scoring drift. You've built or operated systems that handle these.
Judgment under ambiguity. You know when to build a rapid prototype for quick validation and when to be disciplined (design doc, review, test). You can tell the difference in real time, not just in retrospect.
Communication as a core skill. You write clearly design docs, decision records, platform roadmaps. You speak clearly in meetings with researchers, in rooms with engineering leaders, and balance the needs and priorities of partner teams and contribute to the sequencing of execution.
Python as primary language. Strong with FastAPI, Pydantic, and the ecosystem. Experience with job orchestration frameworks (Temporal.io or similar). Bonus: Go or Rust for compute-hot paths.
Operational ownership. You've owned CI/CD, containerization (Docker/K8s), and monitoring for production services. You don't just ship, you keep things running.
Preferred Qualifications
Experience with distributed compute frameworks (Ray, Dask)
Background in startup or early-stage environments where you wore multiple hats
Familiarity with LLM token economics, rate limiting, and cost management at scale
$171.6k - $258.1k
...Machine Learning Platform Engineer, AI Evaluation Platform (All Levels) Join Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking machine learning platform engineers at multiple levels (Mid-Level to Principal) to architect...SuggestedRelocation$204k - $259k
...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous driving technology... ...to a range of vehicle platforms and product use cases. The... ...The mission of the Waymo AI Foundations team is to develop... ...will report to a Senior Staff Software Engineer. You...SuggestedFull timeTemporary workRemote work$139.5k - $258.1k
...Evaluation & Insights Machine Learning Engineer Imagine what you could do here. At Apple, great new ideas have a... ...So are we! Join our Human-Centered AI team for Apple Products. In this role... ...to drive improvements across our platforms. We are looking for an...SuggestedRelocation- ...VTI Aerospace builds AI-powered perception and... ...WA, we are a team of engineers and technologists... ...Senior Software Engineer Evaluation, you will design and... ...will work closely with machine learning and data engineering... ...Grafana or similar platforms Why You'll Love...SuggestedFull time
$171.6k - $258.1k
...ML Research Engineer, AI Evaluation Platform AI systems are only as trustworthy as the methods used to evaluate them. At Apple, where AI powers... ...to use at scale—that's where your skills in both machine learning and engineering come into play. On the research side, you...SuggestedLocal areaRelocation- Apple Inc. in Seattle is seeking a Software Engineer to build and ship features for its generative AI evaluation platform. In this hands-on role, you will collaborate closely with research engineers and integrate ML research into reliable services. Strong Python skills...
$60 - $70 per hour
...Overview: We are seeking a Machine Learning Engineer to join a high-impact team focused on advancing LLM evaluation, NLP, and AI-driven automation. This role centers on designing scalable evaluation frameworks, optimizing prompt strategies, and building systems that...Contract workTemporary workRemote work3 days per week- ...Foundation Model Inference Team, within AI, Search & Knowledge Platform Technologies organization. Our team... ...and use cases. Mentor and guide engineers in the organization. Minimum... ...Science, Artificial Intelligence, Machine Learning, Information Retrieval, Data Science...
- ...Weekly Hours: 40 Role Number: 200657970-3337 Summary The Productivity and Machine Learning Evaluation team ensures the quality of AI-powered features across a suite of productivity and creative applications; including Creator Studio, used by hundreds of millions...Shift work
- ...200657984-3337 Summary The Productivity and Machine Learning Evaluation team ensures the quality of AI-powered features across a suite of productivity and... ...useful AI outputs Experience partnering with engineering or data teams to define data collection...
$120.3k - $210.1k
...Applied ML Engineer – AI/ML Evaluation & Simulation We're building the next generation of AI evaluation systems — and we're looking for... ...someone with a strong foundation in software engineering and machine learning, and an eagerness to grow by building tools and systems...InternshipRelocation$139.5k - $258.1k
ML Engineer - Automated Evaluation and Adversarial Design Seattle, Washington, United States Software and Services The Productivity and Machine Learning Evaluation team ensures the quality of AI-powered features across a suite of productivity and creative applications...RelocationShift work$139.5k - $258.1k
Apple Inc. is seeking an ML Engineer for its Seattle location to build and scale automated evaluation systems for AI features. The ideal candidate will have a Bachelor's degree in a relevant field and over 4 years of experience in ML evaluation. Responsibilities include...$139.5k - $258.1k
Apple Inc. in Seattle, Washington, seeks an ML Engineer for the Productivity and Machine Learning Evaluation team. This role involves defining quality metrics and analyzing evaluation results to inform decisions on AI features across productivity applications. Candidates...$141.9k - $190.3k
...Senior Machine Learning Engineer, Ad Platforms Technology is at the heart of Disney's past, present, and future... ...on specialization in generative AI applications, including generative mixed... ...modeling applications. Create, evaluate, improve, optimize technologies Drive...$171.6k - $302.2k
...Senior/Staff Applied ML Engineer – AI/ML Evaluation & Simulation We're building the next generation of AI evaluation... ...and a solid understanding of machine learning. In this hands-on role, you'll... ...contribute to building scalable platforms for simulation and behavior analysis...Relocation$139.5k - $258.1k
...Machine Learning Engineer, Apple Search & Knowledge Platforms The Apple Knowledge Quality Team is building the next-generation... ...evolution through measurement, evaluation, and analysis of the user... ...experience building production ML/AI systems, OR PhD degree in a related...Relocation- ...79-3337 Summary The AI, Search & Knowledge Platforms team builds amazing products... ...be doing large scale machine learning and deep learning research... ...develop, fine-tune, and evaluate domain specific Large Language... ...machine learning engineers and researchers having strong...
$168.1k - $227.4k
...scale. We build an AI-driven analytics platform that turns 50+ PB of... ...detection and causal policy evaluation to LLM-powered... ...alongside talented engineers and product leaders... ...techniques in ML, deep learning, and GenAI to... ...- Experience with Machine Learning and Large Language...InternshipFlexible hours$212k - $318.4k
...Senior Machine Learning Platform Engineer - AI, Search & Knowledge Work Locations (2) Submit Resume Join us in building the AI, Search & Knowledge... ...move between datasets, training, model serving, and evaluation. We're looking for a versatile engineer who thrives at...Relocation- ...company in Seattle is seeking an early-career Applied ML Engineer to develop AI evaluation systems. This role involves collaboration with... ...strong programming skills, and a basic understanding of machine learning principles. Opportunities for growth and competitive benefits...
$171.6k - $302.2k
...Staff Machine Learning Engineer, Search & Knowledge Platform Apple is where individual imaginations come together, committing... ...with outstanding Search and AI engineers on large scale machine... ...delivering tooling and frameworks to evaluate individual components and end-to...Work experience placementRelocation$171.6k - $230.1k
...Lead Machine Learning Engineer, Ad Platforms Disney Entertainment and ESPN Product & Technology is a global... ...business. Our mission is to advance AI and machine learning capabilities... ...Llama, etc. Strong grasp of LLM evaluation methodologies, experience with RAG architectures...- ...leading technology company in Seattle is seeking a Senior or Staff-level Applied ML Engineer to bridge ML, software, and product development. In this... ...simulate complex interactions and develop tools for AI evaluation. The ideal candidate has at least 8 years of experience...
- ...Summary We're building the evaluation platform that will serve all of Apple's generative AI and agent systems. Evaluating non... ...here. This is a hands-on engineering role with a lot of autonomy. You... ...into reliable services. You'll learn their world quickly and translate...
$135k - $210k
...food supply by building the AI farmer that automates our... ...this data lives in our cloud platform, FruitScope OS, that we've... ...We are looking for a Machine Learning Engineer to build creative, practical... ...infrastructure for model training, evaluation, and inference, both in the...Full timeWork at officeWeekend work- ...unlocking scalable training and evaluation for our autonomous system's... ...stack. As a Generative AI Engineer, you will develop and train... ...multi-sensor fusion based deep learning models to understand obstacles... ...or PhD in Computer Science, Machine Learning, or related...Temporary workRelocation package
$130k - $300k
GEICO . For more information, please .Senior Staff Machine Learning Engineer, AI Agent Platform page is loaded## Senior Staff Machine Learning Engineer, AI... ...agent orchestration, AI agent lifecycle management, evaluation frameworks, skill registries and marketplace, and workflow...Hourly payWork experience placement- ...service. We are looking for a Senior Machine Learning Engineer to join our team to help find rare events... ...training deep learning models, evaluation, and optimization. Strong programming... ...We may use artificial intelligence (AI) tools to support parts of the hiring...Temporary workRelocation package
$125k - $180k
...Our extensive learning programs and mentorship... ...the Product and Engineering team at PitchBook,... ...with us! As a Machine Learning Engineer (MLE) on the AI & ML (Insights) team... ...on the PitchBook Platform. You will be deeply... ..., monitoring, evaluation, and compliance. Help...Work at officeRemote workVisa sponsorship
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Machine Learning Platform Engineer, AI Evaluation. Be the first to apply!
- staff security engineer Seattle, WA
- assistant engineer Seattle, WA
- engineering aide Seattle, WA
- assistant chief engineer Seattle, WA
- staff engineer Seattle, WA
- technology administrator Seattle, WA
- senior staff systems engineer Seattle, WA
- staff data engineer Seattle, WA
- software engineer staff Seattle, WA
- assistant engineering manager Seattle, WA



