Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model

$181.1k - $318.4k

Apple Oakbrook

Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model

Work Locations (2) Submit Resume

Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It's the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you'll do more than join something — you'll add something!

Description

As a Senior/Staff Engineer on the Foundation Model Compute Infrastructure team, you will lead the design and development of scheduling and orchestration systems for large-scale TPU workloads across multi-region clusters. You will work on distributed systems that manage thousands of accelerators and enable reliable, efficient execution of large-scale training and inference jobs. This role spans scheduling algorithms, cluster lifecycle management, workload orchestration, reliability engineering, and performance optimization.

Responsibilities
  • Design and evolve large-scale scheduling systems for TPU-based training and inference workloads across multi-region clusters
  • Build topology-aware, quota-aware, and fault-tolerant schedulers to improve utilization, fairness, startup latency, and reliability
  • Develop orchestration systems for distributed ML workloads running on Kubernetes and accelerator infrastructure
  • Improve cluster efficiency and operational scalability through automation of provisioning, resource management, quota workflows, and recovery handling
  • Collaborate closely with foundation model teams to support advanced distributed training and inference frameworks such as Pathways, Ray, and JAX-based workloads
  • Mentor engineers and influence architectural direction across Apple's distributed AI compute platform
Minimum Qualifications
  • 7+ years of industry experience building large-scale distributed systems or cloud infrastructure
  • Strong programming skills in Python, Go, C++, or similar systems languages
  • Extensive experience with compute infrastructure and workload scheduling
  • Strong expertise in distributed systems, scalability, reliability, and performance engineering
  • Experience with Kubernetes, container orchestration, or large-scale cluster management systems
  • Experience designing backend services or infrastructure platforms operating at production scale
  • Strong communication and collaboration skills across engineering and research teams
  • Bachelor's degree in Computer Science, Engineering, or related field
Preferred Qualifications
  • Experience building schedulers, resource managers, or orchestration systems for distributed workloads
  • Experience with accelerator infrastructure such as TPU, GPU
  • Experience with distributed ML training or inference systems
  • Familiarity with frameworks such as JAX, PyTorch, TensorFlow, Ray, Pathways
  • Experience operating large-scale multi-tenant infrastructure in cloud or hybrid environments
  • Background in performance optimization, fault tolerance, or resource efficiency for large distributed systems
  • MS or PhD in Computer Science, Engineering, or related field
Pay & Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location. Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program. Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant At Apple, we believe accessibility is a fundamental human right. You'll find that idea reflected in everything here — in our culture, our benefits and our digital tools. By welcoming as many perspectives as possible, we help you build a career where you feel like you belong. Learn about accessibility in Apple's workplace Learn about reasonable accommodations for job applicants Apple accepts applications to this posting on an ongoing basis. Submit Resume Back to search results See all roles in Santa Clara

Vacancy posted 17 hours ago
Similar jobs that could be interesting for youBased on the Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model in Seattle, WA vacancy
  • $171.6k - $302.2k

    Apple Inc. in Seattle seeks a Senior/Staff Engineer for its Foundation Model Compute Infrastructure team. The role involves leading the design and development of large-scale scheduling and orchestration systems for TPU workloads. Candidates should have over 7 years of experience... 
    Senior
    Foundation

    Apple Inc.

    Seattle, WA
    4 days ago
  •  ...groundbreaking generative modeling technologies...  ...ML teams focus on...  ...Visual Generative Foundation Models, Multimodal...  ...are seeking engineers experienced in building infrastructure for training,...  ...for effective scheduling of multimodal...  ...Electrical Engineering/Computer Science or a... 
    Senior
    Foundation

    Apple

    Seattle, WA
    3 days ago
  •  ...based in Seattle, is seeking a Staff Engineer to lead the technical...  ...driven lifecycle management of computing clusters. You will manage...  ...scalability, security, and infrastructure excellence. The ideal candidate...  ...engineering, with a strong foundation in distributed systems and... 
    Foundation

    Menlo Ventures

    Seattle, WA
    2 days ago
  • $232.56k - $427.5k

     ...committed to building a storage and computing infrastructure that can adapt to various data...  ...storage systems and computing models. Use Paimon as the storage foundation and combine it with the...  ...in computer science, software engineering, or related fields, with experience... 
    Senior
    Foundation
    Temporary work
    Local area
    Flexible hours

    Tik Tok

    Seattle, WA
    17 hours ago
  • $181.1k - $318.4k

     ...Staff/Sr. Machine Learning Engineer, Foundation Models - AI, Search & Knowledge Platforms Work Locations...  ...models on servers. Our Infrastructure powers a wide gamut of...  ...drawing every ounce of compute from our hardware. As...  ...one of the popular ML Frameworks like Pytorch... 
    Senior
    Foundation
    Relocation

    Apple

    Seattle, WA
    13 hours ago
  • $264.1k - $369.74k

     ...Senior Principal Engineer for Network Architecture...  ...and architectural foundation for the "brain" of...  ...Pioneer capacity modeling methodologies that...  ...'s degree in Computer Science, Electrical...  ...Segment Routing (SRv6/SR-MPLS) in complex,...  ...based on weekly scheduled hours, and up to 1... 
    Senior
    Foundation
    Permanent employment
    Temporary work
    Local area

    Blue Origin

    Seattle, WA
    4 days ago
  • $320k

     ...Staff + Sr. Software Engineer, Cloud Inference Launch Engineering...  ...Inference, the model & inference...  ...is high-leverage infrastructure work: validation...  ...at a time when compute is our scarcest...  ...prior inference or ML experience is not...  ...Capacity-constrained scheduling or shared-... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    Seattle, WA
    1 day ago
  • $171.6k - $258.1k

     ...Sr. ML Infrastructure Engineer, Siri Runtime Systems and Interaction Apple is where individual imaginations...  ...in close partnership with our ML Modeling teams, Infrastructure, Software,...  ...following: Strong background in computer science: algorithms, data structures... 
    Senior
    Relocation

    Apple

    Seattle, WA
    2 days ago
  • $163.3k - $290.1k

     ...something.The ASE - AI Infrastructure team at Apple is building...  ...generation of Machine Learning models and we are looking for...  ...is seeking a Senior Engineering Program Manager for its ML Compute Platform. This platform...  ...cloud infrastructure, GPU/TPU usage for ML training,... 
    Senior
    Work experience placement
    Relocation

    Apple Inc.

    Seattle, WA
    4 days ago
  • $171.6k - $302.2k

     ...also focuses on ML-driven forecasting...  ...cost models for iCloud's large...  ...services. As a Sr. ML Optimization Engineer, you will work at...  ...systems engineering, infrastructure strategy, applied...  ...allocate capacity, schedule workloads, and...  ...Master’s degree in Computer Science,... 
    Senior
    Relocation

    Apple Inc.

    Seattle, WA
    3 days ago
  • $171.6k - $302.2k

    Senior Software Release Engineer, Private Cloud Computing Seattle, Washington,...  ...the most ambitious infrastructure efforts in the...  ...contributor to the foundation of Apple's future cloud...  ...Cloud Compute Manage model weights across OS...  ...Experience validating ML or LLM workloads running... 
    Senior
    Foundation
    Relocation

    Apple Inc.

    Seattle, WA
    4 days ago
  •  ...Opportunity Sesame believes in a future where computers are lifelike - with the ability to see, hear,...  ...consisting of a variety of LLM, speech, and vision models. Partner with ML infrastructure and training engineers to build a fast, cost-effective, accurate, and... 
    Full time
    Contract work
    Flexible hours

    SESAME

    Bellevue, WA
    3 days ago
  • $207k - $300k

    A leading technology company is seeking a Staff Software Engineer in AI/ML for its Cloud Identity and Access Management team...  ...development, technical leadership, and cloud computing, focusing on building scalable infrastructure systems. The position offers a competitive... 

    Google Inc.

    Seattle, WA
    4 days ago
  • $179k - $294k

    Senior Software Engineer - High Performance Computing Design and implement improvements...  ...Computing infrastructure. Location: Seattle,...  ..., algorithmic job scheduling, and adaptive cloud...  ...by partnering with ML practitioners,...  ...field with a strong foundation in data structures... 
    Senior
    Foundation
    Temporary work

    jobs.frontdoordefense.com - Jobboard

    Seattle, WA
    2 days ago
  •  ...Staff + Sr. Software Engineer, Inference Deployment...  ...— and every model update must reach...  ...the deployment infrastructure that moves inference...  ...validation, scheduling deployments...  ...across GPU, TPU, and Trainium...  ...with ML inference or...  ...traditional efforts in computer science. We'... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    anthropic

    Seattle, WA
    3 days ago
  •  ...As a Principal AI/ML at JPMorgan Chase...  ...learning, software engineering, and product management...  ...distributed computing, big data, cloud engineering...  ...distributed AI/ML infrastructure, including inference, training, scheduling, orchestration,...  ...(e.g., GPU, TPU, RDMA), or ML for... 

    JPMorgan Chase & Co.

    Seattle, WA
    3 days ago
  • $202.16k - $368.22k

     ...Senior Software Engineer - Compute Infrastructure (Cloud Native) Location: Seattle...  ...Infrastructure - Orchestration & Scheduling team uses Kubernetes and...  ...TikTok and various AI/ML & LLM initiatives, we face...  ...infrastructure for AI & LLM models. Your expertise can drive... 
    Senior
    Temporary work
    Internship
    Local area
    Overseas

    ByteDance

    Seattle, WA
    2 days ago
  • $171.6k - $302.2k

     ...United States Machine Learning and AI The Foundation Model Services team within the Machine...  ...Qualifications 5+ years of industry experience in ML technologies (LLMs, machine learning,...  ...). Bachelor’s degree or higher in Computer Science or a related technical field. Preferred... 
    Foundation
    Relocation package

    Apple Inc.

    Seattle, WA
    1 day ago
  • $171.6k - $302.2k

    A leading technology company is seeking a Senior Computer Vision and Machine Learning Engineer to innovate in creative editing tools. The ideal candidate will possess advanced skills in computational photography and machine learning, with at least 5 years of experience.... 
    Senior

    Apple Inc.

    Seattle, WA
    1 day ago
  • $124.9k - $228.9k

     ...mobile apps, news, and more. Our Software Engineers are end‑to‑end owners who design, build...  ...for stakeholders. The High‑Performance Computing team powers the core bidding platform...  ...experience. Key Attributes Strong foundation in engineering fundamentals; confident... 
    Senior
    Foundation

    The Trade Desk

    Bellevue, WA
    17 hours ago
  • $228.7k - $309.4k

     ...next science and engineering revolution at Amazon's Delivery Foundation Model team, where you'll...  ...'s vast data and computational resources to...  ...training and evaluation infrastructure - Guide and...  ...successful production ML deployments -...  ...supervisors, and staff; adhere to... 
    Foundation
    Local area
    Worldwide
    Flexible hours

    Amazon.com Services LLC

    Bellevue, WA
    17 hours ago
  • $76.2 - $129.74 per hour

     ...Senior Principal Software Engineer IS - Hybrid The...  ...caregiver experience. Staff in this role bring together...  ...Bachelor’s Degree in Computer Engineering, Computer...  ...delivering a robust foundation of services and...  ...Information Technology Job Schedule: Full time Job... 
    Senior
    Foundation
    Minimum wage
    Full time
    Local area
    Shift work

    Providence Service

    Renton, WA
    3 days ago
  •  ...IT Infrastructure Engineer V - ITSM, ITIL, VoIP, PSTN The Support Engineer...  ...business drivers, and establish a foundation for enterprise systems...  .... Bachelors degree in Computer Science, CIS, or related field...  ..., WA Honolulu, HI Scheduled Weekly Hours: 40 Shift: Day... 
    Foundation
    Full time
    Work experience placement
    Work from home
    Flexible hours
    Shift work

    Kaiser Permanente

    Renton, WA
    1 day ago
  •  ...for Machine Learning Engineers to work on ambitious...  ...focused on large language models for text generation,...  ...methodology & infrastructure to benefit ongoing and...  ...Qualifications MS, or PhD in Computer Science, Machine...  ...Deep experience in foundation model-based AI... 
    Senior
    Foundation

    Apple

    Seattle, WA
    3 days ago
  • $171.6k - $258.1k

     ...us create the data and infrastructure ecosystem needed to support our ML development and continuously...  ...with our ML Modeling teams, Infrastructure,...  ...: Strong background in computer science: algorithms, data...  ...degree in Computer Science, Engineering, or related discipline,... 
    Senior
    Relocation

    Apple Inc.

    Seattle, WA
    17 hours ago
  • $141.9k - $190.3k

     ...Sr Machine Learning Engineer Disney Entertainment and ESPN...  ...the technological foundation and consumer...  ...an experienced ML engineer who enjoys...  ...system reliability, model performance, and...  ...'s degree in Computer science or...  ...on cloud-native infrastructure and distributed... 
    Senior
    Foundation

    Disney

    Seattle, WA
    3 days ago
  • $204k - $259k

     ...Machine Learning Engineer – VLM/LLM...  ...of the Waymo AI Foundations team is to develop...  ...demonstration, generative modeling, Bayesian...  ...a hybrid work schedule and you will...  ...to a Senior Staff Software Engineer...  ...Master's degree in Computer Science,...  ...Experience in ML engineering and... 
    Senior
    Foundation
    Full time
    Temporary work
    Remote work

    Waymo

    Kirkland, WA
    3 days ago
  • $123.95k - $157.11k

     ...Division Ops and Infrastructure Opening Date...  ...Senior Network Engineer's as part of its...  ...KCIT provides the foundational systems and services...  ...degree in Computer Science, Information...  ...works in a hybrid model, with days in the...  ...responsive during scheduled work hours. Work... 
    Senior
    Foundation
    Full time
    Temporary work
    Part time
    Work at office
    Remote work
    Monday to Friday
    Flexible hours

    King County

    Seattle, WA
    1 day ago
  • $230.77k - $323.08k

     ...As a Principal Engineer for Network Architecture...  ...to create the foundation for a...  ...Drive capacity modeling that translates...  ...mentoring senior staff and influencing...  ...Bachelor's degree in Computer Science,...  ...Routing (SRv6/SR-MPLS) in complex...  ...based on weekly scheduled hours, and up to... 
    Foundation
    Permanent employment
    Temporary work
    Local area

    Blue Origin

    Seattle, WA
    4 days ago
  • $141.9k - $190.3k

     ...global organization of engineers, product developers, designers...  ...the technological foundation and consumer media...  ...mixed media, language models, and other agentic multimodal...  ...of classical ML models to optimize advertisement...  ...BRING Bachelor's in computer science or equivalent... 
    Senior
    Foundation

    The Walt Disney Company

    Seattle, WA
    17 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model. Be the first to apply!