Senior ML Infra Engineer: TPU Scheduling & Orchestration
Apple Oakbrook
Apple Inc. is seeking a Senior/Staff Engineer in Santa Clara, California, to lead the design of scheduling systems for TPU workloads. The ideal candidate will have over 7 years of experience building large-scale distributed systems, strong programming skills in Python and C++, and expertise in Kubernetes. The role includes responsibilities such as developing orchestration systems for distributed ML workloads and mentoring engineers. Benefits include competitive pay and comprehensive medical coverage. #J-18808-Ljbffr
$181.1k - $318.4k
...add something! Description As a Senior/Staff Engineer on the Foundation Model Compute... ...the design and development of scheduling and orchestration systems for large‑scale TPU workloads across multi‑region clusters... ...systems for distributed ML workloads running on Kubernetes...SeniorRelocation- ...General Motors in Sunnyvale, California, is offering a Staff ML Infra Engineer position that focuses on enhancing autonomous driving through machine learning solutions. The role involves designing scalable systems for training and evaluating ML models, requiring a strong...SeniorRemote work
$152k - $287.5k
...NVIDIA Gruppe, based in Santa Clara, is seeking a Senior Software Engineer to accelerate the development of machine learning innovations. In this role, you'll design and implement solutions for GPU clusters, enabling researchers to optimize their work. Strong expertise...Senior$272k - $431.25k
...We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA to join our Hardware Infrastructure team. As an Engineer... ...silicon), storage (e.g., Lustre, GPFS, BeeGFS), scheduling & orchestration (e.g., Slurm, Kubernetes, LSF), high‑speed networking...Suggested- ...Staff/Sr. ML Compute Efficiency Engineer Scaling machine learning workloads across thousands of GPUs and TPUs creates challenges that... ...parallelism techniques, and refining workload scheduling and orchestration across the compute fleet. Characterize ML workload...Senior
- ...NVIDIA Gruppe is seeking a Senior Software Engineer in Santa Clara, California, to build foundational infrastructure for Robotics Research. The role emphasizes ML productivity tooling and requires significant experience in MLOps and software development. The ideal candidate...Senior
$262k - $365k
A leading technology company in Sunnyvale, CA seeks a Senior Staff Software Engineer specializing in ML Infrastructure. The role involves designing back-end services and collaborating with AI teams. Candidates should have significant software development experience, particularly...Senior- Google Inc. seeks a Senior Software Engineer to work on TPU Performance and Hardware Software Co-Design. The role demands 5 years of experience in software... ...C, C++, Java, or Python, and a strong background in ML algorithms, performance analysis, and optimization. The...Senior
- ...General Motors is looking for a Senior ML Infrastructure Engineer to build robust compute platforms for AI validation. This role emphasizes driving efficiency and maximizing GPU utilization while improving platform reliability. You will collaborate with engineers to shape...Senior
$153k - $222k
...Decisive Point is looking for infrastructure engineers and ML engineers to join the Data & ML infra group in Mountain View, California. The role focuses on working across the ML lifecycle and solving broad data problems. Ideal candidates will have software engineering...Senior$174k - $252k
Google Inc. is hiring a Senior Software Engineer specialized in ML Infrastructure to develop technologies for massive-scale applications. Located in Sunnyvale, CA, you will build infrastructure solutions for conversational AI products, collaborating closely with various...Senior$166k - $244k
...Carlsbad Tech is actively seeking a Senior Software Engineer to work on the Gemini Live API in Sunnyvale, CA. This role involves building scalable... ...in software development, infrastructure management, and AI/ML technologies. Benefits include a competitive salary ranging...Senior- ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this role, you'll work closely with AI research teams to enhance efficiency by addressing infrastructure deficiencies for GPU Clusters...
$166k - $244k
A leading tech company based in Sunnyvale is seeking a Senior Software Engineer with expertise in Python and C++ to develop innovative solutions in AI and ML. Candidates should possess extensive experience in ML infrastructure and software architecture. This role offers...Senior- ...A leading automotive company is looking for a Senior ML Infrastructure Engineer to build and enhance robust computing platforms for simulation workflows. The successful candidate will contribute to the design of core backend components, collaborate with cross-functional...Senior
- ...A major automotive firm is seeking a Senior ML Engineer to join their Embodied AI team, focusing on delivering machine learning solutions for autonomous vehicles. The ideal candidate should have over 5 years of experience in large-scale systems and strong skills in Python...SeniorWork at officeRemote work
$272k - $431.25k
...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing...- ...automation solutions. Build and maintain AI and ML heterogeneous clusters on-premises and... ...degree in Computer Science, Electrical Engineering or related field or equivalent... ...infrastructure. Experience with AI/HPC advanced job schedulers, such as Slurm, K8s, PBS, RTDA or LSF....Senior
- ...A technology company in Palo Alto is seeking a Senior ML & Data Infrastructure Engineer to own and scale its data infrastructure. The role involves architecting a high-throughput system for managing billions of clips, optimizing storage solutions, and collaborating directly...Senior
$213k - $263k
...Senior Machine Learning Engineer (Infra), Driver Understanding and Evaluation Waymo is an autonomous driving technology company with the mission to... ...Design and scale large distributed systems covering the ML lifecycle, supporting planet-scale dataset generation, model...SeniorFull time$153.2k - $234.1k
...autonomous vehicle behavior across real-world scenarios. As a Senior ML Infra Engineer, you will work on the core systems that enable rapid... ...Blaze, orCMake. Proficiencywith containerization and orchestration technologies (e.g., Docker, Kubernetes). Remote/Hybrid...SeniorLocal areaRemote workWork from homeRelocation packageFlexible hours$153.2k - $234.1k
...driving? Join the Embodied AI Infra Foundation team at General... ...powers every machine learning engineer working on our cutting-edge... ...vehicles. As a Senior ML Infra Engineer, you will build... ...working with containerization and orchestration technologies (Docker, Kubernetes...SeniorWork at officeLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$190k - $253k
Matterport - Senior ML/CV Engineer - Computational Photography & Image Processing Job Description CoStar Group (NASDAQ: CSGP) is a leading global... ...This role is located in our Sunnyvale, CA office and has a schedule of 4 days on-site and 1 day remote. What You Will Do...SeniorWork at officeRemote work$128.7k - $261.3k
.... We partner closely with model developers and deployment and infra engineers to ship numerically robust, low-latency models to the car, blending... ...Electrical Engineering, Physics, Mathematics, Data Science / ML, or a closely related quantitative field (or equivalent...SeniorLocal areaRemote workWork from homeRelocation packageFlexible hours$275.8k - $340.5k
...Position Overview The Principal AI/ML Engineer will lead a growing organization, guiding the AV ML Infra team in achieving its mission while shaping long‑term vision... ...Web Services. Experience with open‑source orchestration platforms such as Kubeflow, Flyte, Airflow, etc...Local areaRemote workRelocationRelocation packageFlexible hours- ...effortlessly run large‑scale ML applications, without the hassle... ...are looking for a Software Engineer to join the ML Integration... ...containerized environments, or cluster orchestration. Exposure to hardware... ...Location This role follows a hybrid schedule, requiring in-office presence...SeniorWork at officeRemote work
$275.8k - $340.5k
...join us. About the team: The AV ML Infra team at GM builds ML infrastructure designed... ..., enhance the productivity of ML engineers, and drive the adoption of cutting-edge... ...Services. Experience with open-source orchestration platforms such as Kubeflow, Flyte, Airflow...Local areaRemote workWork from homeRelocationRelocation packageFlexible hours- ...General Motors is seeking a Senior AI/ML Engineer to join their team in Mountain View, California. This role focuses on designing scalable ML infrastructure solutions, mentoring engineers, and taking ownership of technical projects. Candidates should have a strong background...SeniorRemote work
$162.8k - $203.5k
...Rivian is searching for a Staff Software Engineer on the Data team, responsible for expertise in cloud and data engineering. The role requires a solid understanding of the AWS Cloud Data Platform, leading critical infrastructure services for the ADAS team. Key qualifications...Senior- ...Hewlett Packard Enterprise Development LP seeks a Senior AI/ML Engineer – Agentic in San Jose, California. The role involves designing, building, and operating a production-grade agentic orchestration platform while integrating enterprise-scale LLM services. Ideal candidates...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior ML Infra Engineer: TPU Scheduling & Orchestration. Be the first to apply!
- machine learning software engineer Santa Clara, CA
- ai ml engineer Santa Clara, CA
- computer vision machine learning engineer Santa Clara, CA
- machine learning engineer Santa Clara, CA
- senior ml engineer Santa Clara, CA
- machine learning ai engineer Santa Clara, CA
- senior cost analyst Santa Clara, CA
- senior computer engineer Santa Clara, CA
- senior development engineer Santa Clara, CA
- senior manager quality engineering Santa Clara, CA

