TPU ML Compiler Engineer — Optimize Large-Scale Workloads
$147k - $211k
Google Inc.
Google Inc. is seeking a skilled ML Compiler Software Engineer for its Sunnyvale office. The position requires a Bachelor's degree, proficiency in C++, and experience in low-level programming and hardware interaction. In this role, you will focus on developing compiler optimizations for Tensor Processing Units (TPUs), enhancing parallelization features, and collaborating with cross-functional teams to maximize performance. The US base salary for this full-time role is between $147,000 and $211,000 plus bonus and equity.
- ...Principal Machine Learning Engineer to join our Models and... ...training of large models on a large number... ...training generative AI at scale. THE PERSON: The... ...pipeline performance. Optimize the distributed training... ...Experience with ML/DL frameworks such as PyTorch...Suggested
$147k - $211k
ML Compiler Software Engineer, Tensor Processing Unit Google - Sunnyvale,... ...Experience in profiling workloads, identifying and introducing performance optimization. Experience with high... ...techniques for TPU back-end necessary for large-scale workloads. Contribute...SuggestedFull time$128.7k - $261.3k
...the AI Kernels & Compilers team, that... ...real vehicles at scale. We pioneer new... ...and performance engineering so that every cycle... ...of on-vehicle workloads while... ...them into highly optimized inference artifacts... ...effortless for ML engineers across... ...used to deploy large-scale perception...SuggestedLocal areaWork from homeRelocation packageFlexible hours$181.1k - $318.4k
...Sr./Staff ML Infrastructure Engineer, Compute (TPU Scheduling) - Foundation Model Work Locations (2... ...and orchestration systems for large-scale TPU workloads across multi-region clusters. You... ...reliability engineering, and performance optimization. Responsibilities...SuggestedRelocation$181.1k - $318.4k
...Description As a Senior/Staff Engineer on the Foundation Model... ...orchestration systems for large‑scale TPU workloads across multi‑region clusters... ...engineering, and performance optimization. Responsibilities Design... ...orchestration systems for distributed ML workloads running on...SuggestedRelocation$165.2k - $223.6k
...AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS... ...accelerate deep-learning workloads. This role is for a software engineer in the Compiler team for AWS Neuron. As part... ...including massive scale large language models like Llama...InternshipLocal areaFlexible hours$181.1k - $318.4k
...Sr. ML Optimization Engineer, iCloud In Apple's iCloud services organization... ...efficiently utilize resources at scale. This team also focuses on... ...cost models for iCloud's large-scale services. As a Sr. ML... ...capacity, schedule workloads, and minimize cost under reliability...Relocation$156k - $387.6k
...company. Currently, we are looking for Machine Learning Engineer in AI Compiler Optimization to join our team to support and advance that mission.... ...optimization and low-latency inference optimization for large-scale recommendation systems, with the ability to handle compilation...Temporary workLocal area$124k - $195.5k
...Machine Learning Applications and Compiler Engineer for New College Grad 2026 in... ...and compiler stack optimizations, working at the intersection of deep learning and large-scale systems. Ideal candidates should... ..., and experience with ML frameworks like TensorFlow and...- ...Inc. is seeking a Senior/Staff Engineer in Santa Clara, California,... ...design of scheduling systems for TPU workloads. The ideal candidate will... ...years of experience building large-scale distributed systems, strong programming... ...systems for distributed ML workloads and mentoring...
$174k - $252k
Google Inc. is seeking a Senior Software Engineer to develop next-generation ML compiler optimizations for EdgeTPU hardware in Mountain View, CA. This role requires strong expertise in software development, compiler optimization, and machine learning. Responsibilities include...$165.2k - $223.6k
...delivers best-in-class ML inference... ...which includes an ML compiler, runtime and natively... ...are used at scale with customers like... ...including silicon engineering, hardware design and... ...Neuron team works to optimize the performance of... ...'s largest ML workloads. Architecting and...InternshipLocal areaFlexible hours- ...Inference Optimization MLE At Rhoda AI, we're... ...and manufacturing scale-up to make generalist... ...out of large multimodal models,... ...fusion, and model compilation (e.g., TensorRT, torch... ...closely with research engineers to translate model... ...inference optimization, ML systems, or a...
$181.1k - $318.4k
...Machine Learning Compiler Engineer At Apple, we're on the cutting edge... ...of AI and hardware optimization, we want you to join our team... ...general performance of AI workloads on Apple hardware, contributing... ...and experience working with large, complex software systems...Relocation$213k - $263k
...Machine Learning Engineer, Runtime & Optimization Waymo is an autonomous... ...U.S. states. The ML Platform team at Waymo... ...and models to make ML workloads faster. Lead... ...runtime libraries, ML compiler). ~ Deep understanding... ...accelerators (e.g., GPU/TPU). Deep knowledge...Full timeRemote work$152k - $241.5k
...Learning Applications and Compiler Engineer! NVIDIA is seeking... ...algorithms and optimizations for our LPX inference... ...the intersection of large-scale systems, compilers, and... ...crafting how neural network workloads map onto future... ...at top tier ML, compiler, and computer...$126.8k - $190.9k
...boundaries of AI and hardware optimization, we want you to join our team! As a Machine Learning Compiler Engineer on the Apple Neural Engine (... ...general performance of AI workloads on Apple hardware, contributing... ...and experience working with large, complex software systems...Relocation package$193.3k - $261.5k
...learning and GenAI workloads on Amazon's... ...for AWS's custom ML accelerators.... ...boundary, our engineers craft high-performance... ...in delivering optimal performance for... ...includes an ML compiler, runtime, and... ...that are very large, yet our teams... ...and scaling) of new and existing...InternshipLocal areaWork from homeFlexible hours- ...remove the limits of scale, hardware, and... ...one seamless engine. Developers can... ...looking for a Senior ML Performance... ..., validate, and optimize the performance of large language models... ...before and after compiler optimization on... ...evaluating LLM inference workloads across GPU...
$165.2k - $223.6k
...The AWS Neuron Compiler team is actively seeking skilled compiler engineers to join our efforts in developing... ...stack is designed to optimize application models across... ...domains, including Large Language and Vision, originating... ...for advanced ML capabilities, powering...InternshipLocal areaFlexible hours$128.7k - $261.3k
...AI Kernels & Compilers team, that mission... ...vehicles at scale. We pioneer... ...and performance engineering so that every... ...our on-vehicle ML inference for... ...making core AI workloads faster, more reliable... ...in large codebases.... ...architecture-aware optimizations. Strong communication skills...Local areaWork from homeRelocation packageFlexible hours- ...Our novel wafer-scale architecture provides... ...effortlessly run large-scale ML applications,... ...transforming key workloads with ultra high-speed... ...The Role Engineers on the inference... ...performance debugging and optimization, system-level... ...micro code and compiler algorithms to...
- ...GPUs. Our novel wafer-scale architecture... ...to effortlessly run large-scale ML applications, without... ...scale, transforming key workloads with ultra high-speed... ...versatile and experienced engineer to join our SOTA... ..., graph lowering, compiler optimizations, runtime integration...Internship
$153.2k - $234.1k
...transportation on a global scale. Role Overview:... ...every machine learning engineer working on our cutting-... ...models to state-of-the-art optimization, our work is at the... ...vehicles. As a Senior ML Infra Engineer, you... ...of experience building large-scale distributed systems...Work at officeLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours- ...Our novel wafer-scale architecture provides... ...effortlessly run large-scale ML applications,... ...transforming key workloads with ultra high-speed... ...The Inference ML Engineering team at Cerebras... ..., including compiler developers, cluster... ...various features. Optimize software to...
$174k - $252k
Google Inc. is seeking a Senior Machine Learning Engineer in Sunnyvale, CA, to improve AI model performance and efficiency. Candidates... ...experience in software development, testing, and performance optimization. Responsibilities include engaging with product teams to resolve...- ...developing end-to-end ML models for... ...data, scaling distributed training, optimizing inference for real... ...distillation and model compilation to optimize... ...management at large scale Optimize... ...Strong software engineering and systems fundamentals... ...training workloads (SLURM,...
- A leading technology firm is seeking a Principal Machine Learning Engineer in San Jose, CA. The role focuses on optimizing distributed training for large models, making significant contributions to AMD's AI platform. The ideal candidate should have expertise in distributed...
$181.1k - $318.4k
...On-device ML Infrastructure Engineer, Compiler & Runtime, Graphics, Games & ML Imagine being at the forefront... ...that enables machine learning at scale on Apple devices. This involves... ...architectures to embedded systems, developing optimization toolkits for model compression and...Relocation$213k - $263k
...Senior Machine Learning Engineer, Runtime and Serving... ...U.S. states. The ML Optimization team at Waymo... ...vehicle compute and large-scale, offboard data center... ...strategic migration of ML workloads toward a JAX-native runtime... ...underlying ML compilers and runtimes (e.g., OpenXLA...Full timeRemote work

