Staff Software Engineer, GPU Performance
$207k - $300kBenefits Health, dental, vision, life, disability insurance Retirement Benefits: 401(k) with company match Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment Sick Time: 40 hours/year (increased to 69 hours/year for Seattle) including 5 discretionary sick days per instance Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks Baby Bonding Leave: 18 weeks Holidays: 13 paid days per year Location Preferences By applying to this position you will have an opportunity to share your preferred working location from the following: Sunnyvale, CA, USA; Kirkland, WA, USA; New York, NY, USA . Minimum Qualifications Bachelor’s degree or equivalent practical experience. 8 years of experience in software development. 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture. Experience with modern GPU architectures (NVIDIA, AMD, or other AI accelerators), memory hierarchies, and performance bottlenecks. Experience with modern LLMs and their deployment on AI accelerators. Experience with low-level GPU programming (CUDA, Triton, CUTLASS, etc.) and performance engineering techniques. Preferred Qualifications Master’s degree or PhD in Engineering, Computer Science, or a related technical field. 8 years of experience with data structures and algorithms. 3 years of experience in a technical leadership role leading project teams and setting technical direction. 3 years of experience working in a structured organization involving cross‑functional, or cross‑business projects. Experience with compiler optimization, code generation, and runtime systems for GPU architectures (OpenXLA, MLIR, Triton, etc.). About The Job Google Cloud’s mission is to make every business successful through AI by combining cutting‑edge technology, infrastructure, and talent. AI/ML software engineers in Cloud bridge the gap between pioneering models and a massive product vehicle reaching billions. Our talent density and AI‑powered tools drive rapid development, rooted in a culture of empowerment and a bias to action. In this role, you aren’t just building technology; you’re shaping the frontier of enterprise and driving the evolution of advanced models. While known for pioneering work with TPUs, GPUs are an equally vital and rapidly expanding frontier within Google's machine learning infrastructure. GPUs are indispensable to Google’s diverse and ever‑evolving landscape for strategic, pragmatic, and performance‑driven reasons ensuring top performance for our machine learning (ML) models, adapting to ML workloads, achieving results, and influencing next‑gen GPU architectures via partnerships. In recognition of hardware as a strength, Google’s Core ML organization is heavily invested in growing the powerhouse team of GPU experts, and we invite you to be at its vanguard. In this role, you will have the opportunity to move beyond incremental improvements and architect transformative solutions, shaping the future of AI and accelerated computing for Google and the world. The AI and Infrastructure team is redefining what’s possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide. We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting‑edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world‑leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. Salary The US base salary range for this full‑time position is $207,000‑$300,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job‑related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google. Responsibilities Identify and maintain LLM training and serving benchmarks, using them to identify performance opportunities, drive XLA:GPU/Triton performance toward XLA releases. Engage with various teams, like DeepMind, to solve challenging ML model performance problems. Run architecture‑level simulations on GPU designs and perform roofline analysis to guide partner teams. Analyze performance and efficiency metrics to identify bottlenecks and then design and implement solutions at Google fleet‑wide scale. Run performance benchmarks on GPU hardware using internal and external tools such as TRT‑LLM, vLLM, and SGLang. EEO and Equal Opportunity Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form. #J-18808-Ljbffr Google
$251k - $310k
...Staff Software Engineer, Capacity Optimization Waymo is an autonomous driving technology company... ...technical infrastructure resources (CPU, GPU, TPU, Storage). We are establishing... ...simulation environment is both high-performance and cost-effective. You will:...PerformanceFull timeRemote workShift work$140k - $170k
{{""" Job Title : Staff Software Engineer - Hypervisor Location : Remote Pay Range : $140,000 - $170... ...be run on separate cores for improved performance, reliability, and security. CoreSuite... ...graphics libraries and tools that enable GPU hardware acceleration for both...PerformanceRemote work$188k - $275k
...Staff Software Engineer, Compute Architecture Manhattan, NY / Sunnyvale, CA / Bellevue, WA / Livingston... ...combines superior infrastructure performance with deep technical expertise to... ...operate the backbone of our large-scale GPU data centers. The METALDEV team builds...PerformancePermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$188k - $275k
...CoreWeave combines superior infrastructure performance with deep technical expertise to... ...more at What you'll do As a Staff Software Engineer on the Identity & Access Management (... ...Knowledge: Familiarity with AI/ML workloads, GPU-based infrastructure, and the unique...PerformancePermanent employmentTemporary workCasual workWork at officeImmediate startRemote workFlexible hours- ...Cohere is a team of researchers, engineers, designers, and more, who are... ...energized by building high-performance, scalable and reliable... ...looking for Members of Technical Staff to join the Model Serving team... ...systems with Kubernetes, and GPU workloads on those clusters...PerformanceFull timeWork experience placementWork at officeRemote workFlexible hours
- ...infrastructure that allows the engineering team to execute quickly,... ...analytics, and alerting for performance and security across all endpoints... ...external‑dns. Experience with GPU‑enabled clusters is a bonus.... ...Production experience with database software such as PostgreSQL....PerformanceLocal areaRemote work
$190k - $270k
...Staff Software Engineer - AI Research Infrastructure P-1215 At Databricks, we are obsessed... ...and model training (e.g., HPC clusters, GPU fleets, or cloud-based systems) Enable... ...also include eligibility for annual performance bonus, equity, and the benefits listed...PerformanceLocal areaWorldwide$320k - $405k
...Staff Infrastructure Engineer, Node Infra Anthropic's Infrastructure organization... ...that keep every GPU, TPU and Trainium node in... ...Qualifications ~8+ years of software engineering experience, including... ...~ Familiarity with high-performance networking (EFA, RDMA,...PerformanceWork at officeVisa sponsorshipFlexible hours$190k - $261.25k
...insights to improve their business. Founded by engineers - and customer obsessed - we leap at... ...-time serving, ML infrastructure, or GPU orchestration Exposure to platforms like... ...may also include eligibility for annual performance bonus, equity, and the benefits listed...PerformanceLocal areaWorldwide$204k - $259k
...Software Engineer, GPU Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since... ...stack. To achieve our mission, we architect and create high-performance custom silicon; we develop system-level compute architectures...PerformanceFull timeRemote work- ...Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We're seeking a GPU Kernel Engineer to join our team at the... ...acceleration, where your code directly impacts the performance of state-of-the-art machine learning models....PerformanceFlexible hours
$200k - $300k
...Hudson River Trading (HRT) is seeking a Software Engineer focused on GPU reliability to join our Systems... ...develop tools in Python to analyze the performance of GPU hardware and build creative... ...valued. HRT is proud of our diverse staff; we have offices all over the globe...PerformanceWork at officeLocal areaImmediate start- ...Software Engineer III Be an integral part of an agile team that's constantly pushing the envelope... ...and batching. Deploy and manage GPU workloads in Kubernetes environments.... ...Knowledge of GPU programming (CUDA) and performance optimization. Experience with model...Performance
- ...A leading tech company in the United States is seeking an experienced Infrastructure GPU Engineer to build and support high-performance cloud infrastructure. This role involves optimizing resource allocation for GPU workloads, ensuring system reliability, and collaborating...PerformanceRemote work
- ...A pioneering AI infrastructure company is seeking a GPU Cloud Platform Engineer to design and operate large-scale GPU clusters. This remote position aims to ensure high availability and performance of containerized AI workloads across cloud environments. The ideal candidate...PerformanceRemote work
- Darwin Recruitment is seeking a Senior GPU Systems / AI Infrastructure Engineer in New York City. This senior-level role focuses on building and optimising... ...cutting-edge of AI infrastructure, directly impacting performance and scalability of frontier AI models in a hybrid work...Performance
$152k - $241.5k
...York is seeking a Senior AI and FSI Developer Technology Engineer to enhance performance in the Financial Services Industry. The role involves developing... ...programming, C/C++, and have a deep understanding of CPU/GPU architecture. The base salary ranges from $152,000 to $241...Performance- ...training loops and distributed GPU training to massive-scale... ...pipelines The goal is to build the engineering foundation that allows... ...About You You are a strong software engineer who speaks the language... ...algorithms Distributed systems High-performance computing You care deeply...PerformanceRelocation package
- ...and help build the platform engineers turn to to ship AI products.... ...foundational engineers to lead our GPU Networking efforts, making... ...to architect the software fabric that unifies thousands... ...characterize and validate networking performance on bleeding-edge clusters (H1...PerformanceFlexible hours
- ...geo-distributed GPUs, enabling high-performance computing for AI training and inference... .... ️ Role Overview We are seeking a GPU Cloud Platform Engineer to join our core infrastructure team... ...or higher in Computer Science, Software Engineering, Electronic Engineering,...PerformanceFull timeRemote workFlexible hours
- ...Senior Staff Software Engineer We are seeking a highly experienced Senior Staff Software Engineer to lead and deliver complex technical... ...Experience with distributed systems, microservices, and high-performance applications. Preferred / Bonus Skills...Performance
- ...minimisation as you follow best practice engineering Identify, drive and support broader... ...specifications, and deployment procedures Performance monitoring and observability tools for... ...function: Information Technology Industries: Software Development Referrals increase your...PerformanceContract workFor contractors
$190k - $220k
...Senior / Staff Software Engineer (Product) Title of Role: Senior / Staff Software Engineer (Product) Location: New York, onsite... ...improvement. Troubleshoot and debug applications to optimize performance and ensure high availability. Stay current with...PerformanceWork at office- ...We are looking for a Staff or Senior Fullstack Engineer to work on developing and scaling the the company... ...system stability, and increase overall performance. Partner with engineering and... ...tools, and processes. Champion software quality, implement automation, drive...Performance
$150k - $200k
...Datavant today, you’re stepping onto a high-performing, values-driven team. Together, we’re... ...vision for healthcare. At Datavant we value Engineers who problem solve, build, and understand... ...and underlying concepts of software engineering. As a Senior Software Engineer...PerformanceRemote work- ...Senior Staff Engineer Rippling is rapidly expanding its global footprint and product ecosystem... ..., compliance), ensuring robustness, performance, and extensibility. Lead cross-org... ...Will Need ~10+ years of professional software engineering experience, including...PerformanceWork at office3 days per week
- ...training and evaluation. Train, deploy and monitor 2D/3D object detection models to production. Experiment SOTA models’ performance. Mentor junior engineers about best practices. You have Master’s or PhD in Computer Science, Robotics, Deep Learning, or a related field. 5+...Performance
- ...Software Engineer Opportunity At Extend Extend is building a modern document processing cloud. We're on a mission to transform how the... ...former founders, world record holders) operating in a high performance culture, in-person in NYC, with high equity ownership We'...PerformanceWork at officeRelocation
- ...future of wealth management with you. The Role As a Staff Software Engineer at Farther, you'll be a technical leader who shapes how we... ...in system design, scalability, reliability, and performance Operate with significant autonomy - own major technical...Performance
- ...Staff Software Engineer About Supernal Supernal helps small-to-medium businesses hire their first AI employee. Our AI teammates are built using... ...and evolve the core backend platform (Django/DRF/ASGI) performance and correctness Scale async execution across Celery + Dramatiq...PerformanceFull timeRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff Software Engineer, GPU Performance. Be the first to apply!
- software product owner New York, NY
- golang software developer New York, NY
- id software New York, NY
- software quality assurance New York, NY
- software quality assurance specialist New York, NY
- software technical writer New York, NY
- mid-level software developer New York, NY
- software integrator New York, NY
- software sales New York, NY
- internship software New York, NY

