Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Software Engineer, GPU Performance

$207k - $300k

Google

Benefits Health, dental, vision, life, disability insurance Retirement Benefits: 401(k) with company match Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment Sick Time: 40 hours/year (increased to 69 hours/year for Seattle) including 5 discretionary sick days per instance Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks Baby Bonding Leave: 18 weeks Holidays: 13 paid days per year Location Preferences By applying to this position you will have an opportunity to share your preferred working location from the following: Sunnyvale, CA, USA; Kirkland, WA, USA; New York, NY, USA . Minimum Qualifications Bachelor’s degree or equivalent practical experience. 8 years of experience in software development. 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture. Experience with modern GPU architectures (NVIDIA, AMD, or other AI accelerators), memory hierarchies, and performance bottlenecks. Experience with modern LLMs and their deployment on AI accelerators. Experience with low-level GPU programming (CUDA, Triton, CUTLASS, etc.) and performance engineering techniques. Preferred Qualifications Master’s degree or PhD in Engineering, Computer Science, or a related technical field. 8 years of experience with data structures and algorithms. 3 years of experience in a technical leadership role leading project teams and setting technical direction. 3 years of experience working in a structured organization involving cross‑functional, or cross‑business projects. Experience with compiler optimization, code generation, and runtime systems for GPU architectures (OpenXLA, MLIR, Triton, etc.). About The Job Google Cloud’s mission is to make every business successful through AI by combining cutting‑edge technology, infrastructure, and talent. AI/ML software engineers in Cloud bridge the gap between pioneering models and a massive product vehicle reaching billions. Our talent density and AI‑powered tools drive rapid development, rooted in a culture of empowerment and a bias to action. In this role, you aren’t just building technology; you’re shaping the frontier of enterprise and driving the evolution of advanced models. While known for pioneering work with TPUs, GPUs are an equally vital and rapidly expanding frontier within Google's machine learning infrastructure. GPUs are indispensable to Google’s diverse and ever‑evolving landscape for strategic, pragmatic, and performance‑driven reasons ensuring top performance for our machine learning (ML) models, adapting to ML workloads, achieving results, and influencing next‑gen GPU architectures via partnerships. In recognition of hardware as a strength, Google’s Core ML organization is heavily invested in growing the powerhouse team of GPU experts, and we invite you to be at its vanguard. In this role, you will have the opportunity to move beyond incremental improvements and architect transformative solutions, shaping the future of AI and accelerated computing for Google and the world. The AI and Infrastructure team is redefining what’s possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide. We're the driving force behind Google's groundbreaking innovations, empowering the development of our cutting‑edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world‑leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more. Salary The US base salary range for this full‑time position is $207,000‑$300,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job‑related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google. Responsibilities Identify and maintain LLM training and serving benchmarks, using them to identify performance opportunities, drive XLA:GPU/Triton performance toward XLA releases. Engage with various teams, like DeepMind, to solve challenging ML model performance problems. Run architecture‑level simulations on GPU designs and perform roofline analysis to guide partner teams. Analyze performance and efficiency metrics to identify bottlenecks and then design and implement solutions at Google fleet‑wide scale. Run performance benchmarks on GPU hardware using internal and external tools such as TRT‑LLM, vLLM, and SGLang. EEO and Equal Opportunity Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form. #J-18808-Ljbffr Google

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Staff Software Engineer, GPU Performance in New York, NY vacancy
  • $251k - $310k

     ...Staff Software Engineer, Capacity Optimization Waymo is an autonomous driving technology company...  ...technical infrastructure resources (CPU, GPU, TPU, Storage). We are establishing...  ...simulation environment is both high-performance and cost-effective. You will:... 
    Performance
    Full time
    Remote work
    Shift work

    Waymo

    New York, NY
    4 days ago
  • $140k - $170k

    {{""" Job Title : Staff Software Engineer - Hypervisor Location : Remote Pay Range : $140,000 - $170...  ...be run on separate cores for improved performance, reliability, and security. CoreSuite...  ...graphics libraries and tools that enable GPU hardware acceleration for both... 
    Performance
    Remote work

    LYNX

    New York, NY
    1 day ago
  • $188k - $275k

     ...Staff Software Engineer, Compute Architecture Manhattan, NY / Sunnyvale, CA / Bellevue, WA / Livingston...  ...combines superior infrastructure performance with deep technical expertise to...  ...operate the backbone of our large-scale GPU data centers. The METALDEV team builds... 
    Performance
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    New York, NY
    7 hours ago
  • $188k - $275k

     ...CoreWeave combines superior infrastructure performance with deep technical expertise to...  ...more at What you'll do As a Staff Software Engineer on the Identity & Access Management (...  ...Knowledge: Familiarity with AI/ML workloads, GPU-based infrastructure, and the unique... 
    Performance
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Immediate start
    Remote work
    Flexible hours

    CoreWeave

    New York, NY
    3 days ago
  •  ...Cohere is a team of researchers, engineers, designers, and more, who are...  ...energized by building high-performance, scalable and reliable...  ...looking for Members of Technical Staff to join the Model Serving team...  ...systems with Kubernetes, and GPU workloads on those clusters... 
    Performance
    Full time
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Cohere

    New York, NY
    3 days ago
  •  ...infrastructure that allows the engineering team to execute quickly,...  ...analytics, and alerting for performance and security across all endpoints...  ...external‑dns. Experience with GPU‑enabled clusters is a bonus....  ...Production experience with database software such as PostgreSQL.... 
    Performance
    Local area
    Remote work

    Cresta

    New York, NY
    1 day ago
  • $190k - $270k

     ...Staff Software Engineer - AI Research Infrastructure P-1215 At Databricks, we are obsessed...  ...and model training (e.g., HPC clusters, GPU fleets, or cloud-based systems) Enable...  ...also include eligibility for annual performance bonus, equity, and the benefits listed... 
    Performance
    Local area
    Worldwide

    Databricks

    New York, NY
    4 days ago
  • $320k - $405k

     ...Staff Infrastructure Engineer, Node Infra Anthropic's Infrastructure organization...  ...that keep every GPU, TPU and Trainium node in...  ...Qualifications ~8+ years of software engineering experience, including...  ...~ Familiarity with high-performance networking (EFA, RDMA,... 
    Performance
    Work at office
    Visa sponsorship
    Flexible hours

    anthropic

    New York, NY
    7 hours ago
  • $190k - $261.25k

     ...insights to improve their business. Founded by engineers - and customer obsessed - we leap at...  ...-time serving, ML infrastructure, or GPU orchestration Exposure to platforms like...  ...may also include eligibility for annual performance bonus, equity, and the benefits listed... 
    Performance
    Local area
    Worldwide

    Databricks

    New York, NY
    1 day ago
  • $204k - $259k

     ...Software Engineer, GPU Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since...  ...stack. To achieve our mission, we architect and create high-performance custom silicon; we develop system-level compute architectures... 
    Performance
    Full time
    Remote work

    Waymo

    New York, NY
    7 hours ago
  •  ...Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We're seeking a GPU Kernel Engineer to join our team at the...  ...acceleration, where your code directly impacts the performance of state-of-the-art machine learning models.... 
    Performance
    Flexible hours

    Baseten

    New York, NY
    1 day ago
  • $200k - $300k

     ...Hudson River Trading (HRT) is seeking a Software Engineer focused on GPU reliability to join our Systems...  ...develop tools in Python to analyze the performance of GPU hardware and build creative...  ...valued. HRT is proud of our diverse staff; we have offices all over the globe... 
    Performance
    Work at office
    Local area
    Immediate start

    Hudson River Trading

    New York, NY
    1 day ago
  •  ...Software Engineer III Be an integral part of an agile team that's constantly pushing the envelope...  ...and batching. Deploy and manage GPU workloads in Kubernetes environments....  ...Knowledge of GPU programming (CUDA) and performance optimization. Experience with model... 
    Performance

    Chase

    Jersey City, NJ
    7 hours ago
  •  ...A leading tech company in the United States is seeking an experienced Infrastructure GPU Engineer to build and support high-performance cloud infrastructure. This role involves optimizing resource allocation for GPU workloads, ensuring system reliability, and collaborating... 
    Performance
    Remote work

    DevOpsChat

    New York, NY
    1 day ago
  •  ...A pioneering AI infrastructure company is seeking a GPU Cloud Platform Engineer to design and operate large-scale GPU clusters. This remote position aims to ensure high availability and performance of containerized AI workloads across cloud environments. The ideal candidate... 
    Performance
    Remote work

    Yotta Labs

    New York, NY
    1 day ago
  • Darwin Recruitment is seeking a Senior GPU Systems / AI Infrastructure Engineer in New York City. This senior-level role focuses on building and optimising...  ...cutting-edge of AI infrastructure, directly impacting performance and scalability of frontier AI models in a hybrid work... 
    Performance

    Darwin Recruitment

    New York, NY
    2 days ago
  • $152k - $241.5k

     ...York is seeking a Senior AI and FSI Developer Technology Engineer to enhance performance in the Financial Services Industry. The role involves developing...  ...programming, C/C++, and have a deep understanding of CPU/GPU architecture. The base salary ranges from $152,000 to $241... 
    Performance

    NVIDIA Corporation

    New York, NY
    5 days ago
  •  ...training loops and distributed GPU training to massive-scale...  ...pipelines The goal is to build the engineering foundation that allows...  ...About You You are a strong software engineer who speaks the language...  ...algorithms Distributed systems High-performance computing You care deeply... 
    Performance
    Relocation package

    Reflection

    New York, NY
    3 days ago
  •  ...and help build the platform engineers turn to to ship AI products....  ...foundational engineers to lead our GPU Networking efforts, making...  ...to architect the software fabric that unifies thousands...  ...characterize and validate networking performance on bleeding-edge clusters (H1... 
    Performance
    Flexible hours

    Baseten

    New York, NY
    1 day ago
  •  ...geo-distributed GPUs, enabling high-performance computing for AI training and inference...  .... ️ Role Overview We are seeking a GPU Cloud Platform Engineer to join our core infrastructure team...  ...or higher in Computer Science, Software Engineering, Electronic Engineering,... 
    Performance
    Full time
    Remote work
    Flexible hours

    Yotta Labs

    New York, NY
    1 day ago
  •  ...Senior Staff Software Engineer We are seeking a highly experienced Senior Staff Software Engineer to lead and deliver complex technical...  ...Experience with distributed systems, microservices, and high-performance applications. Preferred / Bonus Skills... 
    Performance

    Strategic Employment

    New York, NY
    2 days ago
  •  ...minimisation as you follow best practice engineering Identify, drive and support broader...  ...specifications, and deployment procedures Performance monitoring and observability tools for...  ...function: Information Technology Industries: Software Development Referrals increase your... 
    Performance
    Contract work
    For contractors

    hackajob

    New York, NY
    1 day ago
  • $190k - $220k

     ...Senior / Staff Software Engineer (Product) Title of Role: Senior / Staff Software Engineer (Product) Location: New York, onsite...  ...improvement. Troubleshoot and debug applications to optimize performance and ensure high availability. Stay current with... 
    Performance
    Work at office

    Recruiting from Scratch

    New York, NY
    7 hours ago
  •  ...We are looking for a Staff or Senior Fullstack Engineer to work on developing and scaling the the company...  ...system stability, and increase overall performance. Partner with engineering and...  ...tools, and processes. Champion software quality, implement automation, drive... 
    Performance

    Spartan Technologies

    New York, NY
    7 hours ago
  • $150k - $200k

     ...Datavant today, you’re stepping onto a high-performing, values-driven team. Together, we’re...  ...vision for healthcare. At Datavant we value Engineers who problem solve, build, and understand...  ...and underlying concepts of software engineering. As a Senior Software Engineer... 
    Performance
    Remote work

    Datavant

    New York, NY
    1 day ago
  •  ...Senior Staff Engineer Rippling is rapidly expanding its global footprint and product ecosystem...  ..., compliance), ensuring robustness, performance, and extensibility. Lead cross-org...  ...Will Need ~10+ years of professional software engineering experience, including... 
    Performance
    Work at office
    3 days per week

    ZoneIn

    New York, NY
    7 days ago
  •  ...training and evaluation. Train, deploy and monitor 2D/3D object detection models to production. Experiment SOTA models’ performance. Mentor junior engineers about best practices. You have Master’s or PhD in Computer Science, Robotics, Deep Learning, or a related field. 5+... 
    Performance

    AeroVect

    New York, NY
    3 days ago
  •  ...Software Engineer Opportunity At Extend Extend is building a modern document processing cloud. We're on a mission to transform how the...  ...former founders, world record holders) operating in a high performance culture, in-person in NYC, with high equity ownership We'... 
    Performance
    Work at office
    Relocation

    Extend

    New York, NY
    1 day ago
  •  ...future of wealth management with you. The Role As a Staff Software Engineer at Farther, you'll be a technical leader who shapes how we...  ...in system design, scalability, reliability, and performance Operate with significant autonomy - own major technical... 
    Performance

    Farther LLC

    New York, NY
    7 hours ago
  •  ...Staff Software Engineer About Supernal Supernal helps small-to-medium businesses hire their first AI employee. Our AI teammates are built using...  ...and evolve the core backend platform (Django/DRF/ASGI) performance and correctness Scale async execution across Celery + Dramatiq... 
    Performance
    Full time
    Remote work

    Infinity

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Software Engineer, GPU Performance. Be the first to apply!