Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff GPU Performance Engineer — AI/LLM Scaling

$207k - $300k

Google Inc.

Google Inc. is seeking a Staff Software Engineer specializing in GPU performance to influence next-gen GPU architectures at their New York office. The ideal candidate will possess a bachelor's degree, 8 years of experience in software development, and a deep understanding of modern GPU architectures like NVIDIA and AMD. Responsibilities include engaging with teams across Google, analyzing performance metrics, and shaping solutions for AI applications. This full-time position offers a competitive salary range of $207,000-$300,000 plus bonuses. #J-18808-Ljbffr Google Inc.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Staff GPU Performance Engineer — AI/LLM Scaling in New York, NY vacancy
  •  ...world's most dynamic AI companies, like...  ...build the platform engineers turn to to ship AI products...  ...We believe that as LLM and multi-modal workloads scale, the network is the...  ...engineers to lead our GPU Networking efforts,...  ...validate networking performance on bleeding-edge... 
    Performance
    Flexible hours

    Baseten

    New York, NY
    4 days ago
  • Pragmatike is hiring a CUDA Kernel Engineer to develop and optimize NVIDIA CUDA kernels for a leading AI startup. This remote role focuses on maximizing GPU performance and throughput for high-scale AI systems. Candidates should have substantial experience with CUDA and... 
    Performance
    Remote job
    Relocation package

    Pragmatike

    New York, NY
    5 days ago
  •  ...skilled professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement learning....  ...years of experience in deploying GPU systems, optimizing model performance, and working with frameworks like SGLang and Megatron. The position... 
    Performance

    Reflection

    New York, NY
    1 day ago
  • A leading AI infrastructure company based in New York seeks skilled engineers to optimize ML systems at scale. The ideal candidate has over 5 years of high-performance coding experience, familiarity with Nvidia GPU architecture, and expertise in ML frameworks like Torch... 
    Performance

    AI Chopping Block, Inc.

    New York, NY
    2 days ago
  • $180k - $220k

     ...NewtonX NewtonX delivers AI-powered B2B insights to...  ..., you'll own the core LLM infrastructure powering...  ...intelligence at scale. Architect automated systems...  ...solutions while maintaining engineering best practices Nice to...  ...base and annual performance bonus) and equity. (Please... 
    Performance
    Immediate start
    Remote work
    Flexible hours

    NewtonX

    New York, NY
    1 day ago
  •  ...nonprofit applied AI research organization...  ...expanding the scale, complexity, and breadth...  ...seeking Software Engineers to develop scalable...  ...ensure robust, correct LLM outputs Collaborate...  ..., scaling, and performance optimizations in real...  ...from non‑technical staff Are flexible and... 
    Performance
    Full time
    Contract work
    Part time
    For contractors
    Flexible hours

    Basis Research Institute

    New York, NY
    4 days ago
  • Google is seeking AI/ML software engineers in New York City to design and enhance GPU architectures for cutting-edge AI applications. This role involves identifying performance metrics for LLMs, engaging with teams to address ML challenges, and delivering scalable solutions... 
    Performance

    Google

    New York, NY
    1 day ago
  •  ...HRB is seeking a QA Engineer in the United States with expertise in API testing and LLM fine-tuning. The successful candidate will ensure the quality of our 'Agent...  ...API interactions, preparing datasets for model performance, and maintaining security protocols.... 
    Performance

    HRB

    New York, NY
    4 days ago
  •  ...A pioneering AI infrastructure company is seeking a GPU Cloud Platform Engineer to design and operate large-scale GPU clusters. This remote position aims to ensure high availability and performance of containerized AI workloads across cloud environments. The ideal candidate... 
    Performance
    Remote work

    Yotta Labs

    New York, NY
    4 days ago
  • $125.5k - $230.2k

    Forward Deployed Engineer - Applied AI - Manager - Financial Services...  ...Build and integrate LLM, RAG, and agentic...  ...capabilities. Improve performance, resilience, maintainability...  ...at enterprise scale (e.g. LlamaIndex, LangChain...  ...execution. Familiarity with GPU‑accelerated AI... 
    Performance
    Contract work
    Summer holiday
    Flexible hours

    Ernst & Young Advisory Services Sdn Bhd

    New York, NY
    3 days ago
  • $144k - $329.1k

    Forward Deployed Engineer - Applied AI - Senior Manager - Financial...  ...at enterprise scale. Ensure API layers and...  ...development. Agentic and LLM Ops Expertise in designing...  ....). Experience with GPU‑accelerated AI...  ...rewarded based on your performance and recognized for the... 
    Performance
    Contract work
    Summer holiday
    Immediate start
    Flexible hours

    Ernst & Young Advisory Services Sdn Bhd

    New York, NY
    1 day ago
  •  ...As the first ML Ops Engineer at Tennr, you'll...  ...Machine Learning and AI systems. You'll own...  ...managing models at scale. Develop and maintain...  ...systems to enhance performance and efficiency....  ...evaluation of ML & LLM systems....  ...Experience with GPU orchestration, including... 
    Performance
    Work at office

    Tennr

    New York, NY
    3 days ago
  •  ...understanding in healthcare. Our AI-powered platform was purpose-...  ..., technologists, and engineers working together to empower people...  ...team and help improve the performance, stability, and scalability of...  ...security, will be under tremendous scale, and presents many... 
    Performance
    Hourly pay
    Full time
    Flexible hours

    Abridge Al, Inc

    New York, NY
    3 days ago
  • $250k - $350k

     ...AI is becoming vitally important in every...  ...of our society. At Scale, our mission is to...  ...algorithms to reach the performance necessary for...  ...an ML Sys Research Engineer, you'll work on building...  ...least 1-3 years of LLM training in a...  ...architecture of the modern GPU cluster... 
    Performance
    Full time

    Scale AI

    New York, NY
    4 days ago
  • Nerdleveltech is looking for a Senior AI Engineer (Node.js / Next.js / TypeScript) to enhance...  ...and develop production-ready LLM experiences. You'll take full responsibility...  ...emphasizes data-driven decisions around model performance and optimizes deployment based on quantitative... 
    Performance
    Remote job
    Flexible hours

    Nerdleveltech

    New York, NY
    4 days ago
  • $152k - $241.5k

    Senior AI and FSI Developer Technology Engineer page is loaded## Senior AI and FSI...  ...push the limits of performance at the intersection...  ...analyze, optimize, and scale complex AI and HPC...  ...for modern CPU and GPU architectures.* Profiling...  ...TensorRT, TensorRT-LLM, and cuTile.*... 
    Performance

    NVIDIA Corporation

    New York, NY
    3 days ago
  • $105k - $140k

     ...leading technology company in Secaucus, NJ seeks a Reliability Engineer responsible for ensuring cloud hardware reliability through...  ...principles. The ideal candidate will develop guidelines, analyze performance data, and collaborate across teams to meet customer... 
    Performance

    ZT Systems group

    Secaucus, NJ
    5 days ago
  • $144k - $286k

     ...the leader in digital performance solutions, helping our...  ...DoubleVerify is hiring a Staff Enterprise Architect...  ...at enterprise scale. This is a hands-on...  ...with enough rigor that engineering teams can build against...  ...reusable patterns. AI / LLM enablement Maintain... 
    Performance
    Live in

    DoubleVerify

    New York, NY
    3 days ago
  • $105k - $140k

     ...to our customers. We are looking for a passionate Reliability Engineer with exceptional knowledge and experience developing and...  ...of reliability engineering guidelines to improve product field performance through design enhancements to meet reliability goals. Uses principles... 
    Performance
    Permanent employment
    Work experience placement
    Work at office
    Local area

    Sanmina-SCI Systems de México

    Secaucus, NJ
    3 days ago
  • $116.25k - $193.75k

    Principal Systems Electrical Engineer - AI/Hyperscale Servers page is loaded## Principal Systems...  ...: R-104243**About The Role**The Senior Staff Electrical Engineer will have...  ...and limits* Drive balance between cost, performance, quality, and schedule and supervise international... 
    Performance
    Permanent employment
    For contractors
    Work at office
    Local area

    ZT Systems group

    Secaucus, NJ
    2 days ago
  • $170k - $235k

     ...We're hiring a Staff Technologist to own the delivery...  ...that spans product and engineering. You shape the product...  ...regulated environment where AI is reshaping what's...  ...-on experience with AI/LLM systems: building,...  ...Additionally, we offer a performance-based bonus program, 40... 
    Performance
    Work at office
    Remote work
    Flexible hours
    Shift work

    Clover Health

    New York, NY
    4 days ago
  • Intuit is looking for a Staff Design System Engineer in New York, New York. This role involves owning the technical vision for the QuickBooks Design...  ...such as React and TypeScript, experience with performance optimization, and a strong understanding of UI accessibility... 
    Performance

    ATX Venture Partners

    New York, NY
    5 days ago
  • $83k - $132k

     ...Bare Metal Support Engineer Livingston, NJ / New York...  ...Essential Cloud for AI™. Built for pioneers by...  ...innovators to build and scale AI with confidence. Trusted...  ...infrastructure performance with deep technical expertise...  ...CoreWeave's extensive GPU fleet across our growing... 
    Performance
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    New York, NY
    3 days ago
  • $300k

     ...Staff + Sr. Software Engineer, Inference San Francisco, CA | New York...  ...interpretable, and steerable AI systems. We want AI...  ...scientists the high-performance inference...  ...High-performance, large-scale distributed systems...  ...management systems LLM inference optimization... 
    Performance
    Work at office
    Worldwide
    Visa sponsorship
    Flexible hours

    anthropic

    New York, NY
    3 days ago
  • $253.3k - $354.6k

     ...Ladders is seeking a Staff Machine Learning Engineer to drive AI initiatives in the Media space. This fully remote...  ...Responsibilities include designing GPU-based systems, developing cloud-based...  ...AI solutions, and ensuring model performance. The role offers competitive compensation... 
    Performance
    Remote work

    Ladders

    New York, NY
    4 days ago
  •  ...search of a Quality Assurance Engineer – AI-Augmented Quality Engineering...  ...supplement the growth and scaling of the company. Millennium Systems...  ...ChatGPT, Cursor, or similar LLM-based assistants) to...  ...functional testing practices – performance, scalability, failover, reliability... 
    Performance
    Temporary work
    Remote work
    Shift work

    Meevo 2

    New York, NY
    4 days ago
  •  ...Nscale is the GPU cloud engineered for AI. We provide cost‑effective, high‑performance infrastructure for AI start‑ups and large enterprise customers. Nscale enables...  ...SOC). You will be responsible for building and scaling a world‑class cyber defense capability capable... 
    Performance
    Remote work
    Flexible hours

    Nscale

    New York, NY
    1 day ago
  •  ...Machine Learning Systems Engineer, RL Engineering...  ..., and steerable AI systems. We want AI...  ...obsessively on improving the performance, robustness, and...  ...performance, large scale distributed systems Large scale LLM training Python...  ..., we expect all staff to be in one of our... 
    Performance
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    New York, NY
    3 days ago
  • $184.05k - $262.93k

     ...machine learning, platform engineering, and regulatory...  ...is investing in large-scale rearchitecture and ML-driven...  ...including multimodal and LLM-based systems Build...  ...to improve model performance, reliability, and fairness...  ...artificial intelligence (AI) tools to support parts... 
    Performance
    Work from home
    Flexible hours

    Spotify

    New York, NY
    3 days ago
  • $139k - $204k

     ...Senior Software Engineer, Storage Engineer Livingston...  ...Essential Cloud for AI™. Built for pioneers by...  ...innovators to build and scale AI with confidence. Trusted...  ...infrastructure performance with deep technical expertise...  ...such as RDMA, GPU Direct Storage, and distributed... 
    Performance
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    New York, NY
    6 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff GPU Performance Engineer — AI/LLM Scaling. Be the first to apply!