Staff GPU Performance Engineer — AI/LLM Scaling
$207k - $300k
Google Inc.
Google Inc. is seeking a Staff Software Engineer specializing in GPU performance to influence next-gen GPU architectures at their New York office. The ideal candidate will possess a bachelor's degree, 8 years of experience in software development, and a deep understanding of modern GPU architectures like NVIDIA and AMD. Responsibilities include engaging with teams across Google, analyzing performance metrics, and shaping solutions for AI applications. This full-time position offers a competitive salary range of $207,000-$300,000 plus bonuses.
- ...world's most dynamic AI companies, like... ...build the platform engineers turn to to ship AI products... ...We believe that as LLM and multi-modal workloads scale, the network is the... ...engineers to lead our GPU Networking efforts,... ...validate networking performance on bleeding-edge...
  Tags: Performance, Flexible hours
- Pragmatike is hiring a CUDA Kernel Engineer to develop and optimize NVIDIA CUDA kernels for a leading AI startup. This remote role focuses on maximizing GPU performance and throughput for high-scale AI systems. Candidates should have substantial experience with CUDA and...
  Tags: Performance, Remote job, Relocation package
- ...skilled professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement learning.... ...years of experience in deploying GPU systems, optimizing model performance, and working with frameworks like SGLang and Megatron. The position...
  Tags: Performance
- A leading AI infrastructure company based in New York seeks skilled engineers to optimize ML systems at scale. The ideal candidate has over 5 years of high-performance coding experience, familiarity with Nvidia GPU architecture, and expertise in ML frameworks like Torch...
  Tags: Performance
$180k - $220k
...NewtonX NewtonX delivers AI-powered B2B insights to... ..., you'll own the core LLM infrastructure powering... ...intelligence at scale. Architect automated systems... ...solutions while maintaining engineering best practices Nice to... ...base and annual performance bonus) and equity. (Please...
Tags: Performance, Immediate start, Remote work, Flexible hours
- ...nonprofit applied AI research organization... ...expanding the scale, complexity, and breadth... ...seeking Software Engineers to develop scalable... ...ensure robust, correct LLM outputs Collaborate... ..., scaling, and performance optimizations in real... ...from non‑technical staff Are flexible and...
  Tags: Performance, Full time, Contract work, Part time, For contractors, Flexible hours
- Google is seeking AI/ML software engineers in New York City to design and enhance GPU architectures for cutting-edge AI applications. This role involves identifying performance metrics for LLMs, engaging with teams to address ML challenges, and delivering scalable solutions...
  Tags: Performance
- ...HRB is seeking a QA Engineer in the United States with expertise in API testing and LLM fine-tuning. The successful candidate will ensure the quality of our 'Agent... ...API interactions, preparing datasets for model performance, and maintaining security protocols....
  Tags: Performance
- ...A pioneering AI infrastructure company is seeking a GPU Cloud Platform Engineer to design and operate large-scale GPU clusters. This remote position aims to ensure high availability and performance of containerized AI workloads across cloud environments. The ideal candidate...
  Tags: Performance, Remote work
$125.5k - $230.2k
Forward Deployed Engineer - Applied AI - Manager - Financial Services... ...Build and integrate LLM, RAG, and agentic... ...capabilities. Improve performance, resilience, maintainability... ...at enterprise scale (e.g. LlamaIndex, LangChain... ...execution. Familiarity with GPU‑accelerated AI...
Tags: Performance, Contract work, Summer holiday, Flexible hours
$144k - $329.1k
Forward Deployed Engineer - Applied AI - Senior Manager - Financial... ...at enterprise scale. Ensure API layers and... ...development. Agentic and LLM Ops Expertise in designing... ....). Experience with GPU‑accelerated AI... ...rewarded based on your performance and recognized for the...
Tags: Performance, Contract work, Summer holiday, Immediate start, Flexible hours
- ...As the first ML Ops Engineer at Tennr, you'll... ...Machine Learning and AI systems. You'll own... ...managing models at scale. Develop and maintain... ...systems to enhance performance and efficiency.... ...evaluation of ML & LLM systems.... ...Experience with GPU orchestration, including...
  Tags: Performance, Work at office
- ...understanding in healthcare. Our AI-powered platform was purpose-... ..., technologists, and engineers working together to empower people... ...team and help improve the performance, stability, and scalability of... ...security, will be under tremendous scale, and presents many...
  Tags: Performance, Hourly pay, Full time, Flexible hours
$250k - $350k
...AI is becoming vitally important in every... ...of our society. At Scale, our mission is to... ...algorithms to reach the performance necessary for... ...an ML Sys Research Engineer, you'll work on building... ...least 1-3 years of LLM training in a... ...architecture of the modern GPU cluster...
Tags: Performance, Full time
- Nerdleveltech is looking for a Senior AI Engineer (Node.js / Next.js / TypeScript) to enhance... ...and develop production-ready LLM experiences. You'll take full responsibility... ...emphasizes data-driven decisions around model performance and optimizes deployment based on quantitative...
  Tags: Performance, Remote job, Flexible hours
$152k - $241.5k
Senior AI and FSI Developer Technology Engineer... ...push the limits of performance at the intersection... ...analyze, optimize, and scale complex AI and HPC... ...for modern CPU and GPU architectures. Profiling... ...TensorRT, TensorRT-LLM, and cuTile...
Tags: Performance
$105k - $140k
...leading technology company in Secaucus, NJ seeks a Reliability Engineer responsible for ensuring cloud hardware reliability through... ...principles. The ideal candidate will develop guidelines, analyze performance data, and collaborate across teams to meet customer...
Tags: Performance
$144k - $286k
...the leader in digital performance solutions, helping our... ...DoubleVerify is hiring a Staff Enterprise Architect... ...at enterprise scale. This is a hands-on... ...with enough rigor that engineering teams can build against... ...reusable patterns. AI / LLM enablement Maintain...
Tags: Performance, Live in
$105k - $140k
...to our customers. We are looking for a passionate Reliability Engineer with exceptional knowledge and experience developing and... ...of reliability engineering guidelines to improve product field performance through design enhancements to meet reliability goals. Uses principles...
Tags: Performance, Permanent employment, Work experience placement, Work at office, Local area
$116.25k - $193.75k
Principal Systems Electrical Engineer - AI/Hyperscale Servers... ...: R-104243 About The Role: The Senior Staff Electrical Engineer will have... ...and limits. Drive balance between cost, performance, quality, and schedule and supervise international...
Tags: Performance, Permanent employment, For contractors, Work at office, Local area
$170k - $235k
...We're hiring a Staff Technologist to own the delivery... ...that spans product and engineering. You shape the product... ...regulated environment where AI is reshaping what's... ...-on experience with AI/LLM systems: building,... ...Additionally, we offer a performance-based bonus program, 40...
Tags: Performance, Work at office, Remote work, Flexible hours, Shift work
- Intuit is looking for a Staff Design System Engineer in New York, New York. This role involves owning the technical vision for the QuickBooks Design... ...such as React and TypeScript, experience with performance optimization, and a strong understanding of UI accessibility...
  Tags: Performance
$83k - $132k
...Bare Metal Support Engineer Livingston, NJ / New York... ...Essential Cloud for AI™. Built for pioneers by... ...innovators to build and scale AI with confidence. Trusted... ...infrastructure performance with deep technical expertise... ...CoreWeave's extensive GPU fleet across our growing...
Tags: Performance, Permanent employment, Temporary work, Casual work, Work at office, Remote work, Flexible hours, Shift work
$300k
...Staff + Sr. Software Engineer, Inference San Francisco, CA | New York... ...interpretable, and steerable AI systems. We want AI... ...scientists the high-performance inference... ...High-performance, large-scale distributed systems... ...management systems LLM inference optimization...
Tags: Performance, Work at office, Worldwide, Visa sponsorship, Flexible hours
$253.3k - $354.6k
...Ladders is seeking a Staff Machine Learning Engineer to drive AI initiatives in the Media space. This fully remote... ...Responsibilities include designing GPU-based systems, developing cloud-based... ...AI solutions, and ensuring model performance. The role offers competitive compensation...
Tags: Performance, Remote work
- ...search of a Quality Assurance Engineer – AI-Augmented Quality Engineering... ...supplement the growth and scaling of the company. Millennium Systems... ...ChatGPT, Cursor, or similar LLM-based assistants) to... ...functional testing practices – performance, scalability, failover, reliability...
  Tags: Performance, Temporary work, Remote work, Shift work
- ...Nscale is the GPU cloud engineered for AI. We provide cost‑effective, high‑performance infrastructure for AI start‑ups and large enterprise customers. Nscale enables... ...SOC). You will be responsible for building and scaling a world‑class cyber defense capability capable...
  Tags: Performance, Remote work, Flexible hours
- ...Machine Learning Systems Engineer, RL Engineering... ..., and steerable AI systems. We want AI... ...obsessively on improving the performance, robustness, and... ...performance, large scale distributed systems Large scale LLM training Python... ..., we expect all staff to be in one of our...
  Tags: Performance, Work at office, Visa sponsorship, Flexible hours
$184.05k - $262.93k
...machine learning, platform engineering, and regulatory... ...is investing in large-scale rearchitecture and ML-driven... ...including multimodal and LLM-based systems Build... ...to improve model performance, reliability, and fairness... ...artificial intelligence (AI) tools to support parts...
Tags: Performance, Work from home, Flexible hours
$139k - $204k
...Senior Software Engineer, Storage Engineer Livingston... ...Essential Cloud for AI™. Built for pioneers by... ...innovators to build and scale AI with confidence. Trusted... ...infrastructure performance with deep technical expertise... ...such as RDMA, GPU Direct Storage, and distributed...
Tags: Performance, Permanent employment, Temporary work, Casual work, Work at office, Remote work, Flexible hours

