Remote GPU Performance Engineer: Scale Training & Inference
Reka
A global AI foundation model startup is seeking an experienced GPU Performance Engineer to enhance training infrastructure and optimize model performance. The ideal candidate will have strong skills in Python and experience with large-scale model training, including GPU code optimization. This role offers a collaborative environment featuring top-tier talent and generous benefits, including extensive paid leave and visa support. Join a team committed to AI innovation and excellence.
- ...hands-on support from AMD engineers the team is scaling rapidly to build the full... ...skilled Distributed Training and Inference Engineer to build, optimize... ...interactions, and optimizing performance at every layer of the ML... ...across multi-node GPU/accelerator clusters....
Remote work, Training, Performance, Flexible hours
$167.2k - $209k
...seeking a Senior Engineer 2 to join our AI Inference Data Plane team.... ...delivering high-scale, resilient data... ...industry-leading performance and reliability.... ...of GPU-level optimisation... ...09,000 This is a remote role Why You'll Like... ...relevant conferences, training, and education....
Remote work, Training, Performance, Local area, Worldwide, Flexible hours
- ...GPU Systems Engineer (CUDA) Job Title: GPU Systems Engineer... ...) Location: 100% Remote (Continental United... ...architecture, and high-performance computing to design... ...GPU platforms for AI training, inference, scientific computing... ...with large-scale distributed training...
Remote work, Training, Performance, Full time, H1b, Local area, Visa sponsorship
$160k - $253k
...accelerated computing is the engine of artificial... ...platforms integrate high performance compute, networking,... ...ecosystem to power AI at scale. We are looking for a... ...showcasing NVIDIA's GPU architecture, server-level... ...and efficiency for AI inference & training. What you'll be...
Remote work, Training, Performance
- ...are currently seeking an On-Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North... ...Engineer to build and maintain large-scale on-prem LLM infrastructure. This is... ...inferencing; this role involves no model training infrastructure or fine-tuning...
Remote work, Training
$100k - $150k
...seeking a skilled GPU Systems Engineer (CUDA) to join... ...full-time, 100% remote position (Continental... ...implement high-performance CUDA kernels for... ...GPU and multi-node training using NCCL, RDMA,... ...in training and inference pipelines Develop... ...Experience with large-scale distributed...
Remote job, Training, Performance, Full time, H1b, Immediate start, Visa sponsorship
$184.94k - $305.13k
...vLLM and LLM-D Engineering team at Red Hat... ...our cutting‑edge inference platform (LLM-D... ...optimize, and scale distributed... ...deployments by running performance benchmarks,... ...Token (TPOT), GPU utilization, GPU... ...skills and training, external market... ...positions with Remote-US locations, the...
Remote work, Training, Performance, Permanent employment, Full time, Contract work, Work experience placement, Work at office, Flexible hours
$170k - $300k
...from data and model training through to production... .... Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own... ...Systems Engineer - GPU Performance to play a key role... ...secondary caregivers. * Remote work reimbursement:...
Remote work, Training, Performance, Temporary work, Immediate start
- ...5+ years of experience in GPU computing or distributed systems... .... Experience optimizing performance of distributed AI workloads... ...: Experience with AI training or inference frameworks (PyTorch,... ...Science, Computer or Electrical Engineering, Mathematics, or a related...
Remote work, Training, Performance
- ...professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement learning. The role demands several... ...experience in deploying GPU systems, optimizing model performance, and working with frameworks like SGLang and Megatron...
Performance
- ...Seeking a full-time remote GPU Systems Engineer (CUDA) with over six years of experience to design and optimize high-performance CUDA kernels for compute-intensive workloads, collaborating... ...performance bottlenecks in training and inference pipelines Required Qualifications...
Remote work, Training, Performance, Full time
- ...GPU Kernel Engineer Our R&D team is seeking expert level GPU... ...collaborate with the training team to define robust... ...and implement high performance GPU kernels. This job... ...Gdansk or New York City. Remote work will be... ...combination with tuning inference engine (vLLM, SGLang,...
Remote work, Training, Performance
$167.2k - $209k
...seeking a Senior Engineer 2 to play a key technical... ...role in our AI Inference Optimization team... ...industry-leading performance for our inference... ...engine and GPU kernel layers, ensuring... ...,000 ~ This is a remote role JR: 2026... ...conferences, training, and education. All...
Remote work, Training, Performance, Local area, Worldwide, Flexible hours
- ...innovative company is seeking a talented software engineer to join their dynamic Inference team. This role involves designing and implementing infrastructure for large-scale multimodal models, focusing on high-performance delivery of audio and image inputs. You'll...
Performance
$224k - $356.5k
...era of computing. An era in which our GPU acts as the brains of computers, robots... ...world. We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is... ...CUDA, C++, and GPU profiling to optimize training and rendering workflows used in...
Remote work, Training, Performance
- ...for a CUDA Kernel Engineer who has hands‑on experience... ...will work on the GPU performance layer powering large‑scale, high‑throughput AI... ...Location: Remote US Start date: ASAP... ...Knowledge of model inference optimization (TensorRT... ...compensation, and training. We are committed...
Remote work, Training, Performance, Local area, Immediate start, Relocation package
$150k - $240k
..., well-funded, remote-first company with... ..., deploy, and scale custom AI... ...purpose-built for GPU-centric compute... ...looking for an Engineering Manager, Datacenter... ...- ensuring performance, reliability, and... ...for training workloads. This... ...supporting training, inference, checkpointing,...
Remote work, Training, Performance, Flexible hours
- ...Senior Distinguished Engineer, AI Compute (Remote Eligible)... ...and scalable, high‑performance AI infrastructure.... ...delivering the high‑scale developer and runtime... ...on top of CPU and GPU substrates. Your contributions... ...to ML / DL model training, model inference and feature...
Remote work, Training, Performance, Local area
$250k
...Join a rapidly scaling AI cloud infrastructure... ...next-generation GPU platform designed for AI training, experimentation, and inference at scale. The... ...Reliability Engineer to support and scale... ..., and performance of HPC and cloud... ...options Bonus Remote working option and...
Remote work, Training, Performance, Permanent employment
$286.2k - $326.7k
...Senior Distinguished Engineer, AI Compute (Remote Eligible) At... ...and scalable, high-performance AI infrastructure.... ...delivering the high-scale developer and runtime... ...on top of CPU and GPU substrates. Your contributions... ...to ML / DL model training, model inference and feature...
Remote work, Training, Performance, Full time, Part time, Local area
$250k - $280k
...organizations build, run, and scale AI and accelerated... ...grow and performance demands increase, NeuralMesh... ...that maximizes GPU utilization, accelerates... ...win, we win. Product & Engineering Highlights WEKA is architecting... ...and make AI model training and inference, machine learning,...
Remote work, Training, Performance, Work experience placement, Local area, Flexible hours
- ...orchestration at a planetary scale. Our mission is to... ..., enabling high-performance computing for AI training and inference on a wide spectrum... ...seeking a Research Engineer with a passion for... ..., heterogeneous GPU resources.... ...in a collaborative, remote environment. A background...
Remote work, Training, Performance, Full time, Flexible hours
$144k - $192k
...Learning Systems Engineer Boston, MA... ...researchers to train frontier models at scale, focusing obsessively... ...and high-performance systems engineering... ...high-performance GPU kernels in Triton... ...during training and inference, alongside a... ...role can be fully remote. The salary...
Remote work, Training, Performance, Work at office
- ...Model Serving Engineer Bright Vision... ...Location: 100% Remote (Continental United... ...and operate high-performance, highly reliable inference platforms for... ...caching, autoscaling, GPU utilization, and... ...AI APIs at scale. How to Apply... ..., hiring, training, compensation, promotion...
Remote work, Training, Performance, Full time, H1b, Local area, Immediate start, Visa sponsorship
$152k - $241.5k
...NVIDIA's invention of the GPU 1999 sparked the... ...top-tier AI Compiler Engineers to drive innovation within... ...is possible in AI performance and help build the... ...tangible impact on a global scale. What you’ll be... ...AI workloads (both inference and training) and successfully...
Training, Performance
- ...Moveworks’ Reasoning Engine and natural... ...backed by the global scale of ServiceNow and... ...for distributed training and inference, model evaluation... ...for model performance and efficiency, making... ...to optimize our GPU infrastructure for... ...personas (flexible, remote, or required in...
Remote work, Training, Performance, Work at office, Flexible hours
- ...GPU Kernel Engineer Sciforium is an AI infrastructure company developing... ...engineers the team is scaling rapidly to build the full... ...pushing the limits of performance on modern accelerators. In... ...frameworks used for large-scale training and inference. This role is ideal...
Training, Performance, Flexible hours
$180k - $250k
...Software Engineer, Distributed Systems... ...production, and do it at scale without... ...platform where high-performance inference, orchestration, and... ...orchestration, scheduling, GPU autoscaling,... .../ML inference or training infrastructure... ...willing to consider remote for Senior and...
Remote work, Training, Performance, Currently hiring, Relocation package
- ...Tokens-as-a-Service (TaaS) Engineer We are seeking a... ...that convert large-scale infrastructure capacity... ...you will work across performance benchmarking, tokenomics... ...stack, ensuring GPU capacity can be onboarded... ...Familiarity with model porting, inference/training workloads, token...
Remote work, Training, Performance
- Machine Learning Engineer, Inference Want to solve realtime inference... ...used at massive scale across customer support... ...inference, scheduler design, GPU utilisation,... ...already operates beyond the performance of most publicly... ...and benefits. Location: Remote across the US or Europe...
Remote job, Performance, Flexible hours

