Remote GPU Performance Engineer: Scale Training & Inference

Reka

A global AI foundation model startup is seeking an experienced GPU Performance Engineer to enhance training infrastructure and optimize model performance. The ideal candidate will have strong skills in Python and experience with large-scale model training, including GPU code optimization. This role offers a collaborative environment featuring top-tier talent and generous benefits, including extensive paid leave and visa support. Join a team committed to AI innovation and excellence.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Remote GPU Performance Engineer: Scale Training & Inference in New York, NY vacancy
  •  ...hands-on support from AMD engineers the team is scaling rapidly to build the full...  ...skilled Distributed Training and Inference Engineer to build, optimize...  ...interactions, and optimizing performance at every layer of the ML...  ...across multi-node GPU/accelerator clusters.... 
    Remote work
    Training
    Performance
    Flexible hours

    Sciforium

    United States
    4 days ago
  • $167.2k - $209k

     ...seeking a Senior Engineer 2 to join our AI Inference Data Plane team....  ...delivering high-scale, resilient data...  ...industry-leading performance and reliability....  ...of GPU-level optimisation...  ...09,000 This is a remote role Why You'll Like...  ...relevant conferences, training, and education.... 
    Remote work
    Training
    Performance
    Local area
    Worldwide
    Flexible hours

    DigitalOcean

    Seattle, WA
    7 hours ago
  •  ...GPU Systems Engineer (CUDA) Job Title: GPU Systems Engineer...  ...) Location: 100% Remote (Continental United...  ...architecture, and high-performance computing to design...  ...GPU platforms for AI training, inference, scientific computing...  ...with large-scale distributed training... 
    Remote work
    Training
    Performance
    Full time
    H1b
    Local area
    Visa sponsorship

    Bright Vision Technologies

    United States
    1 day ago
  • $160k - $253k

     ...accelerated computing is the engine of artificial...  ...platforms integrate high performance compute, networking,...  ...ecosystem to power AI at scale. We are looking for a...  ...showcasing NVIDIA's GPU architecture, server-level...  ...and efficiency for AI inference & training. What you'll be... 
    Remote work
    Training
    Performance

    NVIDIA

    United States
    19 hours ago
  •  ...are currently seeking an On-Premise LLM Inference & GPU Systems Engineer to join our team in Charlotte, North...  ...Engineer to build and maintain large-scale on-prem LLM infrastructure. This is...  ...inferencing; this role involves no model training infrastructure or fine-tuning... 
    Remote work
    Training

    The Nippon Telegraph and Telephone Corporation (NTT)

    Charlotte, NC
    3 days ago
  • $100k - $150k

     ...seeking a skilled GPU Systems Engineer (CUDA) to join...  ...full-time, 100% remote position (Continental...  ...implement high-performance CUDA kernels for...  ...GPU and multi-node training using NCCL, RDMA,...  ...in training and inference pipelines Develop...  ...Experience with large-scale distributed... 
    Remote job
    Training
    Performance
    Full time
    H1b
    Immediate start
    Visa sponsorship

    Bright Vision Technologies

    Fremont, CA
    2 days ago
  • $184.94k - $305.13k

     ...vLLM and LLM-D Engineering team at Red Hat...  ...our cutting‑edge inference platform (LLM-D...  ...optimize, and scale distributed...  ...deployments by running performance benchmarks,...  ...Token (TPOT), GPU utilization, GPU...  ...skills and training, external market...  ...positions with Remote-US locations, the... 
    Remote work
    Training
    Performance
    Permanent employment
    Full time
    Contract work
    Work experience placement
    Work at office
    Flexible hours

    Red Hat, Inc.

    Boston, MA
    3 days ago
  • $170k - $300k

     ...from data and model training through to production...  .... Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own...  ...Systems Engineer - GPU Performance to play a key role...  ...secondary caregivers. * Remote work reimbursement:... 
    Remote work
    Training
    Performance
    Temporary work
    Immediate start

    Nebius

    United States
    1 day ago
  •  ...5+ years of experience in GPU computing or distributed systems...  .... Experience optimizing performance of distributed AI workloads...  ...: Experience with AI training or inference frameworks (PyTorch,...  ...Science, Computer or Electrical Engineering, Mathematics, or a related... 
    Remote work
    Training
    Performance

    Cynet Systems

    United States
    3 days ago
  •  ...professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement learning. The role demands several...  ...experience in deploying GPU systems, optimizing model performance, and working with frameworks like SGLang and Megatron... 
    Performance

    Reflection

    New York, NY
    1 day ago
  •  ...Seeking a full-time remote GPU Systems Engineer (CUDA) with over six years of experience to design and optimize high-performance CUDA kernels for compute-intensive workloads, collaborating...  ...performance bottlenecks in training and inference pipelines Required Qualifications... 
    Remote work
    Training
    Performance
    Full time

    Virtual Vocations Inc

    United States
    3 days ago
  •  ...GPU Kernel Engineer Our R&D team is seeking expert level GPU...  ...collaborate with the training team to define robust...  ...and implement high performance GPU kernels. This job...  ...Gdansk or New York City. Remote work will be...  ...combination with tuning inference engine (vLLM, SGlang,... 
    Remote work
    Training
    Performance

    Makora

    United States
    1 day ago
  • $167.2k - $209k

     ...seeking a Senior Engineer 2 to play a key technical...  ...role in our AI Inference Optimization team...  ...industry-leading performance for our inference...  ...engine and GPU kernel layers, ensuring...  ...,000 ~ This is a remote role JR: 2026...  ...conferences, training, and education. All... 
    Remote work
    Training
    Performance
    Local area
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    2 days ago
  •  ...innovative company is seeking a talented software engineer to join their dynamic Inference team. This role involves designing and implementing infrastructure for large-scale multimodal models, focusing on high-performance delivery of audio and image inputs. You'll... 
    Performance

    OpenAI

    San Francisco, CA
    19 hours ago
  • $224k - $356.5k

     ...era of computing. An era in which our GPU acts as the brains of computers, robots...  ...world. We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is...  ...CUDA, C++, and GPU profiling to optimize training and rendering workflows used in... 
    Remote work
    Training
    Performance

    NVIDIA

    United States
    4 days ago
  •  ...for a CUDA Kernel Engineer who has hands‑on experience...  ...will work on the GPU performance layer powering large‑scale, high‑throughput AI...  ...Location: Remote US Start date: ASAP...  ...Knowledge of model inference optimization (TensorRT...  ...compensation, and training. We are committed... 
    Remote work
    Training
    Performance
    Local area
    Immediate start
    Relocation package

    Pragmatike

    Austin, TX
    2 days ago
  • $150k - $240k

     ..., well-funded, remote-first company with...  ..., deploy, and scale custom AI...  ...purpose-built for GPU-centric compute...  ...looking for an Engineering Manager, Datacenter...  ...- ensuring performance, reliability, and...  ...for training workloads. This...  ...supporting training, inference, checkpointing,... 
    Remote work
    Training
    Performance
    Flexible hours

    RunPod, Inc.

    United States
    3 days ago
  •  ...Senior Distinguished Engineer, AI Compute (Remote Eligible)...  ...and scalable, high‑performance AI infrastructure....  ...delivering the high‑scale developer and runtime...  ...on top of CPU and GPU substrates. Your contributions...  ...to ML / DL model training, model inference and feature... 
    Remote work
    Training
    Performance
    Local area

    Information Technology Senior Management Forum

    San Francisco, CA
    4 days ago
  • $250k

     ...Join a rapidly scaling AI cloud infrastructure...  ...next-generation GPU platform designed for AI training, experimentation, and inference at scale. The...  ...Reliability Engineer to support and scale...  ..., and performance of HPC and cloud...  ...options Bonus  Remote working option and... 
    Remote work
    Training
    Performance
    Permanent employment
    San Francisco, CA
    7 days ago
  • $286.2k - $326.7k

     ...Senior Distinguished Engineer, AI Compute (Remote Eligible) At...  ...and scalable, high-performance AI infrastructure....  ...delivering the high-scale developer and runtime...  ...on top of CPU and GPU substrates. Your contributions...  ...to ML / DL model training, model inference and feature... 
    Remote work
    Training
    Performance
    Full time
    Part time
    Local area

    Capital One

    United States
    3 days ago
  • $250k - $280k

     ...organizations build, run, and scale AI and accelerated...  ...grow and performance demands increase, NeuralMesh...  ...that maximizes GPU utilization, accelerates...  ...win, we win. Product & Engineering Highlights WEKA is architecting...  ...and make AI model training and inference, machine learning,... 
    Remote work
    Training
    Performance
    Work experience placement
    Local area
    Flexible hours

    WekaIO

    United States
    4 days ago
  •  ...orchestration at a planetary scale. Our mission is to...  ..., enabling high-performance computing for AI training and inference on a wide spectrum...  ...seeking a Research Engineer with a passion for...  ..., heterogeneous GPU resources....  ...in a collaborative, remote environment. A background... 
    Remote work
    Training
    Performance
    Full time
    Flexible hours

    Yotta Labs

    New York, NY
    4 days ago
  • $144k - $192k

     ...Learning Systems Engineer Boston, MA...  ...researchers to train frontier models at scale, focusing obsessively...  ...and high-performance systems engineering...  ...high-performance GPU kernels in Triton...  ...during training and inference, alongside a...  ...role can be fully remote. The salary... 
    Remote work
    Training
    Performance
    Work at office

    Venturefizz Product Management Community

    Boston, MA
    3 days ago
  •  ...Model Serving Engineer Bright Vision...  ...Location: 100% Remote (Continental United...  ...and operate high-performance, highly reliable inference platforms for...  ...caching, autoscaling, GPU utilization, and...  ...AI APIs at scale. How to Apply...  ..., hiring, training, compensation, promotion... 
    Remote work
    Training
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Visa sponsorship

    Bright Vision Technologies

    United States
    2 days ago
  • $152k - $241.5k

     ...NVIDIA's invention of the GPU in 1999 sparked the...  ...top-tier AI Compiler Engineers to drive innovation within...  ...is possible in AI performance and help build the...  ...tangible impact on a global scale. What you’ll be...  ...AI workloads (both inference and training) and successfully... 
    Training
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  •  ...Moveworks’ Reasoning Engine and natural...  ...backed by the global scale of ServiceNow and...  ...for distributed training and inference, model evaluation...  ...for model performance and efficiency, making...  ...to optimize our GPU infrastructure for...  ...personas (flexible, remote, or required in... 
    Remote work
    Training
    Performance
    Work at office
    Flexible hours

    ServiceNow

    Mountain View, CA
    2 days ago
  •  ...GPU Kernel Engineer Sciforium is an AI infrastructure company developing...  ...engineers the team is scaling rapidly to build the full...  ...pushing the limits of performance on modern accelerators. In...  ...frameworks used for large-scale training and inference. This role is ideal... 
    Training
    Performance
    Flexible hours

    Sciforium

    San Francisco, CA
    3 days ago
  • $180k - $250k

     ...Software Engineer, Distributed Systems...  ...production, and do it at scale without...  ...platform where high-performance inference, orchestration, and...  ...orchestration, scheduling, GPU autoscaling,...  .../ML inference or training infrastructure...  ...willing to consider remote for Senior and... 
    Remote work
    Training
    Performance
    Currently hiring
    Relocation package

    Fal

    United States
    3 days ago
  •  ...Tokens-as-a-Service (TaaS) Engineer We are seeking a...  ...that convert large-scale infrastructure capacity...  ...you will work across performance benchmarking, tokenomics...  ...stack, ensuring GPU capacity can be onboarded...  ...Familiarity with model porting, inference/training workloads, token... 
    Remote work
    Training
    Performance

    OpenAI

    United States
    2 days ago
  • Machine Learning Engineer, Inference Want to solve realtime inference...  ...used at massive scale across customer support...  ...inference, scheduler design, GPU utilisation,...  ...already operates beyond the performance of most publicly...  ...and benefits. Location: Remote across the US or Europe... 
    Remote job
    Performance
    Flexible hours

    Trades Workforce Solutions

    San Francisco, CA
    4 days ago
