Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior ML Performance Engineer - GPU & Inference

Modal Labs

About Us: Modal provides the infrastructure foundation for AI teams. With instant GPU access, sub-second container startups, and native storage, Modal makes it simple to train models, run batch jobs, and serve low-latency inference. We have thousands of customers who rely on us for production AI workloads, including Lovable, Scale AI, Substack, and Suno. We're a fast-growing team based out of NYC, SF, and Stockholm. We've hit 9-figure ARR and recently raised a Series B at a $1.1B valuation. Our investors include Lux Capital, Redpoint Ventures, Amplify Partners, and Elad Gil. Working at Modal means joining one of the fastest-growing AI infrastructure organizations at an early stage, with many opportunities to grow within the company. Our team includes creators of popular open-source projects (e.g. Seaborn, Luigi), academic researchers, international olympiad medalists, and experienced engineering and product leaders with decades of experience. The Role We are looking for strong engineers with experience in making ML systems performant at scale. If you are interested in contributing to open-source projects and Modal’s container runtime to push language and diffusion models towards higher throughput and lower latency, we’d love to hear from you! Requirements 5+ years of experience writing high-quality, high-performance code. Experience working with torch, high-level ML frameworks, and inference engines (vLLM or TensorRT). Familiarity with Nvidia GPU architecture and CUDA. Experience with ML performance engineering (tell us a story about boosting GPU performance — debugging SM occupancy issues, rewriting an algorithm to be compute-bound, eliminating host overhead, etc). Nice-to-have: familiarity with low-level operating system foundations (Linux kernel, file systems, containers, etc). #J-18808-Ljbffr Modal Labs

Vacancy posted 3 hours ago
Similar jobs that could be interesting for youBased on the Senior ML Performance Engineer - GPU & Inference in New York, NY vacancy
  • $128.7k - $261.3k

     ...to model export, kernel development, and performance engineering so that every cycle on our accelerators...  ...AI Kernels team builds high‑performance GPU kernels and custom libraries that sit at the heart of our on‑vehicle ML inference for ADAS and autonomous driving. We own... 
    Senior
    Performance
    Flexible hours

    General Motors

    New York, NY
    12 hours ago
  •  ...professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement learning. The role demands several...  ...experience in deploying GPU systems, optimizing model performance, and working with frameworks like SGLang and Megatron.... 
    Senior
    Performance

    Reflection

    New York, NY
    2 days ago
  • $200k - $250k

     ...we’re building the top-performing AI Shopping Agent that...  ..., and trust. Our ML models power the core...  ...seeking an experienced Senior MLOps Engineer to take ownership of how...  ...- for a custom-built inference platform powering a live...  ...latency, availability, GPU utilization, TTFT, ITL... 
    Senior
    Performance
    Remote work
    Flexible hours

    Wizard

    New York, NY
    12 hours ago
  • $128.7k - $261.3k

     ...kernel development, and performance engineering so that every cycle on...  ...into fast, reliable inference across GPUs powering GM...  ..., systems, and GPU engineers who enjoy working...  ...driving. The Role As a Senior Compiler Engineer on the...  ..., and effortless for ML engineers across the... 
    Senior
    Performance
    Flexible hours

    General Motors

    New York, NY
    12 hours ago
  • $165k - $225k

     ...one of its clients a Senior Machine Learning Engineer - this is a fully...  ...experienced Senior ML Engineer to join our...  ...evaluate algorithm performance, validate research hypotheses...  ...Experience with GPU acceleration and...  ...TensorRT/ONNX export, and inference serving frameworks... 
    Senior
    Performance
    Remote work
    Worldwide

    Career Renew

    New York, NY
    13 days ago
  • $200k - $220k

    Senior Machine Learning Engineer (Fully Remote) Base pay range: $200,000....  ...will build scalable ML infrastructure, deploy...  ...practices in a high‑performance environment....  ...frameworks. Build multi‑GPU training pipelines and...  ...training to batch inference, ensuring automation... 
    Senior
    Performance
    Remote job
    Flexible hours

    Harnham

    New York, NY
    12 hours ago
  •  ...members to join our team. Senior Machine Learning Engineer Position Summary We...  ...care more about deep ML/CV ability than any...  ...Design systems that infer structured attributes...  ...and benchmark high‑performance image retrieval capabilities...  ..., quantization, and GPU acceleration Read... 
    Senior
    Performance
    Remote work
    Flexible hours

    Clearview AI

    New York, NY
    1 day ago
  •  ...looking for an experienced AI Model Engineer with deep expertise in kernel...  ...optimization, fine‑tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and...  ...testing, fine‑tuned adapter performance). Conduct GPU testing across desktop... 
    Senior
    Performance
    Remote job

    Framework Ventures

    New York, NY
    12 hours ago
  •  ...infrastructure company based in New York is seeking experienced engineers to enhance the performance of ML systems and contribute to open-source projects. Ideal...  ...writing high-quality code and familiarity with Nvidia GPU architecture and ML frameworks. This role offers... 
    Performance

    Modal

    New York, NY
    12 hours ago
  • $216.7k - $303.4k

    Senior Machine Learning Systems Engineer Remote - United States Reddit is a community of communities...  ...You’ll Do: As a Senior ML Infrastructure Engineer,...  ...with ML engineers on performance tuning, including improving...  ...training time, efficiency, and GPU training costs in a large,... 
    Senior
    Performance
    Remote job
    For contractors
    Work experience placement

    reddit

    New York, NY
    12 hours ago
  • $160k - $240k

    Senior MLOps Engineer - Artificial Intelligence Location New York...  ...of Machine Learning (ML) and Software...  ...processes, enhance the performance of our systems and more...  ...disk / network / CPU / GPU) usage Work closely...  ...continuous model training, inference, and monitoring... 
    Senior
    Performance
    Temporary work
    For contractors
    Work experience placement

    Bloomberg L.P.

    New York, NY
    2 days ago
  • $170k - $220k

     ...,000 clients nationwide. Our ML and AI capabilities are expanding...  ...first dedicated ML Platform Engineer, you'll define the technical...  ...and are investing in hosted GPU inference to support the next...  ...deployment workflows Develop high-performance GPU inference pipelines with... 
    Senior
    Performance
    Full time
    Work at office
    Local area

    Charlie Health Engineering, Product & Design

    New York, NY
    23 days ago
  • $144.7k - $261.3k

    Senior ML Validation Research Engineer will lead applied machine learning research focused on improving verification...  ...Prototype research concepts into performant tools integrated into CI/CD and...  ...Knowledge of Bayesian ML, causal inference, and sequential testing. Experience... 
    Senior
    Performance
    Flexible hours

    General Motors

    New York, NY
    12 hours ago
  • $180k - $250k

     ...Description Job Description Senior Machine Learning Engineer Location: Remote (U....  ...Design, train, and deploy ML models that power...  ...uplift modeling, and causal inference. Collaborate with leadership...  ...techniques and performance measurement. Bonus... 
    Senior
    Performance
    Full time
    Remote work

    SW5 Consulting

    New York, NY
    22 days ago
  • $144.7k - $261.3k

     ...developer environments, cloud infrastructure, and ML/AI GPU platforms for AV research and development teams...  ...run faster in GM. The Role GM is looking for a Senior Capacity Engineer to join the AV Capacity and Performance Engineering team in the AV Infrastructure org to... 
    Senior
    Performance
    Work experience placement
    Local area
    Remote work
    Work from home
    Flexible hours

    General Motors

    New York, NY
    12 hours ago
  • $150k - $175k

     ...production deployment of multimodal ML models that quantify creative quality and predict ad performance. This role is the technical...  ...distributed training and inference. Reduce training time and inference...  ...Required: ~5+ years in ML engineering or MLOps, with shipped... 
    Senior
    Performance
    Work experience placement
    Local area

    KARGO

    New York, NY
    25 days ago
  • $152k - $228k

     ...Job Description Job Description Senior ML Engineer About Invoca Invoca is an AI-powered...  ...model training and fine-tuning through inference optimization and production APIs. We move...  ...Server, Baseten, and Kubernetes-based GPU infrastructure. Profile and tune for... 
    Senior
    Currently hiring
    Remote work
    Flexible hours

    Invoca

    New York, NY
    14 days ago
  •  ...Go, or C++. You will play a pivotal role in creating predictive models that enhance ad ranking and recommendations, ensuring high performance and user engagement. This is an opportunity to make a significant impact in a rapidly growing tech environment. #J-18808-Ljbffr... 
    Senior
    Performance

    Toogeza

    New York, NY
    2 days ago
  •  ...healthcare management company seeks a Machine Learning Engineer to develop and deliver end-to-end ML solutions. The ideal candidate will have strong...  ...partners to enhance ML products and ensure robust model performance. Join us to help improve patient outcomes and contribute... 
    Senior
    Performance

    InterWell Health

    New York, NY
    12 hours ago
  • Mozilla Corporation is looking for a Senior Machine Learning Engineer specializing in applied AI modeling. This role involves the development of...  ...position offers generous benefits, including health coverage, performance bonuses, and professional development opportunities. #J... 
    Senior
    Performance
    Remote work

    Mozilla Corporation

    New York, NY
    1 day ago
  • The Athletic is seeking a Senior Machine Learning Operations Engineer to join their team remotely. You will be responsible for designing infrastructure...  ...learning model productionization, ensuring high-performance models and driving impactful projects. In this role,... 
    Senior
    Performance
    Remote job

    The Athletic

    New York, NY
    2 days ago
  • We are looking for a Senior Machine Learning Engineer, MLOps to help operationalize and...  ...and processes that enable ML to move from research into...  ...support model training and inference Build tooling and processes for monitoring model performance , system reliability, and operational... 
    Senior
    Performance
    Flexible hours

    ExaCare AI

    New York, NY
    1 day ago
  •  ...business needs. Collaborate with data scientists and software engineers to design and implement scalable and efficient solutions. Clean...  ...models into production environments and monitor their performance. Continuously improve model accuracy and performance through experimentation... 
    Senior
    Performance

    Resolve Tech Solutions, LLC

    New York, NY
    4 days ago
  • Overview As a Senior Machine Learning Engineer at Phia, you’ll build and scale production ML systems that power core product experiences...  ...Drive improvements to model performance, reliability, and...  ...experiment design and causal inference, including A/B testing and offline... 
    Senior
    Performance

    Phia

    New York, NY
    3 days ago
  • $153k - $198k

     ...have a good time doing it. As a Senior Machine Learning Engineer, you will own the end to end ML lifecycle at Button, from the...  ...workflows, model deployment, inference services, monitoring, and retraining...  ...inference services with clear performance, reliability, and latency... 
    Senior
    Performance
    Local area

    Button

    New York, NY
    12 hours ago
  • General Compute Inc. is seeking a Senior IC Engineer to develop the platform layer of their inference cloud, focusing on the OpenAI-compatible API. This role requires...  ...and optimization tasks while ensuring high performance and reliability. The ideal candidate will have... 
    Senior
    Performance

    General Compute Inc.

    New York, NY
    2 days ago
  • A leading automotive company in the United States is seeking an experienced GPU Software Engineer to design and implement high-performance GPU kernels for autonomous driving technologies. The position requires strong programming skills in CUDA and C++, and the ability... 
    Senior
    Performance

    General Motors

    New York, NY
    12 hours ago
  • Job Title: ML Platform Engineer - GPU Infrastructure Support team by designing, implementing, and maintaining the automation and ML workload...  ...Troubleshoot and improve platform reliability, scalability, and performance Collaborate with ML, infrastructure, and engineering... 
    Performance

    Optimal

    Brooklyn, NY
    2 days ago
  • $171.7k - $274.3k

    About the team As a Senior Machine Learning Engineer within Zillow’s Rich Media Experiences...  ...especially where structured inference, computer vision, spatial signals, or performance tradeoffs matter. Help establish...  ...machine learning models or ML‑powered systems in production... 
    Senior
    Performance
    Permanent employment
    Live in
    Work at office
    Local area
    Remote work

    Zillow

    New York, NY
    12 hours ago
  • $152k - $241.5k

     ...leading technology company in New York is seeking a Senior AI and FSI Developer Technology Engineer to enhance performance in the Financial Services Industry. The role...  ...programming, C/C++, and have a deep understanding of CPU/GPU architecture. The base salary ranges from $152,0... 
    Senior
    Performance

    NVIDIA Corporation

    New York, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Performance Engineer - GPU & Inference. Be the first to apply!