Senior ML Performance Engineer - GPU & Inference
Modal Labs
About Us

Modal provides the infrastructure foundation for AI teams. With instant GPU access, sub-second container startups, and native storage, Modal makes it simple to train models, run batch jobs, and serve low-latency inference. We have thousands of customers who rely on us for production AI workloads, including Lovable, Scale AI, Substack, and Suno.

We're a fast-growing team based out of NYC, SF, and Stockholm. We've hit 9-figure ARR and recently raised a Series B at a $1.1B valuation. Our investors include Lux Capital, Redpoint Ventures, Amplify Partners, and Elad Gil.

Working at Modal means joining one of the fastest-growing AI infrastructure organizations at an early stage, with many opportunities to grow within the company. Our team includes creators of popular open-source projects (e.g. Seaborn, Luigi), academic researchers, international olympiad medalists, and experienced engineering and product leaders with decades of experience.

The Role

We are looking for strong engineers with experience in making ML systems performant at scale. If you are interested in contributing to open-source projects and Modal’s container runtime to push language and diffusion models towards higher throughput and lower latency, we’d love to hear from you!

Requirements

- 5+ years of experience writing high-quality, high-performance code.
- Experience working with torch, high-level ML frameworks, and inference engines (vLLM or TensorRT).
- Familiarity with Nvidia GPU architecture and CUDA.
- Experience with ML performance engineering (tell us a story about boosting GPU performance: debugging SM occupancy issues, rewriting an algorithm to be compute-bound, eliminating host overhead, etc.).
- Nice-to-have: familiarity with low-level operating system foundations (Linux kernel, file systems, containers, etc.).

