Senior ML Performance Engineer — GPU/Inference Systems

AI Chopping Block, Inc.

A leading AI infrastructure company based in New York seeks skilled engineers to optimize ML systems at scale. The ideal candidate has over 5 years of high-performance coding experience, familiarity with Nvidia GPU architecture, and expertise in ML frameworks like Torch. You will enhance open-source projects and improve throughput and latency for machine learning models. Join a growing team in a dynamic environment and contribute to groundbreaking developments in AI.

Vacancy posted 9 hours ago
Similar jobs that could be interesting for you
Based on the Senior ML Performance Engineer — GPU/Inference Systems in New York, NY vacancy
  •  ...professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement learning. The role demands several years of experience in deploying GPU systems, optimizing model performance, and working with frameworks like SGLang and... 
    Senior
    Performance

    Reflection

    New York, NY
    4 days ago
  • $128.7k - $261.3k

     ...capable fully self-driving systems, to move us toward safer, more...  ..., kernel development, and performance engineering so that every cycle on our...  ...builds high‑performance GPU kernels and custom libraries...  ...the heart of our on‑vehicle ML inference for ADAS and autonomous driving... 
    Senior
    Performance
    Flexible hours

    General Motors

    New York, NY
    2 days ago
  • $175k - $250k

     ...Senior Machine Learning Engineer (ML Infrastructure & Data Systems) Our client is an early-stage robotics and...  ...improving system performance through tight feedback...  ...scaled ML training and inference systems in production...  ...at scale (e.g., large GPU workloads) Familiarity... 
    Senior
    Performance

    Right Hand Talent

    Brooklyn, NY
    1 day ago
  • $128.7k - $261.3k

     ...The Model Deployment & Inference Solutions team in GM...  ...two-fold: build the ML deployment platform that...  ...workflows currently performed manually by engineers. Build the developer...  ...or infrastructure systems where reliability, observability...  ...with the NVIDIA GPU stack at the... 
    Senior
    Performance
    Flexible hours
    Shift work

    General Motors

    New York, NY
    2 days ago
  • $200k - $250k

     ...building the top-performing AI Shopping Agent...  ..., and trust. Our ML models power the...  ...seeking an experienced Senior MLOps Engineer to take ownership...  ...machine learning systems run reliably and...  ...a custom-built inference platform powering...  ..., availability, GPU utilization, TTFT... 
    Senior
    Performance
    Remote work
    Flexible hours

    Wizard

    New York, NY
    2 days ago
  • $128.7k - $261.3k

     ...fully self-driving systems, to move us...  ...development, and performance engineering so that every cycle...  ...into fast, reliable inference across GPUs...  ...compiler, systems, and GPU engineers who...  ...driving. The Role As a Senior Compiler Engineer...  ...effortless for ML engineers across... 
    Senior
    Performance
    Flexible hours

    General Motors

    New York, NY
    2 days ago
  •  ...development firm in New York is looking for an AI/ML Systems Engineer to build and maintain on-device inference engines for local LLMs. The ideal candidate will...  ...You will integrate new AI technologies and enhance performance across hardware. Strong problem-solving and... 
    Performance
    Local area

    LM Studio

    New York, NY
    2 days ago
  • $128.7k - $261.3k

     ...seeks a skilled professional to develop its ML deployment platform within the...  ...deployment from training to on-vehicle inference and enhancing developer experience through...  ...from $128,700 to $261,300 with additional performance bonuses and a comprehensive benefits package... 
    Senior
    Performance

    General Motors

    New York, NY
    2 days ago
  •  ...company is looking for exceptional generalist engineers who thrive with autonomy. This fully...  ...kernels to designing distributed orchestration systems. Ideal candidates will have a Bachelor's...  ...track record in systems programming or ML infrastructure. Competitive compensation... 
    Senior
    Remote work

    Inferact

    New York, NY
    2 days ago
  • $216.7k - $303.4k

     ...Senior Machine Learning Systems Engineer Remote - United States Reddit is a community of communities...  ...You’ll Do As a Senior ML Infrastructure Engineer,...  ...with ML engineers on performance tuning, including improving...  ...training time, efficiency, and GPU training costs in a large,... 
    Senior
    Performance
    Remote work

    Reddit

    New York, NY
    2 days ago
  • $165.5k - $248.3k

     ...looking for a Machine Learning Engineer for their Agent Platform team in...  ...involves owning models and systems for AI agent performance, requiring over 5 years of ML experience and strong Python skills...  ..., and optimize LLM inference. The position provides annual cash... 
    Senior
    Performance

    HubSpot

    New York, NY
    2 days ago
  • $200k - $240k

     ...Senior Machine Learning Engineer (Computer Vision / Vision-Language Models)...  ...real-world multimodal AI systems that directly improve human...  ...frameworks Optimize inference pipelines for performance, accuracy, and cost...  ...Proven record of deploying ML systems into production... 
    Senior
    Performance
    Remote work

    Harnham

    New York, NY
    1 day ago
  • $220k

     ...Senior Machine Learning Engineer Location: Remote (with optional...  ...scalable ML infrastructure and...  ...on distributed systems, cutting-edge neural...  ...in a high-performance environment....  ...will build multi-GPU training pipelines...  ...training to batch inference, ensuring automation... 
    Senior
    Performance
    Remote work
    Flexible hours

    Harnham

    New York, NY
    9 hours ago
  • $165k - $225k

     ...of its clients a Senior Machine Learning Engineer - this is a...  ...experienced Senior ML Engineer to join...  ...evaluate algorithm performance, validate...  ...production-ready systems meeting latency...  ...Experience with GPU acceleration and...  ...ONNX export, and inference serving frameworks... 
    Senior
    Performance
    Remote work
    Worldwide

    Career Renew

    New York, NY
    1 day ago
  • $173k - $213k

     ...Matterport - Senior ML Ops Engineer Job Description CoStar Group (NASDAQ: CSGP...  ...pivotal in enhancing the performance, efficiency, and scalability...  ...model performance, optimize inference speed and resource...  ...Familiarity with version control systems (e.g., Git) and agile development... 
    Senior
    Performance
    Work at office

    CoStar Realty Information, Inc.

    New York, NY
    2 days ago
  • Darwin Recruitment is seeking a Senior GPU Systems / AI Infrastructure Engineer in New York City. This senior-level...  ...large-scale model training and inference. Candidates should have 5-10+ years...  ..., directly impacting performance and scalability of frontier AI models... 
    Senior
    Performance

    Darwin Recruitment

    New York, NY
    3 days ago
  • $216.7k - $303.4k

     ...Senior Machine Learning Systems Engineer Remote - United States Reddit is a community of communities...  ...You’ll Do: As a Senior ML Infrastructure Engineer,...  ...with ML engineers on performance tuning, including improving...  ...training time, efficiency, and GPU training costs in a large,... 
    Senior
    Performance
    For contractors
    Work experience placement
    Remote work

    Reddit

    New York, NY
    2 days ago
  •  ...Inc. is seeking a Staff Machine Learning Engineer to lead the development of a large-scale ML Inference Platform. Responsibilities include designing cloud-based ML systems on Kubernetes and ensuring reliable, low-latency performance. Candidates should have 7+ years of... 
    Performance

    Reddit

    New York, NY
    2 days ago
  • $213k - $263k

     ...Senior Machine Learning Engineer, Prediction & Planning, System Architecture Waymo is an autonomous driving technology company...  ...into robust, generalizable, and performant deep neural networks. These...  ...challenging real-world problems with ML and engineering solutions.... 
    Senior
    Performance
    Full time
    Contract work
    Internship
    Remote work

    Waymo

    New York, NY
    4 days ago
  •  ...intersection of foundational AI and systems design for real-world impact...  ...experienced Principal ML Engineer (Applied / Systems) to join...  ...all with a strong focus on performance, maintainability, and impact...  ...Build, optimize, and scale inference pipelines and model serving... 
    Performance

    Soris

    New York, NY
    4 days ago
  • $281.2k - $401.71k

     ...Spotify's Agent Engine Team Spotify's Personalization team is building the next generation...  ...is reshaping how we build products and systems. As part of this shift, we're creating a...  ...experimentation, observability, and performance optimization Contribute to technical... 
    Senior
    Performance
    Flexible hours
    Shift work

    Spotify

    New York, NY
    1 day ago
  • $253.3k - $354.6k

     ...seeking a Staff Machine Learning Engineer to drive AI initiatives in...  ..., requiring 7+ years of ML Engineering experience. Responsibilities...  ...include designing GPU-based systems, developing cloud-based AI...  ...solutions, and ensuring model performance. The role offers competitive... 
    Performance
    Remote work

    Ladders

    New York, NY
    2 days ago
  • $50k

     ...Key Responsibilities AI/ML Model Operations...  ...environments. Implement model performance monitoring including accuracy, latency, and inference metrics. Detect and...  ...pipelines for AI systems. Agentic Systems...  ...data scientists, ML engineers, platform teams, and product... 
    Senior
    Performance
    Full time
    For contractors
    Shift work

    Photon

    New York, NY
    9 hours ago
  • $160k - $240k

    Senior MLOps Engineer - Artificial Intelligence Location...  ...Machine Learning (ML) and Software...  ...009. We build AI systems to help process and...  ...processes, enhance the performance of our systems...  ...network / CPU / GPU) usage Work...  ...model training, inference, and monitoring workflows... 
    Senior
    Performance
    Temporary work
    For contractors
    Work experience placement

    Bloomberg L.P.

    New York, NY
    2 days ago
  •  ...Senior AI/ML & Data Engineer Gesture is where technology meets humanity -- a place...  ...signal you score, every system you deploy here matters --...  ...own AI and ML systems that perform in production -- not in a notebook...  ...from raw data to deployed inference: ingestion, transformation,... 
    Senior
    Performance
    Price work

    Gesture US, Inc

    New York, NY
    1 day ago
  • $144.7k - $261.3k

     ...environments, cloud infrastructure, and ML/AI GPU platforms for AV research and...  .... The Role GM is looking for a Senior Capacity Engineer to join the AV Capacity and Performance Engineering team in the AV...  ...understanding of distributed systems including modern ML system architecture... 
    Senior
    Performance
    Work experience placement
    Local area
    Remote work
    Work from home
    Flexible hours

    General Motors

    New York, NY
    2 days ago
  •  ...DESCRIPTION Transflo is seeking a Senior AI/ML Engineer to lead the design,...  ...will architect and operate AI systems that automatically classify...  ...ingestion through model inference, feedback loops, retraining...  ...workflows Optimize model inference performance and cost-efficiency using... 
    Senior
    Performance
    Remote work

    Transflo

    New York, NY
    2 days ago
  • $150k - $300k

     ...Senior AI Engineering Expert At Goldman Sachs, our Engineers don't just...  ...scalable software and systems, architect low latency infrastructure...  .... Scalability & Performance: Optimize inference latency and manage token...  ...3 years focused on AI/ML integration in production... 
    Senior
    Performance
    Full time
    Temporary work
    Part time
    Immediate start

    The Goldman Sachs Group, Inc.

    New York, NY
    2 days ago
  •  ...technology company is seeking a Senior Machine Learning Engineer to enhance their speech recognition and NLP systems. This role is pivotal in...  ...improving model accuracy and performance. The ideal candidate will have extensive experience in ML model deployment and... 
    Senior
    Performance
    Remote work

    Cresta

    New York, NY
    2 days ago
  • $189.6k - $237k

     ...Scale's ML platform (RLXF) team builds our internal...  ...model training and inference. The platform has been...  ...to optimize our ML system Ideally you'd have: Strong...  ...Strong software engineering skills, proficient in...  ...qualifications, interview performance, and relevant education... 
    Performance
    Full time

    DiversityJobs Inc

    New York, NY
    9 days ago
