Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior ML Inference Engineer — High-Performance PyTorch

Comfy

Comfy is seeking a skilled engineer to optimize model inference as part of the core ComfyUI team. This role focuses on enhancing AI model performance, memory management, and collaborating on innovative features. Ideal candidates have a strong background in PyTorch and a desire to improve machine learning deployment outcomes. Join us in tackling complex technical challenges in visual AI and contribute to shaping the future of our technology. #J-18808-Ljbffr Comfy

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior ML Inference Engineer — High-Performance PyTorch in San Francisco, CA vacancy
  •  ...to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern inference frameworks and a solid... 
    Senior
    Performance

    Reflection AI

    San Francisco, CA
    1 day ago
  • $128.7k - $261.3k

     ...development, and performance engineering so that every cycle...  ...compiler that turns high‑level models into fast, reliable inference across GPUs...  ...driving. The Role As a Senior Compiler Engineer...  ...and effortless for ML engineers across the...  ...frameworks (e.g., PyTorch, TensorFlow, JAX)... 
    Senior
    Performance
    Local area
    Flexible hours

    Israelvcforum

    San Francisco, CA
    5 days ago
  •  ...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is a highly technical, high-impact role focused on squeezing...  ...and resolving bottlenecks Deep expertise in PyTorch, TensorRT, TransformerEngine, Nsight, ONNX Runtime... 
    Performance
    Full time
    Visa sponsorship
    Relocation package

    Reactor.am

    San Francisco, CA
    5 days ago
  • MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability. The ideal candidate will have 3+ years of experience in production... 
    Senior
    Performance

    MakerMaker.AI

    San Francisco, CA
    3 days ago
  • Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto (Canada...  ...company is building a high-performance, portable compiler...  ...performance testing platform for LLM inference workloads across GPU...  ...with ML frameworks: PyTorch, TensorFlow, ONNX Runtime,... 
    Senior
    Performance
    Full time

    Amadeus Search

    San Francisco, CA
    3 days ago
  •  ...Technical Staff to design and optimize inference systems. The role involves...  ...allocation and improving execution performance across various components. Ideal candidates...  ...should have strong software engineering skills and experience with ML inference systems, particularly in... 
    Senior
    Performance

    Gimlet Labs

    San Francisco, CA
    4 days ago
  •  ...critic. You have a high bar for quality...  ...expertise in Python and PyTorch , with a strong...  ..., storage , performance , and scale . You'...  ...experienced with modern inference systems like TGI ,...  ...current with ML infrastructure developments...  ...requires a large engineering effort dedicated... 
    Performance
    Work at office

    Gravity Engineering Services Pvt Ltd.

    San Francisco, CA
    2 days ago
  •  ...Senior ML/RL Engineer, Behavior Planning At Bot Auto, we are revolutionizing...  ...of our large-scale, high-throughput training environments...  ...: Expertise in Python and PyTorch; strong understanding of modern...  ..., with opportunities for performance bonuses and equity.... 
    Senior
    Performance
    Shift work

    Bot Auto

    San Francisco, CA
    1 day ago
  •  ...Senior ML Engineer Highlight is building a shared intelligence layer for...  ...impact. We move fast, hold a high bar, and believe the best...  ...measure and improve ML system performance Investigate alternative models...  ...~ Hands on proficiency with PyTorch, vector databases, and... 
    Senior
    Performance
    Work at office
    Relocation
    Relocation package
    Flexible hours

    Highlight AI

    San Francisco, CA
    2 days ago
  • $200k - $260k

     ...Senior Machine Learning Engineer, Voice AI San Francisco About...  ...the best inference infrastructure for...  ...looking for a Senior ML Engineer to...  ...hire on a small, high-impact team. Voice...  ...inference performance for voice models...  ...proficiency in Python and PyTorch; experience with... 
    Senior
    Performance
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  •  ...who loves optimizing model inference to join us in building the...  ...bleeding-edge part of our engine. You'll be working on...  ...You've written production PyTorch code that pushes performance boundaries You love diving...  ...think the current state of ML deployment could be way better... 
    Senior
    Performance

    Comfy

    San Francisco, CA
    5 days ago
  •  ...are looking for a Senior Machine Learning Engineer to build...  ...robust solutions to ML/CV software and...  ...is close-knit & highly driven, you’ll work...  ...evaluation, and inference, both in the cloud...  ...improve model performance. Collaborate with...  ...g., TensorFlow, PyTorch). Strong... 
    Senior
    Performance
    Full time
    Work at office
    Flexible hours
    Weekend work

    Orchard Robotics

    San Francisco, CA
    4 days ago
  •  ...Senior Applied Machine Learning Engineer, Asset Intelligence MaintainX...  ...are seeking a highly skilled and...  ...ll combine deep ML expertise with...  ...retraining. Drive performance optimization...  ..., and scalable inference serving. Work...  ...with PyTorch, TensorFlow, and... 
    Senior
    Performance

    MaintainX

    San Francisco, CA
    2 days ago
  •  ...technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML...  ...infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations in real... 
    Performance
    Relocation package

    Reactor

    San Francisco, CA
    2 days ago
  •  ...Senior Principal Ai Agent / Ml Software Engineer The Senior Principal AI Agent / ML Software...  ...autonomous workflows, scalable inference infrastructure, and...  ...optimized for low latency, high throughput, GPU...  ...including reliability, performance, security posture, cost... 
    Senior
    Performance

    Oracle

    San Francisco, CA
    2 days ago
  • Senior Staff Machine Learning Engineer, Post Training Remote - USA Airbnb was...  ...we rely on ML to ensure that guests...  ...optimizing models for high‑performance deployment on Airbnb...  ...frameworks such as PyTorch. Proven record of...  ...optimizing models and inference run‑time Post‑... 
    Senior
    Performance
    Work experience placement
    Remote work

    airbnb, Inc.

    San Francisco, CA
    2 days ago
  • Jaide Health is seeking an engineer for their Model...  ...focuses on building reliable ML systems while enhancing core performance metrics across model execution...  ...5 years of experience in high-performance coding, plus...  ...and insights into the LLM inference ecosystem. A commitment... 
    Performance
    Remote job

    Jaide Health

    San Francisco, CA
    4 days ago
  • $96.8k - $306.4k

     ...Job Description The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level,...  ...workflows, scalable inference infrastructure, and enterprise...  ...for low latency, high throughput, GPU efficiency...  ...including reliability, performance, security posture, cost... 
    Senior
    Performance
    Temporary work
    Flexible hours

    Oracle

    San Francisco, CA
    3 days ago
  •  ...About the Role We are looking for a visionary Senior ML Engineer who will bridge the gap between high-level architecture and hands-on execution,...  ...to monitor agent reasoning paths, tool usage, and performance in real-time Develop and enforce technical safety... 
    Senior
    Performance
    Shift work

    Palm Venture Studios

    San Francisco, CA
    7 days ago
  •  ...building production‑grade ML infrastructure used...  ...are looking for a Senior AI/ML Engineer to own model training...  ...evaluation systems, and inference serving at scale....  ...deployment Own model performance, latency, and cost trade...  ...Strong Python and PyTorch (or JAX) fundamentals... 
    Senior
    Performance
    Full time

    Clera

    San Francisco, CA
    2 days ago
  •  ...Senior Machine Learning Engineer, Computer Vision, HD Map and SLAM Houston, TX...  ...model experimentations to performance metrics verifications, and...  ...learning. Be familiar with PyTorch, TensorFlow and other...  ...distillation, or model inference acceleration (e.g. TensorRT... 
    Senior
    Performance

    Bot Auto

    San Francisco, CA
    3 days ago
  • $185k - $275k

    Senior Machine Learning Engineer - GeoAI Platform Wherobots, Inc....  ...fully‑managed, highly scalable geospatial...  ...geospatial ML platform that powers...  ...systems, ML inference, and geospatial...  ...tooling. We use Ray, PyTorch, and the...  ...understanding of performance tradeoffs across... 
    Senior
    Performance
    Full time
    Work at office
    Remote work
    Work visa

    Wherobots, Inc

    San Francisco, CA
    1 day ago
  • $204k - $259k

     ...set of sensors, enabling engineers like you to (1) develop...  ...and Weather. We take a high-level business problem...  .... Most of our work is ML-related. Recently we have...  ...and improving performance of pre-trained and fine...  ...with ML frameworks like PyTorch, JAX, or Tensorflow.... 
    Senior
    Performance
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $200k - $350k

     ...Python, C++, or Rust, and a solid understanding of reinforcement learning principles. The position offers a competitive compensation range of $200K to $350K, and you’ll work with a small, elite team in a dynamic, high-performance environment. #J-18808-Ljbffr Pantera Capital
    Senior
    Performance

    Pantera Capital

    San Francisco, CA
    4 days ago
  • $180k - $220k

     ...Machine Learning Engineer At Ouster, we build sensors...  ...is a full range of high-resolution LIDAR sensors...  ...real-time, on-device performance. This role requires...  ...models for real-time inference and on-device deployment...  ...in Python and PyTorch. ~3+ years proficiency... 
    Senior
    Performance
    Work experience placement
    Local area

    Ouster

    San Francisco, CA
    22 days ago
  • $160k - $250k

     ...Senior Machine Learning Engineer In order to execute our vision, we need...  ...involved in applying a ML model to a production...  ...it at scale with high throughput and uptime...  ...and maintain scalable, performant and secure code that...  ...frameworks, such as PyTorch or Tensorflow You... 
    Senior
    Performance

    Hive

    San Francisco, CA
    3 days ago
  • A leading AI infrastructure company is seeking a Senior ML Performance Engineer to design a comprehensive performance testing platform for large...  ...engineering and strong experience with GPU programming and ML inference workloads. Candidates should have expertise in Python and... 
    Senior
    Performance

    Amadeus Search

    San Francisco, CA
    3 days ago
  • $180k - $270k

    Cerebras is seeking a Senior Machine Learning Engineer for their Avatar Technology team in San Francisco...  ...animation systems, delivering high-quality performance in production. Candidates should have...  ...and C++, and familiarity with ML frameworks. The position offers a... 
    Senior
    Performance

    Cerebras

    San Francisco, CA
    3 days ago
  •  ...production-minded ML team based in Orange...  ...with other engineers and researchers to...  ...segmentation and detection (PyTorch). You must...  ...curate datasets, drive high-priority...  ...to track metrics, perform ablations, write clear...  ...models, writing basic inference code, adding tests... 
    Senior
    Performance

    Kinetic Corporation

    Oakland, CA
    2 days ago
  • $200k - $400k

     ...are an early‑stage, high-growth venture...  ...innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning Engineer...  ...across the full ML lifecycle, from...  ...measurable costs, performance targets, and real‑world...  ...relevance ranking using PyTorch and Hugging Face.... 
    Senior
    Performance
    Work experience placement

    Troveo AI

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Inference Engineer — High-Performance PyTorch. Be the first to apply!