Senior ML Inference Engineer — High-Performance PyTorch

Comfy

Comfy is seeking a skilled engineer to optimize model inference as part of the core ComfyUI team. This role focuses on enhancing AI model performance, memory management, and collaborating on innovative features. Ideal candidates have a strong background in PyTorch and a desire to improve machine learning deployment outcomes. Join us in tackling complex technical challenges in visual AI and contribute to shaping the future of our technology. #J-18808-Ljbffr Comfy

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Senior ML Inference Engineer — High-Performance PyTorch in San Francisco, CA vacancy

Senior GPU ML Infra Engineer — Mid-Training & Inference
...to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern inference frameworks and a solid...
Senior
Performance
Reflection AI
San Francisco, CA
1 day ago
Senior ML Compiler Engineer
$128.7k - $261.3k
...development, and performance engineering so that every cycle... ...compiler that turns high‑level models into fast, reliable inference across GPUs... ...driving. The Role As a Senior Compiler Engineer... ...and effortless for ML engineers across the... ...frameworks (e.g., PyTorch, TensorFlow, JAX)...
Senior
Performance
Local area
Flexible hours
Israelvcforum
San Francisco, CA
5 days ago
ML Inference Engineer San Francisco Engineering Full Time
...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is a highly technical, high-impact role focused on squeezing... ...and resolving bottlenecks Deep expertise in PyTorch, TensorRT, TransformerEngine, Nsight, ONNX Runtime...
Performance
Full time
Visa sponsorship
Relocation package
Reactor.am
San Francisco, CA
5 days ago
Senior ML Inference Engineer Production Systems
MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability. The ideal candidate will have 3+ years of experience in production...
Senior
Performance
MakerMaker.AI
San Francisco, CA
3 days ago
Senior ML Performance Engineer
Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto (Canada... ...company is building a high-performance, portable compiler... ...performance testing platform for LLM inference workloads across GPU... ...with ML frameworks: PyTorch, TensorFlow, ONNX Runtime,...
Senior
Performance
Full time
Amadeus Search
San Francisco, CA
3 days ago
Senior ML Inference Systems Engineer
...Technical Staff to design and optimize inference systems. The role involves... ...allocation and improving execution performance across various components. Ideal candidates... ...should have strong software engineering skills and experience with ML inference systems, particularly in...
Senior
Performance
Gimlet Labs
San Francisco, CA
4 days ago
LLM/ML Engineer (Inference)
...critic. You have a high bar for quality... ...expertise in Python and PyTorch , with a strong... ..., storage , performance , and scale . You'... ...experienced with modern inference systems like TGI ,... ...current with ML infrastructure developments... ...requires a large engineering effort dedicated...
Performance
Work at office
Gravity Engineering Services Pvt Ltd.
San Francisco, CA
2 days ago
Senior ML/RL Engineer, Behavior Planning
...Senior ML/RL Engineer, Behavior Planning At Bot Auto, we are revolutionizing... ...of our large-scale, high-throughput training environments... ...: Expertise in Python and PyTorch; strong understanding of modern... ..., with opportunities for performance bonuses and equity....
Senior
Performance
Shift work
Bot Auto
San Francisco, CA
1 day ago
Senior ML Engineer
...Senior ML Engineer Highlight is building a shared intelligence layer for... ...impact. We move fast, hold a high bar, and believe the best... ...measure and improve ML system performance Investigate alternative models... ...~ Hands on proficiency with PyTorch, vector databases, and...
Senior
Performance
Work at office
Relocation
Relocation package
Flexible hours
Highlight AI
San Francisco, CA
2 days ago
Senior Machine Learning Engineer, Voice AI
$200k - $260k
...Senior Machine Learning Engineer, Voice AI San Francisco About... ...the best inference infrastructure for... ...looking for a Senior ML Engineer to... ...hire on a small, high-impact team. Voice... ...inference performance for voice models... ...proficiency in Python and PyTorch; experience with...
Senior
Performance
Full time
Together AI
San Francisco, CA
2 days ago
Senior/Staff ML Engineer, Performance Optimization
...who loves optimizing model inference to join us in building the... ...bleeding-edge part of our engine. You'll be working on... ...You've written production PyTorch code that pushes performance boundaries You love diving... ...think the current state of ML deployment could be way better...
Senior
Performance
Comfy
San Francisco, CA
5 days ago
Senior Machine Learning Engineer
...are looking for a Senior Machine Learning Engineer to build... ...robust solutions to ML/CV software and... ...is close-knit & highly driven, you’ll work... ...evaluation, and inference, both in the cloud... ...improve model performance. Collaborate with... ...g., TensorFlow, PyTorch). Strong...
Senior
Performance
Full time
Work at office
Flexible hours
Weekend work
Orchard Robotics
San Francisco, CA
4 days ago
Senior Applied Machine Learning Engineer, Asset Intelligence
...Senior Applied Machine Learning Engineer, Asset Intelligence MaintainX... ...are seeking a highly skilled and... ...ll combine deep ML expertise with... ...retraining. Drive performance optimization... ..., and scalable inference serving. Work... ...with PyTorch, TensorFlow, and...
Senior
Performance
MaintainX
San Francisco, CA
2 days ago
Founding ML Inference Engineer — Ultra-Low Latency AI
...technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML... ...infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations in real...
Performance
Relocation package
Reactor
San Francisco, CA
2 days ago
Senior Principal AI Agent / ML Engineer (OCI)
...Senior Principal Ai Agent / Ml Software Engineer The Senior Principal AI Agent / ML Software... ...autonomous workflows, scalable inference infrastructure, and... ...optimized for low latency, high throughput, GPU... ...including reliability, performance, security posture, cost...
Senior
Performance
Oracle
San Francisco, CA
2 days ago
Senior Staff Machine Learning Engineer, Post Training
Senior Staff Machine Learning Engineer, Post Training Remote - USA Airbnb was... ...we rely on ML to ensure that guests... ...optimizing models for high‑performance deployment on Airbnb... ...frameworks such as PyTorch. Proven record of... ...optimizing models and inference run‑time Post‑...
Senior
Performance
Work experience placement
Remote work
airbnb, Inc.
San Francisco, CA
2 days ago
Staff ML Inference Engineer — Model Efficiency (Remote)
Jaide Health is seeking an engineer for their Model... ...focuses on building reliable ML systems while enhancing core performance metrics across model execution... ...5 years of experience in high-performance coding, plus... ...and insights into the LLM inference ecosystem. A commitment...
Performance
Remote job
Jaide Health
San Francisco, CA
4 days ago
Senior Principal AI Agent / ML Software Engineer (OCI)
$96.8k - $306.4k
...Job Description The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level,... ...workflows, scalable inference infrastructure, and enterprise... ...for low latency, high throughput, GPU efficiency... ...including reliability, performance, security posture, cost...
Senior
Performance
Temporary work
Flexible hours
Oracle
San Francisco, CA
3 days ago
Gentoro | Senior ML Engineer
...About the Role We are looking for a visionary Senior ML Engineer who will bridge the gap between high-level architecture and hands-on execution,... ...to monitor agent reasoning paths, tool usage, and performance in real-time Develop and enforce technical safety...
Senior
Performance
Shift work
Palm Venture Studios
San Francisco, CA
7 days ago
Senior Machine Learning Engineer
...building production‑grade ML infrastructure used... ...are looking for a Senior AI/ML Engineer to own model training... ...evaluation systems, and inference serving at scale.... ...deployment Own model performance, latency, and cost trade... ...Strong Python and PyTorch (or JAX) fundamentals...
Senior
Performance
Full time
Clera
San Francisco, CA
2 days ago
Senior Machine Learning Engineer, Computer Vision, HD Map and SLAM
...Senior Machine Learning Engineer, Computer Vision, HD Map and SLAM Houston, TX... ...model experimentations to performance metrics verifications, and... ...learning. Be familiar with PyTorch, TensorFlow and other... ...distillation, or model inference acceleration (e.g. TensorRT...
Senior
Performance
Bot Auto
San Francisco, CA
3 days ago
Senior Machine Learning Engineer - GeoAI Platform
$185k - $275k
Senior Machine Learning Engineer - GeoAI Platform Wherobots, Inc.... ...fully‑managed, highly scalable geospatial... ...geospatial ML platform that powers... ...systems, ML inference, and geospatial... ...tooling. We use Ray, PyTorch, and the... ...understanding of performance tradeoffs across...
Senior
Performance
Full time
Work at office
Remote work
Work visa
Wherobots, Inc
San Francisco, CA
1 day ago
Senior Machine Learning Engineer, Weather & Degraded Road Surfaces
$204k - $259k
...set of sensors, enabling engineers like you to (1) develop... ...and Weather. We take a high-level business problem... .... Most of our work is ML-related. Recently we have... ...and improving performance of pre-trained and fine... ...with ML frameworks like PyTorch, JAX, or Tensorflow....
Senior
Performance
Full time
Remote work
Waymo
San Francisco, CA
4 days ago
Senior ML Engineer — Whole-Body Control & Simulation
$200k - $350k
...Python, C++, or Rust, and a solid understanding of reinforcement learning principles. The position offers a competitive compensation range of $200K to $350K, and you’ll work with a small, elite team in a dynamic, high-performance environment. #J-18808-Ljbffr Pantera Capital
Senior
Performance
Pantera Capital
San Francisco, CA
4 days ago
Sr. Machine Learning Engineer (Perception and Tracking)
$180k - $220k
...Machine Learning Engineer At Ouster, we build sensors... ...is a full range of high-resolution LIDAR sensors... ...real-time, on-device performance. This role requires... ...models for real-time inference and on-device deployment... ...in Python and PyTorch. ~3+ years proficiency...
Senior
Performance
Work experience placement
Local area
Ouster
San Francisco, CA
22 days ago
Senior Machine Learning Engineer
$160k - $250k
...Senior Machine Learning Engineer In order to execute our vision, we need... ...involved in applying a ML model to a production... ...it at scale with high throughput and uptime... ...and maintain scalable, performant and secure code that... ...frameworks, such as PyTorch or Tensorflow You...
Senior
Performance
Hive
San Francisco, CA
3 days ago
Senior ML Performance Engineer: LLM Benchmarking & GPU
A leading AI infrastructure company is seeking a Senior ML Performance Engineer to design a comprehensive performance testing platform for large... ...engineering and strong experience with GPU programming and ML inference workloads. Candidates should have expertise in Python and...
Senior
Performance
Amadeus Search
San Francisco, CA
3 days ago
Senior ML Engineer: Real-Time Avatar Animation (Hybrid)
$180k - $270k
Cerebras is seeking a Senior Machine Learning Engineer for their Avatar Technology team in San Francisco... ...animation systems, delivering high-quality performance in production. Candidates should have... ...and C++, and familiarity with ML frameworks. The position offers a...
Senior
Performance
Cerebras
San Francisco, CA
3 days ago
Senior Machine Learning Engineer
...production-minded ML team based in Orange... ...with other engineers and researchers to... ...segmentation and detection (PyTorch). You must... ...curate datasets, drive high-priority... ...to track metrics, perform ablations, write clear... ...models, writing basic inference code, adding tests...
Senior
Performance
Kinetic Corporation
Oakland, CA
2 days ago
Senior Machine Learning Engineer
$200k - $400k
...are an early‑stage, high-growth venture... ...innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning Engineer... ...across the full ML lifecycle, from... ...measurable costs, performance targets, and real‑world... ...relevance ranking using PyTorch and Hugging Face....
Senior
Performance
Work experience placement
Troveo AI
San Francisco, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior ML Inference Engineer — High-Performance PyTorch. Be the first to apply!