Senior ML Inference Engineer — High-Performance PyTorch
Comfy
Comfy is seeking a skilled engineer to optimize model inference as part of the core ComfyUI team. This role focuses on enhancing AI model performance, memory management, and collaborating on innovative features. Ideal candidates have a strong background in PyTorch and a desire to improve machine learning deployment outcomes. Join us in tackling complex technical challenges in visual AI and contribute to shaping the future of our technology.
- ...to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern inference frameworks and a solid... [Senior, Performance]
$128.7k - $261.3k
...development, and performance engineering so that every cycle... ...compiler that turns high‑level models into fast, reliable inference across GPUs... ...driving. The Role As a Senior Compiler Engineer... ...and effortless for ML engineers across the... ...frameworks (e.g., PyTorch, TensorFlow, JAX)... [Senior, Performance, Local area, Flexible hours]
- ...We're looking for an ML Inference Engineer with deep expertise in high-performance ML engineering. This is a highly technical, high-impact role focused on squeezing... ...and resolving bottlenecks Deep expertise in PyTorch, TensorRT, TransformerEngine, Nsight, ONNX Runtime... [Performance, Full time, Visa sponsorship, Relocation package]
- MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability. The ideal candidate will have 3+ years of experience in production... [Senior, Performance]
- Position: Senior ML Performance Engineer Location: SF Bay Area (US) or Toronto (Canada... ...company is building a high-performance, portable compiler... ...performance testing platform for LLM inference workloads across GPU... ...with ML frameworks: PyTorch, TensorFlow, ONNX Runtime,... [Senior, Performance, Full time]
- ...Technical Staff to design and optimize inference systems. The role involves... ...allocation and improving execution performance across various components. Ideal candidates... ...should have strong software engineering skills and experience with ML inference systems, particularly in... [Senior, Performance]
- ...critic. You have a high bar for quality... ...expertise in Python and PyTorch, with a strong... ..., storage, performance, and scale. You'... ...experienced with modern inference systems like TGI,... ...current with ML infrastructure developments... ...requires a large engineering effort dedicated... [Performance, Work at office]
- ...Senior ML/RL Engineer, Behavior Planning At Bot Auto, we are revolutionizing... ...of our large-scale, high-throughput training environments... ...: Expertise in Python and PyTorch; strong understanding of modern... ..., with opportunities for performance bonuses and equity.... [Senior, Performance, Shift work]
- ...Senior ML Engineer Highlight is building a shared intelligence layer for... ...impact. We move fast, hold a high bar, and believe the best... ...measure and improve ML system performance Investigate alternative models... ...~ Hands on proficiency with PyTorch, vector databases, and... [Senior, Performance, Work at office, Relocation, Relocation package, Flexible hours]
$200k - $260k
...Senior Machine Learning Engineer, Voice AI San Francisco About... ...the best inference infrastructure for... ...looking for a Senior ML Engineer to... ...hire on a small, high-impact team. Voice... ...inference performance for voice models... ...proficiency in Python and PyTorch; experience with... [Senior, Performance, Full time]
- ...who loves optimizing model inference to join us in building the... ...bleeding-edge part of our engine. You'll be working on... ...You've written production PyTorch code that pushes performance boundaries You love diving... ...think the current state of ML deployment could be way better... [Senior, Performance]
- ...are looking for a Senior Machine Learning Engineer to build... ...robust solutions to ML/CV software and... ...is close-knit & highly driven, you’ll work... ...evaluation, and inference, both in the cloud... ...improve model performance. Collaborate with... ...g., TensorFlow, PyTorch). Strong... [Senior, Performance, Full time, Work at office, Flexible hours, Weekend work]
- ...Senior Applied Machine Learning Engineer, Asset Intelligence MaintainX... ...are seeking a highly skilled and... ...ll combine deep ML expertise with... ...retraining. Drive performance optimization... ..., and scalable inference serving. Work... ...with PyTorch, TensorFlow, and... [Senior, Performance]
- ...technology company in San Francisco is seeking a Founding Engineer specializing in ML Inference. This highly technical role requires expertise in the ML... ...infrastructure stack and aims to optimize generative media performance. The ideal candidate will drive innovations in real... [Performance, Relocation package]
- ...Senior Principal AI Agent / ML Software Engineer The Senior Principal AI Agent / ML Software... ...autonomous workflows, scalable inference infrastructure, and... ...optimized for low latency, high throughput, GPU... ...including reliability, performance, security posture, cost... [Senior, Performance]
- Senior Staff Machine Learning Engineer, Post Training Remote - USA Airbnb was... ...we rely on ML to ensure that guests... ...optimizing models for high‑performance deployment on Airbnb... ...frameworks such as PyTorch. Proven record of... ...optimizing models and inference run‑time Post‑... [Senior, Performance, Work experience placement, Remote work]
- Jaide Health is seeking an engineer for their Model... ...focuses on building reliable ML systems while enhancing core performance metrics across model execution... ...5 years of experience in high-performance coding, plus... ...and insights into the LLM inference ecosystem. A commitment... [Performance, Remote job]
$96.8k - $306.4k
...Job Description The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level,... ...workflows, scalable inference infrastructure, and enterprise... ...for low latency, high throughput, GPU efficiency... ...including reliability, performance, security posture, cost... [Senior, Performance, Temporary work, Flexible hours]
- ...About the Role We are looking for a visionary Senior ML Engineer who will bridge the gap between high-level architecture and hands-on execution,... ...to monitor agent reasoning paths, tool usage, and performance in real-time Develop and enforce technical safety... [Senior, Performance, Shift work]
- ...building production‑grade ML infrastructure used... ...are looking for a Senior AI/ML Engineer to own model training... ...evaluation systems, and inference serving at scale.... ...deployment Own model performance, latency, and cost trade... ...Strong Python and PyTorch (or JAX) fundamentals... [Senior, Performance, Full time]
- ...Senior Machine Learning Engineer, Computer Vision, HD Map and SLAM Houston, TX... ...model experimentations to performance metrics verifications, and... ...learning. Be familiar with PyTorch, TensorFlow and other... ...distillation, or model inference acceleration (e.g. TensorRT... [Senior, Performance]
$185k - $275k
Senior Machine Learning Engineer - GeoAI Platform Wherobots, Inc.... ...fully‑managed, highly scalable geospatial... ...geospatial ML platform that powers... ...systems, ML inference, and geospatial... ...tooling. We use Ray, PyTorch, and the... ...understanding of performance tradeoffs across... [Senior, Performance, Full time, Work at office, Remote work, Work visa]
$204k - $259k
...set of sensors, enabling engineers like you to (1) develop... ...and Weather. We take a high-level business problem... .... Most of our work is ML-related. Recently we have... ...and improving performance of pre-trained and fine... ...with ML frameworks like PyTorch, JAX, or TensorFlow.... [Senior, Performance, Full time, Remote work]
$200k - $350k
...Python, C++, or Rust, and a solid understanding of reinforcement learning principles. The position offers a competitive compensation range of $200K to $350K, and you’ll work with a small, elite team in a dynamic, high-performance environment. Pantera Capital [Senior, Performance]
$180k - $220k
...Machine Learning Engineer At Ouster, we build sensors... ...is a full range of high-resolution LIDAR sensors... ...real-time, on-device performance. This role requires... ...models for real-time inference and on-device deployment... ...in Python and PyTorch. ~3+ years proficiency... [Senior, Performance, Work experience placement, Local area]
$160k - $250k
...Senior Machine Learning Engineer In order to execute our vision, we need... ...involved in applying a ML model to a production... ...it at scale with high throughput and uptime... ...and maintain scalable, performant and secure code that... ...frameworks, such as PyTorch or TensorFlow You... [Senior, Performance]
- A leading AI infrastructure company is seeking a Senior ML Performance Engineer to design a comprehensive performance testing platform for large... ...engineering and strong experience with GPU programming and ML inference workloads. Candidates should have expertise in Python and... [Senior, Performance]
$180k - $270k
Cerebras is seeking a Senior Machine Learning Engineer for their Avatar Technology team in San Francisco... ...animation systems, delivering high-quality performance in production. Candidates should have... ...and C++, and familiarity with ML frameworks. The position offers a... [Senior, Performance]
- ...production-minded ML team based in Orange... ...with other engineers and researchers to... ...segmentation and detection (PyTorch). You must... ...curate datasets, drive high-priority... ...to track metrics, perform ablations, write clear... ...models, writing basic inference code, adding tests... [Senior, Performance]
$200k - $400k
...are an early‑stage, high-growth venture... ...innovative strategic engineer to help us scale. Role Overview The Senior Machine Learning Engineer... ...across the full ML lifecycle, from... ...measurable costs, performance targets, and real‑world... ...relevance ranking using PyTorch and Hugging Face.... [Senior, Performance, Work experience placement]