Senior Machine Learning Engineer (Inference Platform)

$200k - $250k

Full-time

Wizard

About Wizard AI

At Wizard AI, we’re building a high-performing AI Shopping Agent that helps people discover the best products across the web with speed, accuracy, and trust. Our ML systems sit at the core of that experience, and we’re looking for a Senior MLOps Engineer to help us run them reliably and efficiently in production.

The Role

As a Senior MLOps Engineer at Wizard, you’ll own the end-to-end lifecycle of our ML systems — from packaging and deployment to monitoring, performance, and scaling — across a custom-built inference platform powering a live conversational product.

This isn’t a typical “pipeline” role. Our platform runs multiple specialized inference engines (LLMs, embeddings, and extraction models), each with different performance and scaling characteristics. A big part of the role is thinking through tradeoffs — latency vs. cost, throughput vs. reliability — and helping us evolve the system as we grow.

You’ll work closely with ML, Data, and DevOps, and have real input into how the platform is designed — not just how it’s maintained.

What You’ll Do

Build and improve production ML pipelines, making it easy to move models from experimentation to reliable production use

Help own and evolve our multi-engine inference platform (LLMs, embeddings, and extraction), improving how different workloads are served and scaled

Put strong foundations in place for model versioning, rollouts, and rollbacks so systems stay reproducible and safe to iterate on

Define and monitor key system metrics like latency, availability, and GPU utilization, and set clear expectations around performance

Improve overall system performance — whether that’s reducing latency, increasing throughput, or making better use of GPU resources

Design systems that are resilient and cost-aware, with thoughtful approaches to autoscaling, failure isolation, and graceful degradation

Bring solid engineering practices (testing, CI/CD, observability) into ML workflows to help the team move faster without sacrificing reliability

Partner closely with ML, Data, Product, and DevOps to turn ideas into production-ready systems and help guide technical decisions

What We’re Looking For

5–8+ years of experience in software, ML, platform, or infrastructure engineering, with hands-on ownership of production ML systems

Experience deploying and running LLMs or other deep learning models in real-world environments

Strong Python skills and a solid foundation in software engineering

Familiarity with cloud platforms (AWS, GCP, Azure) and common ML tooling (model registries, experiment tracking, etc.)

A good understanding of inference performance — batching, memory usage, quantization, and how systems behave across CPU and GPU

Experience working with (or curiosity about) systems that serve different types of models with different constraints

Ability to think through tradeoffs between speed, cost, and reliability in a practical way

Comfort working in a fast-moving environment where things evolve quickly

What Success Looks Like

Reliable, Scalable Systems
Our ML systems run smoothly with clear visibility into performance, and can scale as demand grows without constant firefighting.

End-to-End Ownership
You’re able to take a model from idea to production and keep it running well, while making it easier for others to do the same.

Real Impact
You help shape how our ML platform evolves — improving performance, reducing costs, and making the overall system stronger over time.

Compensation & Benefits

The expected base salary range for this role is $200,000 – $250,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.

In addition to base salary, Wizard offers:

Equity in the form of stock options

Medical, dental, and vision coverage

401(k) plan

Flexible PTO and company holidays

Fully remote work within the United States

Periodic company offsites and team gatherings

Wizard is committed to fair, transparent, and competitive compensation practices.


Vacancy posted 8 hours ago
