
Senior Machine Learning Engineer (Inference Platform)

$200k - $250k
Full-time

Wizard

About Wizard AI


At Wizard AI, we’re building a high-performing AI Shopping Agent that helps people discover the best products across the web with speed, accuracy, and trust. Our ML systems sit at the core of that experience, and we’re looking for a Senior MLOps Engineer to help us run them reliably and efficiently in production.

The Role


As a Senior MLOps Engineer at Wizard, you’ll own the end-to-end lifecycle of our ML systems — from packaging and deployment to monitoring, performance, and scaling — across a custom-built inference platform powering a live conversational product.

This isn’t a typical “pipeline” role. Our platform runs multiple specialized inference engines (LLMs, embeddings, and extraction models), each with different performance and scaling characteristics. A big part of the role is thinking through tradeoffs — latency vs. cost, throughput vs. reliability — and helping us evolve the system as we grow.

You’ll work closely with ML, Data, and DevOps, and have real input into how the platform is designed — not just how it’s maintained.

What You’ll Do



  • Build and improve production ML pipelines, making it easy to move models from experimentation to reliable production use

  • Help own and evolve our multi-engine inference platform (LLMs, embeddings, and extraction), improving how different workloads are served and scaled

  • Put strong foundations in place for model versioning, rollouts, and rollbacks so systems stay reproducible and safe to iterate on

  • Define and monitor key system metrics like latency, availability, and GPU utilization, and set clear expectations around performance

  • Improve overall system performance — whether that’s reducing latency, increasing throughput, or making better use of GPU resources

  • Design systems that are resilient and cost-aware, with thoughtful approaches to autoscaling, failure isolation, and graceful degradation

  • Bring solid engineering practices (testing, CI/CD, observability) into ML workflows to help the team move faster without sacrificing reliability

  • Partner closely with ML, Data, Product, and DevOps to turn ideas into production-ready systems and help guide technical decisions

What We’re Looking For



  • 5–8+ years of experience in software, ML, platform, or infrastructure engineering, with hands-on ownership of production ML systems

  • Experience deploying and running LLMs or other deep learning models in real-world environments

  • Strong Python skills and a solid foundation in software engineering

  • Familiarity with cloud platforms (AWS, GCP, Azure) and common ML tooling (model registries, experiment tracking, etc.)

  • A good understanding of inference performance — batching, memory usage, quantization, and how systems behave across CPU and GPU

  • Experience working with (or curiosity about) systems that serve different types of models with different constraints

  • Ability to think through tradeoffs between speed, cost, and reliability in a practical way

  • Comfort working in a fast-moving environment where things evolve quickly

What Success Looks Like


Reliable, Scalable Systems
Our ML systems run smoothly with clear visibility into performance, and can scale as demand grows without constant firefighting.

End-to-End Ownership
You’re able to take a model from idea to production and keep it running well, while making it easier for others to do the same.

Real Impact
You help shape how our ML platform evolves — improving performance, reducing costs, and making the overall system stronger over time.

Compensation & Benefits


The expected base salary range for this role is $200,000 to $250,000 USD. Actual pay within that range will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.

In addition to base salary, Wizard offers:


  • Equity in the form of stock options

  • Medical, dental, and vision coverage

  • 401(k) plan

  • Flexible PTO and company holidays

  • Fully remote work within the United States

  • Periodic company offsites and team gatherings

Wizard is committed to fair, transparent, and competitive compensation practices.

Vacancy posted 1 day ago