Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer - Inference / Serving

Yobi AI

Machine Learning Engineer - Inference / Serving Join to apply for the Machine Learning Engineer - Inference / Serving role at Yobi AI Overview Yobi is a rapidly growing Behavioral AI company on a mission to ethically democratize the benefits of data and AI. Since 2019, we have built one of the largest consented behavioral datasets in the United States, extending far beyond the walled gardens of Big Tech. Unlike traditional LLM companies, Yobi builds foundation models of human behavior grounded in real‑world actions such as purchases and store visits. Our private‑by‑design modeling enables state‑of‑the‑art personalization and decisioning for leading brands and agencies while protecting privacy, safety, and ethics. Today, we are focused on bringing the performance of closed‑web user acquisition to the open web and connected TV, giving brands walled‑garden results without the walls. At our core, Yobi is building the behavioral intelligence layer for any system that makes a personalization decision. Working at Yobi We’re at an inflection point—customer adoption is accelerating, but there’s still room to shape the architecture and culture from the ground up. Engineers here own major surface areas, build 0→1 systems in large‑scale data and model infrastructure, and help define how Behavioral AI scales ethically and effectively. Highlights Well‑funded with 5+ years of runway. We are scaling revenue quickly and project to be breakeven in 2026. Partnerships with Microsoft and Databricks. Fully remote or hybrid from hubs in SF Bay Area, Seattle, NYC. World‑class team of Machine Learning experts with experience at Amazon, Uber, Twitter, Meta, etc. Product and Go‑to‑Market teams that have taken ideas from concept to nine‑figure revenue streams. Benefits Competitive base salary. Meaningful equity and financial upside. Annual bonus target based on personal and company performance. Health, dental, vision plans with low out‑of‑pocket costs. Unlimited PTO. 401(k) with company match. About the Role As a Machine Learning Engineer focused on inference and serving at Yobi, you’ll design, optimize, and operate the systems that bring our Behavioral AI models to life in real time. You’ll work at the core of our production environment, turning trained models into performant, reliable, and continuously improving services that power our open‑web and CTV products. This is an applied ML systems role—equal parts engineering depth, deployment craft, and model intuition. You’ll shape how models are packaged, versioned, rolled out, and observed across environments, ensuring every prediction is fast, accurate, and accountable. Responsibilities & Expectations Build and scale production ML serving systems—handle versioning, rollouts, rollback strategies, and live experimentation. Ensure low‑latency inference by optimizing model graphs, quantizing, batching, caching, and efficient feature retrieval. Write robust, high‑performance code in Go, Rust, C++, or Java and bridge to Python for model integration and analysis. Treat inference as a living system—monitor drift, track model lineage, and ensure observability from input to outcome. Make serving systems reproducible and portable without over‑engineering—for instance, custom runtime design, model registries, or lightweight orchestration. Reason about model performance and trade‑offs, and work with researchers to deploy more practical models. Qualifications Deep expertise in model deployment and production ML serving. Strong low‑latency mindset and knowledge of inference optimization techniques. Systems fluency: comfortable writing high‑performance code and bridging to Python. Operational maturity: experienced with monitoring, drift detection, and observability. Infrastructure intuition: understanding of custom runtimes, registries, and orchestration. Applied ML understanding: can interpret performance, reasoning about trade‑offs, and collaborate with researchers. Seniority Level Mid‑Senior level. Employment Type Full‑time. Job Function Engineering and Information Technology. Software Development industry. #J-18808-Ljbffr

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer - Inference / Serving in New York, NY vacancy
  • A leading Behavioral AI company is seeking a Machine Learning Engineer focused on inference and serving. In this role, you will design and optimize systems to operationalize AI models. The ideal candidate has deep expertise in model deployment, a strong low-latency mindset... 
    Suggested
    Remote work

    Yobi AI

    New York, NY
    4 days ago
  •  ...infrastructure and tooling that powers machine learning across Canva. Our Inference Platform team sits at the heart of...  ...that ML models are deployed, served, and optimised efficiently at scale...  .../Specialty: As a Machine Learning Engineer, you’ll focus on building and... 
    Suggested
    Work at office
    Remote work
    Flexible hours

    Canva

    New York, NY
    2 days ago
  • $200k - $250k

     ...seeking an experienced Senior MLOps Engineer to take ownership of how our machine learning systems run reliably and...  ...and scaling – for a custom-built inference platform powering a live conversational...  ...ML systems. Define and enforce serving-layer SLAs – latency,... 
    Suggested
    Remote work
    Flexible hours

    Wizard

    New York, NY
    2 days ago
  •  ...Reddit, Inc. is seeking a Staff Machine Learning Engineer to lead the development of a large-scale ML Inference Platform. Responsibilities include designing cloud-based ML systems on Kubernetes and ensuring reliable, low-latency performance. Candidates should have 7+ years... 
    Suggested

    Reddit

    New York, NY
    2 days ago
  • $155k - $235k

     ...Senior Lead / Lead ML Platform Engineer to architect and own the...  ...direction for our Training and Inference infrastructure. This is a high...  ...massive models efficiently and serve them with sub-millisecond...  ...training, and reinforcement learning. High-Performance Inference... 
    Suggested
    Shift work

    Paramount

    New York, NY
    17 hours ago
  • $148.7k - $229.9k

     ...Platform team is looking for a senior machine learning engineer to lead the evolution of how we validate...  ...into the next generation of causal inference and high-sensitivity evaluation methodologies...  ...-Functional Technical Leadership: Serve as the lead subject matter expert on... 
    Temporary work
    Work at office
    Worldwide
    Relocation package

    Unity

    New York, NY
    1 day ago
  • $148.7k - $199.4k

     ...Senior Machine Learning Engineer - News Technology is at the heart of Disney's past, present, and...  ...foundation and consumer media touch points serving millions of people around the world....  ...for scalable learning, inference, and monitoring, conduct in-depth data... 
    Work experience placement
    Local area
    Day shift

    Disney

    New York, NY
    1 day ago
  •  ...Senior Machine Learning Engineer - News Technology is at the heart of Disney's past, present, and...  ...foundation and consumer media touch points serving millions of people around the world....  ...for scalable learning, inference, and monitoring, conduct in-depth data... 
    Work experience placement
    Local area
    Day shift

    Walt Disney Company

    New York, NY
    1 day ago
  • $128k - $160k

     ...Grailed is looking for a Senior Machine Learning Engineer to drive personalization, recommendation,...  ...quality of inventory impressions that are served to prospective buyers. Develop...  ...advanced statistical modeling, causal inference, experiment/test design, and working... 
    Work experience placement
    Local area

    GOAT Group

    New York, NY
    2 days ago
  • $140k - $210k

     ...highly skilled and motivated engineer to join our team. You will...  ...deploying state-of-the-art machine learning solutions to advance our mission...  ...scale its technology to serve a growing number of customers...  ...using cloud-based training and inference pipelines. ~5+ years of... 
    Full time
    Work experience placement
    Work at office
    2 days per week

    Treeswift Inc

    New York, NY
    17 hours ago
  •  ...The Role We're looking for a Machine Learning Engineer to join our Engineering team. You'll...  ...build data pipelines for training and inference. Develop a robust set of tools for...  ...PyTorch or TensorFlow. ~ Experience with serving models for inference (FastAPI) ~... 

    Soris

    New York, NY
    4 days ago
  • $148.7k - $199.4k

     ...Senior Machine Learning Engineer Technology is at the heart of Disney's past, present, and future...  ...foundation and consumer media touch points serving millions of people around the world....  ...for scalable learning, inference, and monitoring, conduct in-depth data... 
    Work experience placement
    Local area
    Day shift

    Disney France

    New York, NY
    1 day ago
  • $205k - $316.4k

     ...Machine Learning Engineer At Quizlet, our mission is to help every learner achieve their outcomes...  ...implement systems for real-time and batch inference Build end-to-end ML pipelines for...  ...with data pipelines, model serving, and scalable systems Proficiency... 
    Work at office
    3 days per week

    Quizlet

    New York, NY
    17 hours ago
  • $190k - $260k

     ...Machine Learning Engineer – Search, Ranking & Personalization Stage: Seed Founded: 2022 Key Job Information...  ...personalization across a platform serving hundreds of millions of items daily....  ...‑scale data processing for real‑time inference. Strong backend integration... 
    Full time
    H1b
    Remote work
    Relocation
    Visa sponsorship

    Fuku

    New York, NY
    4 days ago
  •  ...innovation agency and precision engineering partner. For over 20 years,...  ...We are seeking hands-on Machine Learning Engineers for an urgent...  ...Execute daily model training and inference tasks. Build and manage...  ...Familiarity with real-time model serving and infrastructure (e.g.,... 
    Temporary work
    Remote work

    Halo Media

    New York, NY
    2 days ago
  • $165k - $225k

     ...Career Renew is recruiting for one of its clients a Senior Machine Learning Engineer - this is a fully remote role for US/Canada based...  ...including CUDA kernel engineering, TensorRT/ONNX export, and inference serving frameworks such as Triton Experience with hosting... 
    Remote work
    Worldwide

    Career Renew

    New York, NY
    1 day ago
  •  ...getting started. Role We are seeking a Founding ML Engineer to define and build Adaptive's ML capabilities. Our products...  ...data pipelines, model training, evaluation frameworks, and inference serving. Establish evaluation methodology. Define how we measure... 
    Work at office
    Local area

    Adaptive Security Corporation

    New York, NY
    3 days ago
  •  ...We are looking for an engineer with experience in low-level systems...  ...growing ML team. Machine learning is a critical pillar of Jane...  ...evolving trading environment serves as a unique, rapid-feedback...  ...models - both training and inference. We care about efficient large... 

    Jane Street

    New York, NY
    3 days ago
  • $111.24k - $222.48k

     ...Senior Machine Learning Engineer We're building a world of health around every individual — shaping...  ...optimization and decision-making to better serve millions of customers nationwide. We'...  ...: reinforcement learning, causal inference, LLM, MCP ~ Experience as a mentor or... 
    Hourly pay
    Full time
    Temporary work

    Oak St. Health

    New York, NY
    4 days ago
  • $150k - $215k

     ...team combining world‑class engineers with veteran strategists who...  ...standing still. About the Role Machine learning is core to Vannevar's...  ...deploying high‑performance inference services, and we operate these...  ...process large volumes of data and serve predictions with strict... 
    Permanent employment
    Contract work
    For contractors
    For subcontractor
    Work at office
    Remote work

    Vannevar Labs

    New York, NY
    2 days ago
  • $117k - $167k

     ...every Fanatics surface. We are seeking a Machine Learning Engineer III to own the infrastructure and...  ...behavior, you build the platforms that serve those models in production. Responsibilities...  ...embedding pipelines and low-latency inference infrastructure. Solid understanding... 
    Full time

    Fanatics Betting & Gaming

    New York, NY
    17 hours ago
  •  ...We are looking for an engineer with robust experience in machine learning and strong mathematical foundations to join...  ...ever-evolving trading environment serves as a unique, rapid-feedback platform...  ...and maintaining training and inference infrastructure, with an understanding... 

    Jane Street

    New York, NY
    4 days ago
  • $148.7k - $199.4k

     ...Senior Machine Learning Engineer Technology is at the heart of Disney's past, present, and future...  ...foundation and consumer media touch points serving millions of people around the world....  ...for scalable learning, inference, and monitoring, conduct in-depth data... 
    Work experience placement

    The Walt Disney Studios

    New York, NY
    1 day ago
  • $128.7k - $261.3k

     ...the Team The Model Deployment & Inference Solutions team in GM AV deploys machine learning models from training frameworks...  ...layer that makes deployment self-serve for every ML model development team...  ...currently performed manually by engineers. Build the developer experience... 
    Flexible hours
    Shift work

    General Motors

    New York, NY
    2 days ago
  • $144k - $192k

     ...framework, powers this discovery. As a Machine Learning Engineer on the Data Mining team, your mission...  ...techniques such as batch inference and quantization to ensure models run...  ...contrastive learning. Knowledge of model serving tools (TF Serving, Triton, TorchServe... 
    Remote work

    Motional AD Inc.

    New York, NY
    2 days ago
  •  ...and endless opportunity to serve the varied needs of our community...  ...and fulfillment. We use machine learning and Internet-scale data to...  ..., and general causal inference. Search & Discovery ML :...  ...works alongside world-class engineers, data scientists, and product... 
    Remote job
    Permanent employment
    Work experience placement
    Internship
    Work at office
    Work from home
    Flexible hours

    Instacart

    New York, NY
    1 day ago
  •  ...Machine Learning Engineer / Researcher BoldVoice helps the 1 billion global non native English speakers...  ...Education apps on the App Store and serves non-native speakers of 100+ different...  ...environments for real-time and batch inference. Pipeline Development and... 
    Work at office
    Relocation package

    BoldVoice

    New York, NY
    1 day ago
  • $200k - $240k

     ...for you. The Opportunity As a Staff Machine Learning Engineer, Multimodal Modeling you will lead the...  ...and architecture pruning, to improve inference efficiency and deployability. Experience...  ..., which lead to good months. This serves as a preview of the 90 day plan you will... 
    Work at office
    Work from home
    Home office
    Flexible hours

    Flock Safety Group

    New York, NY
    2 days ago
  • $230k - $322k

     ...Staff Machine Learning Engineer, Ads Auction (Ads Marketplace Quality) Remote - United States Reddit...  ...feature engineering, model training, and inference. Proficiency with programming...  ...representative of the diverse communities we serve. Reddit is committed to providing reasonable... 
    For contractors
    Work experience placement
    Work at office
    Remote work
    Home office
    Flexible hours

    Reddit

    New York, NY
    2 days ago
  • $234k - $260k

     ...positive mark on culture. Principal Machine Learning Engineer, Ads Personalization (45447) Role...  ...of Paramount+, ensuring that every ad served adds value to the viewer while...  ...frequency capping. Knowledge of Causal Inference to measure the incremental boost ad-personalization... 
    Temporary work
    Immediate start
    Shift work

    Paramount

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer - Inference / Serving. Be the first to apply!