Machine Learning Engineer - Inference / Serving
Yobi AI
Machine Learning Engineer - Inference / Serving Join to apply for the Machine Learning Engineer - Inference / Serving role at Yobi AI Overview Yobi is a rapidly growing Behavioral AI company on a mission to ethically democratize the benefits of data and AI. Since 2019, we have built one of the largest consented behavioral datasets in the United States, extending far beyond the walled gardens of Big Tech. Unlike traditional LLM companies, Yobi builds foundation models of human behavior grounded in real‑world actions such as purchases and store visits. Our private‑by‑design modeling enables state‑of‑the‑art personalization and decisioning for leading brands and agencies while protecting privacy, safety, and ethics. Today, we are focused on bringing the performance of closed‑web user acquisition to the open web and connected TV, giving brands walled‑garden results without the walls. At our core, Yobi is building the behavioral intelligence layer for any system that makes a personalization decision. Working at Yobi We’re at an inflection point—customer adoption is accelerating, but there’s still room to shape the architecture and culture from the ground up. Engineers here own major surface areas, build 0→1 systems in large‑scale data and model infrastructure, and help define how Behavioral AI scales ethically and effectively. Highlights Well‑funded with 5+ years of runway. We are scaling revenue quickly and project to be breakeven in 2026. Partnerships with Microsoft and Databricks. Fully remote or hybrid from hubs in SF Bay Area, Seattle, NYC. World‑class team of Machine Learning experts with experience at Amazon, Uber, Twitter, Meta, etc. Product and Go‑to‑Market teams that have taken ideas from concept to nine‑figure revenue streams. Benefits Competitive base salary. Meaningful equity and financial upside. Annual bonus target based on personal and company performance. Health, dental, vision plans with low out‑of‑pocket costs. Unlimited PTO. 401(k) with company match. About the Role As a Machine Learning Engineer focused on inference and serving at Yobi, you’ll design, optimize, and operate the systems that bring our Behavioral AI models to life in real time. You’ll work at the core of our production environment, turning trained models into performant, reliable, and continuously improving services that power our open‑web and CTV products. This is an applied ML systems role—equal parts engineering depth, deployment craft, and model intuition. You’ll shape how models are packaged, versioned, rolled out, and observed across environments, ensuring every prediction is fast, accurate, and accountable. Responsibilities & Expectations Build and scale production ML serving systems—handle versioning, rollouts, rollback strategies, and live experimentation. Ensure low‑latency inference by optimizing model graphs, quantizing, batching, caching, and efficient feature retrieval. Write robust, high‑performance code in Go, Rust, C++, or Java and bridge to Python for model integration and analysis. Treat inference as a living system—monitor drift, track model lineage, and ensure observability from input to outcome. Make serving systems reproducible and portable without over‑engineering—for instance, custom runtime design, model registries, or lightweight orchestration. Reason about model performance and trade‑offs, and work with researchers to deploy more practical models. Qualifications Deep expertise in model deployment and production ML serving. Strong low‑latency mindset and knowledge of inference optimization techniques. Systems fluency: comfortable writing high‑performance code and bridging to Python. Operational maturity: experienced with monitoring, drift detection, and observability. Infrastructure intuition: understanding of custom runtimes, registries, and orchestration. Applied ML understanding: can interpret performance, reasoning about trade‑offs, and collaborate with researchers. Seniority Level Mid‑Senior level. Employment Type Full‑time. Job Function Engineering and Information Technology. Software Development industry. #J-18808-Ljbffr
- A leading Behavioral AI company is seeking a Machine Learning Engineer focused on inference and serving. In this role, you will design and optimize systems to operationalize AI models. The ideal candidate has deep expertise in model deployment, a strong low-latency mindset...SuggestedRemote work
- ...infrastructure and tooling that powers machine learning across Canva. Our Inference Platform team sits at the heart of... ...that ML models are deployed, served, and optimised efficiently at scale... .../Specialty: As a Machine Learning Engineer, you’ll focus on building and...SuggestedWork at officeRemote workFlexible hours
$200k - $250k
...seeking an experienced Senior MLOps Engineer to take ownership of how our machine learning systems run reliably and... ...and scaling – for a custom-built inference platform powering a live conversational... ...ML systems. Define and enforce serving-layer SLAs – latency,...SuggestedRemote workFlexible hours- ...Reddit, Inc. is seeking a Staff Machine Learning Engineer to lead the development of a large-scale ML Inference Platform. Responsibilities include designing cloud-based ML systems on Kubernetes and ensuring reliable, low-latency performance. Candidates should have 7+ years...Suggested
$155k - $235k
...Senior Lead / Lead ML Platform Engineer to architect and own the... ...direction for our Training and Inference infrastructure. This is a high... ...massive models efficiently and serve them with sub-millisecond... ...training, and reinforcement learning. High-Performance Inference...SuggestedShift work$148.7k - $229.9k
...Platform team is looking for a senior machine learning engineer to lead the evolution of how we validate... ...into the next generation of causal inference and high-sensitivity evaluation methodologies... ...-Functional Technical Leadership: Serve as the lead subject matter expert on...Temporary workWork at officeWorldwideRelocation package$148.7k - $199.4k
...Senior Machine Learning Engineer - News Technology is at the heart of Disney's past, present, and... ...foundation and consumer media touch points serving millions of people around the world.... ...for scalable learning, inference, and monitoring, conduct in-depth data...Work experience placementLocal areaDay shift- ...Senior Machine Learning Engineer - News Technology is at the heart of Disney's past, present, and... ...foundation and consumer media touch points serving millions of people around the world.... ...for scalable learning, inference, and monitoring, conduct in-depth data...Work experience placementLocal areaDay shift
$128k - $160k
...Grailed is looking for a Senior Machine Learning Engineer to drive personalization, recommendation,... ...quality of inventory impressions that are served to prospective buyers. Develop... ...advanced statistical modeling, causal inference, experiment/test design, and working...Work experience placementLocal area$140k - $210k
...highly skilled and motivated engineer to join our team. You will... ...deploying state-of-the-art machine learning solutions to advance our mission... ...scale its technology to serve a growing number of customers... ...using cloud-based training and inference pipelines. ~5+ years of...Full timeWork experience placementWork at office2 days per week- ...The Role We're looking for a Machine Learning Engineer to join our Engineering team. You'll... ...build data pipelines for training and inference. Develop a robust set of tools for... ...PyTorch or TensorFlow. ~ Experience with serving models for inference (FastAPI) ~...
$148.7k - $199.4k
...Senior Machine Learning Engineer Technology is at the heart of Disney's past, present, and future... ...foundation and consumer media touch points serving millions of people around the world.... ...for scalable learning, inference, and monitoring, conduct in-depth data...Work experience placementLocal areaDay shift$205k - $316.4k
...Machine Learning Engineer At Quizlet, our mission is to help every learner achieve their outcomes... ...implement systems for real-time and batch inference Build end-to-end ML pipelines for... ...with data pipelines, model serving, and scalable systems Proficiency...Work at office3 days per week$190k - $260k
...Machine Learning Engineer – Search, Ranking & Personalization Stage: Seed Founded: 2022 Key Job Information... ...personalization across a platform serving hundreds of millions of items daily.... ...‑scale data processing for real‑time inference. Strong backend integration...Full timeH1bRemote workRelocationVisa sponsorship- ...innovation agency and precision engineering partner. For over 20 years,... ...We are seeking hands-on Machine Learning Engineers for an urgent... ...Execute daily model training and inference tasks. Build and manage... ...Familiarity with real-time model serving and infrastructure (e.g.,...Temporary workRemote work
$165k - $225k
...Career Renew is recruiting for one of its clients a Senior Machine Learning Engineer - this is a fully remote role for US/Canada based... ...including CUDA kernel engineering, TensorRT/ONNX export, and inference serving frameworks such as Triton Experience with hosting...Remote workWorldwide- ...getting started. Role We are seeking a Founding ML Engineer to define and build Adaptive's ML capabilities. Our products... ...data pipelines, model training, evaluation frameworks, and inference serving. Establish evaluation methodology. Define how we measure...Work at officeLocal area
- ...We are looking for an engineer with experience in low-level systems... ...growing ML team. Machine learning is a critical pillar of Jane... ...evolving trading environment serves as a unique, rapid-feedback... ...models - both training and inference. We care about efficient large...
$111.24k - $222.48k
...Senior Machine Learning Engineer We're building a world of health around every individual — shaping... ...optimization and decision-making to better serve millions of customers nationwide. We'... ...: reinforcement learning, causal inference, LLM, MCP ~ Experience as a mentor or...Hourly payFull timeTemporary work$150k - $215k
...team combining world‑class engineers with veteran strategists who... ...standing still. About the Role Machine learning is core to Vannevar's... ...deploying high‑performance inference services, and we operate these... ...process large volumes of data and serve predictions with strict...Permanent employmentContract workFor contractorsFor subcontractorWork at officeRemote work$117k - $167k
...every Fanatics surface. We are seeking a Machine Learning Engineer III to own the infrastructure and... ...behavior, you build the platforms that serve those models in production. Responsibilities... ...embedding pipelines and low-latency inference infrastructure. Solid understanding...Full time- ...We are looking for an engineer with robust experience in machine learning and strong mathematical foundations to join... ...ever-evolving trading environment serves as a unique, rapid-feedback platform... ...and maintaining training and inference infrastructure, with an understanding...
$148.7k - $199.4k
...Senior Machine Learning Engineer Technology is at the heart of Disney's past, present, and future... ...foundation and consumer media touch points serving millions of people around the world.... ...for scalable learning, inference, and monitoring, conduct in-depth data...Work experience placement$128.7k - $261.3k
...the Team The Model Deployment & Inference Solutions team in GM AV deploys machine learning models from training frameworks... ...layer that makes deployment self-serve for every ML model development team... ...currently performed manually by engineers. Build the developer experience...Flexible hoursShift work$144k - $192k
...framework, powers this discovery. As a Machine Learning Engineer on the Data Mining team, your mission... ...techniques such as batch inference and quantization to ensure models run... ...contrastive learning. Knowledge of model serving tools (TF Serving, Triton, TorchServe...Remote work- ...and endless opportunity to serve the varied needs of our community... ...and fulfillment. We use machine learning and Internet-scale data to... ..., and general causal inference. Search & Discovery ML :... ...works alongside world-class engineers, data scientists, and product...Remote jobPermanent employmentWork experience placementInternshipWork at officeWork from homeFlexible hours
- ...Machine Learning Engineer / Researcher BoldVoice helps the 1 billion global non native English speakers... ...Education apps on the App Store and serves non-native speakers of 100+ different... ...environments for real-time and batch inference. Pipeline Development and...Work at officeRelocation package
$200k - $240k
...for you. The Opportunity As a Staff Machine Learning Engineer, Multimodal Modeling you will lead the... ...and architecture pruning, to improve inference efficiency and deployability. Experience... ..., which lead to good months. This serves as a preview of the 90 day plan you will...Work at officeWork from homeHome officeFlexible hours$230k - $322k
...Staff Machine Learning Engineer, Ads Auction (Ads Marketplace Quality) Remote - United States Reddit... ...feature engineering, model training, and inference. Proficiency with programming... ...representative of the diverse communities we serve. Reddit is committed to providing reasonable...For contractorsWork experience placementWork at officeRemote workHome officeFlexible hours$234k - $260k
...positive mark on culture. Principal Machine Learning Engineer, Ads Personalization (45447) Role... ...of Paramount+, ensuring that every ad served adds value to the viewer while... ...frequency capping. Knowledge of Causal Inference to measure the incremental boost ad-personalization...Temporary workImmediate startShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Engineer - Inference / Serving. Be the first to apply!
- machine learning ai engineer New York, NY
- machine learning engineer New York, NY
- entry level machine learning engineer New York, NY
- junior machine learning research engineer New York, NY
- machine learning software engineer New York, NY
- ai ml engineer New York, NY
- senior ml engineer New York, NY
- graduate machine learning engineer New York, NY
- computer vision machine learning engineer New York, NY
- data scientist machine learning engineer New York, NY


