Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Real-Time ML Inference Engineer for Scalable Serving

Yobi

A Behavioral AI company is seeking a Machine Learning Engineer to design and optimize systems for bringing their models to life. The role involves ensuring ML models are efficient and reliable, requiring experience in model deployment and robust coding skills. Candidates should be familiar with low-latency techniques and operational maturity in ML systems. This position can be remote or hybrid from several hubs. #J-18808-Ljbffr Yobi

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Real-Time ML Inference Engineer for Scalable Serving in New York, NY vacancy
  •  ...ML Engineer Jersey City, NJ, 07311 (4 days onsite per week) Video...  ...experimentation to production by building scalable, reliable systems that serve predictions in real time or batch environments. What...  ...Optimization: Improve model inference speed and scalability.... 
    Suggested

    United Software Group

    Jersey City, NJ
    1 day ago
  •  ...Learning / Software Engineer Dyania...  ...solves important real-world problems,...  .... As a senior ML engineer at Dyania...  ..., and deploy scalable ML-driven...  ...deployment, and inference at scale. Architect...  ...deploying, and serving ML models in...  ...Generous Paid Time Off (Vacation,... 
    Suggested
    Internship
    Local area
    Remote work
    Flexible hours
    Shift work

    HealthX Ventures

    Jersey City, NJ
    2 days ago
  • Machine Learning Engineer - Inference / Serving Join to apply for the Machine Learning...  ...human behavior grounded in real‑world actions such as purchases...  ...AI models to life in real time. You’ll work at the core of...  ...products. This is an applied ML systems role—equal parts engineering... 
    Suggested
    Full time
    Remote work

    Yobi AI

    New York, NY
    5 days ago
  • Instacart is hiring a Senior Machine Learning Engineer to join the Matching & Positioning team focused on real-time decision-making for fulfillment processes. This role requires expertise in operations research and machine learning to design algorithms impacting profitability... 
    Suggested
    Remote work

    Instacart

    New York, NY
    2 days ago
  • $150k - $200k

    Affirm is seeking a Senior Machine Learning Engineer (Fraud) to lead the development of fraud prediction models. You will be working...  ...and will collaborate with cross-functional teams to build ML systems for real-time transaction decisions. Candidates should have 6+ years of... 
    Suggested
    Remote job

    Affirm

    New York, NY
    2 days ago
  • $200k - $250k

     ...quality, and trust. Our ML models power the...  ...Senior MLOps Engineer to take ownership...  ...for a custom-built inference platform powering...  ...engines handling real-time inference for high...  ...Define and enforce serving-layer SLAs - latency...  ...cost-efficient, and scalable, partnering with... 
    Remote work
    Flexible hours

    Wizard

    New York, NY
    2 days ago
  • Olik Global is seeking an experienced Data Engineer for projects in New Jersey, Irving, and...  ...a focus on MEM SQL (SingleStore) for real-time processing. The ideal candidate will require...  ...crucial. Join a dynamic team to drive scalable data solutions. #J-18808-Ljbffr Olik... 

    Olik Global

    New York, NY
    3 days ago
  •  ...Learning Engineerto serve as a hands-on...  ...most of their time working directly...  ...principal-level engineer: shaping unclear...  ...Develop practical ML models that...  ...batch scoring, real-time or near-real-time inference, model versioning...  ...supportable, secure, scalable, and aligned... 
    Temporary work
    Remote work

    Medical Guardian

    New York, NY
    1 day ago
  • Mirage is looking for an ML Engineer in New York City to build and scale systems for video generation models. The role focuses on optimizing advanced models for real-time generation and includes responsibilities like training models and improving system efficiency. The... 

    Mirage

    New York, NY
    3 days ago
  •  ...**Job Description:****ML Engineer****3M Health Care is now...  ...work reliably in the real world. You will help...  ...services are secure and scalable.**Key Responsibilities...  ...or Git).* **Model Serving:** Deploy ML models as...  ...for model training and inference.* **Feature Management... 
    H1b
    Remote work

    Solventum

    New York, NY
    5 days ago
  • The New York Times is seeking a Senior Data Engineer in New York City to contribute to the Customer-Facing Data Products team. This role involves developing real-time data pipelines and APIs that serve customer needs. The ideal candidate has over 5 years of experience... 

    The New York Times

    New York, NY
    3 days ago
  • $148.7k - $199.4k

     ...global organization of engineers, product...  ..., innovation, and scalability for our businesses...  ...media touch points serving millions of people...  ...identity. The News ML team is responsible...  ...models to enable real-time content personalization...  ...learning, inference, and monitoring, conduct... 
    Work experience placement
    Local area
    Day shift

    The Walt Disney Company

    New York, NY
    1 day ago
  • $145k - $180k

     ...Whythisrolematters: The ML Engineer is a new role within...  ...andoptimizingML inference systems that run in production...  ...video as well as real-time SBERT for embedding...  ...Design and implement scalable data processing...  ...transformer, rewritingits serving path for a 2-3x latency... 

    The Associated Press

    New York, NY
    4 days ago
  • $148.7k - $199.4k

     ...Machine Learning Engineer - News Technology...  ..., innovation, and scalability for our businesses...  ...touch points serving millions of people...  ...identity. The News ML team is responsible...  ...models to enable real-time content personalization...  ...learning, inference, and monitoring, conduct... 
    Work experience placement
    Local area
    Day shift

    Disney

    New York, NY
    1 day ago
  • $111.24k - $222.48k

     ...Machine Learning Engineer We're...  ...community at a time. At CVS Health...  ...making to better serve millions of customers...  ...expertise in ML to work on...  ...technologies into real business...  ...code quality, and scalable architecture. Influence...  ..., causal inference, LLM, MCP ~ Experience... 
    Hourly pay
    Full time
    Temporary work

    Oak St. Health

    New York, NY
    20 hours ago
  •  ...Machine Learning and Computer Algorithm Engineer. In this hands-on role, you will develop...  ...8+ years of experience and expertise in ML/algorithm development. Strong coding skills...  ...are essential, alongside experience with real-time computer vision pipelines. This is an exciting... 

    Peskind Executive Search

    New York, NY
    1 day ago
  • $300k - $400k

     ...Description As a Principal AI/ML Engineer in our AdTech team,...  ...ecosystem (e.g., real-time bidding and digital marketing...  ...highly performant, scalable, and reliable. You...  ...training to real-time inference - for our real-time bidding...  ...integrate with our ad serving architecture and... 

    Zeta-Global

    New York, NY
    4 days ago
  • $152k - $228k

     ...Job Description Senior ML Engineer About Invoca...  ...and fine-tuning through inference optimization and production...  ...'s ML stack — model serving, inference optimization...  ..., and build robust, scalable APIs for internal and...  ...regulations. Flexible Time Off – We encourage a... 
    Currently hiring
    Remote work
    Flexible hours

    Invoca

    New York, NY
    11 days ago
  • A leading payments technology firm in New York seeks an experienced professional to architect scalable ML systems for fraud detection. The role requires over 5 years of production ML experience and proficiency in Python and frameworks like PyTorch and TensorFlow. Ideal... 
    Flexible hours

    raincards.xyz

    New York, NY
    5 days ago
  •  ...helps contractors, engineering firms, and utilities...  ...of our training and inference pipelines, fortifying...  ...Design and maintain scalable architectures for serving deep learning models...  ...computer vision and time-series models on large...  ...and scaling ML applications. Infrastructure... 
    For contractors

    SewerAI Corporation

    New York, NY
    2 days ago
  • $190k - $260k

    Machine Learning Engineer - Search, Ranking...  ...Employment Type: Full-Time Experience Level...  ...will join the ML team to design,...  ...across a platform serving hundreds of...  ...accuracy and system scalability. Contribute to product...  ...processing for real‑time inference. Strong backend... 
    Full time
    H1b
    Remote work
    Relocation
    Visa sponsorship

    Fuku

    New York, NY
    1 day ago
  • $148.7k - $199.4k

     ...Machine Learning Engineer on Disney...  ...Entertainment & ESPN’s News ML team, you will...  ..., building scalable infrastructure for learning, inference, and monitoring...  ...impact and most time‑sensitive outcomes...  ...methods to solve real‑world...  ...‑latency online serving Experience designing... 
    Work experience placement

    Disney Cruise Line

    New York, NY
    5 days ago
  •  ...appointments, and handle real customer interactions...  ...the Role As an ML Research Engineer at Maple, you'll be...  ...-ready voice agents, serving millions of...  ...optimized production inference. Lead evaluations,...  ...maintain robustness and scalability. Balance research... 
    Work at office
    Local area

    Maple AI, Inc

    New York, NY
    3 days ago
  •  ...Development Full-Time Neurex AI is building...  ...operate inside real healthcare...  ...This role is for engineers who want to build...  ...the foundational ML infrastructure that...  ...Design and build scalable ML training and inference infrastructure Implement model serving systems for low-latency... 
    Remote job
    Full time

    Neurex AI Limited

    New York, NY
    2 days ago
  • $160k - $170k

     ...Machine Learning Engineer to help build, ship, and scale ML‑powered products...  ...businesses, and serve their own users....  ..., measurable, scalable, and valuable inside a real product. Whether...  ...experiences improve over time. Platform &...  ...trust, faster inference, better... 

    Medium

    New York, NY
    1 day ago
  •  ...technology company seeks a Machine Learning Engineer to design and operate cloud-native data...  .... This hands-on role involves building scalable data pipelines and integrating various event...  ...and 3-5 years of experience with data or ML systems. The ideal applicant possesses... 
    Remote job

    Twilio

    New York, NY
    2 days ago
  • $150k - $300k

     ...Goldman Sachs, our Engineers don't just make...  ...that build massively scalable software and...  ...Performance: Optimize inference latency and manage...  ...scale deployments serving thousands of internal...  ...Java processes to real-time, event-driven AI...  ...years focused on AI/ML integration in... 
    Full time
    Temporary work
    Part time
    Immediate start

    Goldman Sachs

    New York, NY
    5 days ago
  • $244k - $320k

    A leading AI marketing platform in New York is seeking a Senior Machine Learning Engineer to design and operate scalable ML systems that personalize experiences for millions of customers. This role involves end-to-end management of ML projects, collaboration with cross... 

    Attentive

    New York, NY
    5 days ago
  •  ...Voice AI economy, providing real-time APIs for speech-to-text (...  ...to a production API serving millions of requests is one...  ...hardest problems in AI. As an ML Ops Infrastructure Engineer at Deepgram, you will own...  ...frameworks such as NVIDIA Triton Inference Server, TensorRT, or ONNX... 
    Home office
    Flexible hours

    Deepgram

    New York, NY
    2 days ago
  • $180k - $225k

     ...vision is to build an ML/AI-first network of advertisers...  ...shoppers. As our Engineering Manager, Data Science,...  ...modeling, and real-time segmentation. Own Value...  ...engineering of high-scale inference pipelines, including...  ...stores, and low-latency serving infrastructure such as... 
    Remote job
    Work at office
    Shift work

    Fluent, LLC

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Real-Time ML Inference Engineer for Scalable Serving. Be the first to apply!