Real-Time ML Inference Engineer for Scalable Serving
Yobi
A behavioral AI company is seeking a Machine Learning Engineer to design and optimize the systems that bring its models to life. The role involves ensuring ML models are efficient and reliable, and requires experience in model deployment and strong coding skills. Candidates should be familiar with low-latency techniques and bring operational maturity in ML systems. This position can be remote or hybrid from several hubs.
- ...ML Engineer Jersey City, NJ, 07311 (4 days onsite per week) Video... ...experimentation to production by building scalable, reliable systems that serve predictions in real time or batch environments. What... ...Optimization: Improve model inference speed and scalability.... [Suggested]
- ...Learning / Software Engineer Dyania... ...solves important real-world problems,... .... As a senior ML engineer at Dyania... ..., and deploy scalable ML-driven... ...deployment, and inference at scale. Architect... ...deploying, and serving ML models in... ...Generous Paid Time Off (Vacation,... [Suggested, Internship, Local area, Remote work, Flexible hours, Shift work]
- Machine Learning Engineer - Inference / Serving Join to apply for the Machine Learning... ...human behavior grounded in real‑world actions such as purchases... ...AI models to life in real time. You’ll work at the core of... ...products. This is an applied ML systems role: equal parts engineering... [Suggested, Full time, Remote work]
- Instacart is hiring a Senior Machine Learning Engineer to join the Matching & Positioning team focused on real-time decision-making for fulfillment processes. This role requires expertise in operations research and machine learning to design algorithms impacting profitability... [Suggested, Remote work]
$150k - $200k
Affirm is seeking a Senior Machine Learning Engineer (Fraud) to lead the development of fraud prediction models. You will be working... ...and will collaborate with cross-functional teams to build ML systems for real-time transaction decisions. Candidates should have 6+ years of... [Suggested, Remote job]
$200k - $250k
...quality, and trust. Our ML models power the... ...Senior MLOps Engineer to take ownership... ...for a custom-built inference platform powering... ...engines handling real-time inference for high... ...Define and enforce serving-layer SLAs - latency... ...cost-efficient, and scalable, partnering with... [Remote work, Flexible hours]
- Olik Global is seeking an experienced Data Engineer for projects in New Jersey, Irving, and... ...a focus on MemSQL (SingleStore) for real-time processing. The ideal candidate will require... ...crucial. Join a dynamic team to drive scalable data solutions.
- ...Learning Engineer to serve as a hands-on... ...most of their time working directly... ...principal-level engineer: shaping unclear... ...Develop practical ML models that... ...batch scoring, real-time or near-real-time inference, model versioning... ...supportable, secure, scalable, and aligned... [Temporary work, Remote work]
- Mirage is looking for an ML Engineer in New York City to build and scale systems for video generation models. The role focuses on optimizing advanced models for real-time generation and includes responsibilities like training models and improving system efficiency. The...
- ...Job Description: ML Engineer 3M Health Care is now... ...work reliably in the real world. You will help... ...services are secure and scalable. Key Responsibilities... ...or Git). Model Serving: Deploy ML models as... ...for model training and inference. Feature Management... [H1b, Remote work]
- The New York Times is seeking a Senior Data Engineer in New York City to contribute to the Customer-Facing Data Products team. This role involves developing real-time data pipelines and APIs that serve customer needs. The ideal candidate has over 5 years of experience...
$148.7k - $199.4k
...global organization of engineers, product... ..., innovation, and scalability for our businesses... ...media touch points serving millions of people... ...identity. The News ML team is responsible... ...models to enable real-time content personalization... ...learning, inference, and monitoring, conduct... [Work experience placement, Local area, Day shift]
$145k - $180k
...Why this role matters: The ML Engineer is a new role within... ...and optimizing ML inference systems that run in production... ...video as well as real-time SBERT for embedding... ...Design and implement scalable data processing... ...transformer, rewriting its serving path for a 2-3x latency...
$148.7k - $199.4k
...Machine Learning Engineer - News Technology... ..., innovation, and scalability for our businesses... ...touch points serving millions of people... ...identity. The News ML team is responsible... ...models to enable real-time content personalization... ...learning, inference, and monitoring, conduct... [Work experience placement, Local area, Day shift]
$111.24k - $222.48k
...Machine Learning Engineer We're... ...community at a time. At CVS Health... ...making to better serve millions of customers... ...expertise in ML to work on... ...technologies into real business... ...code quality, and scalable architecture. Influence... ..., causal inference, LLM, MCP ~ Experience... [Hourly pay, Full time, Temporary work]
- ...Machine Learning and Computer Algorithm Engineer. In this hands-on role, you will develop... ...8+ years of experience and expertise in ML/algorithm development. Strong coding skills... ...are essential, alongside experience with real-time computer vision pipelines. This is an exciting...
$300k - $400k
...Description As a Principal AI/ML Engineer in our AdTech team,... ...ecosystem (e.g., real-time bidding and digital marketing... ...highly performant, scalable, and reliable. You... ...training to real-time inference - for our real-time bidding... ...integrate with our ad serving architecture and...
$152k - $228k
...Job Description Senior ML Engineer About Invoca... ...and fine-tuning through inference optimization and production... ...'s ML stack: model serving, inference optimization... ..., and build robust, scalable APIs for internal and... ...regulations. Flexible Time Off – We encourage a... [Currently hiring, Remote work, Flexible hours]
- A leading payments technology firm in New York seeks an experienced professional to architect scalable ML systems for fraud detection. The role requires over 5 years of production ML experience and proficiency in Python and frameworks like PyTorch and TensorFlow. Ideal... [Flexible hours]
- ...helps contractors, engineering firms, and utilities... ...of our training and inference pipelines, fortifying... ...Design and maintain scalable architectures for serving deep learning models... ...computer vision and time-series models on large... ...and scaling ML applications. Infrastructure... [For contractors]
$190k - $260k
Machine Learning Engineer - Search, Ranking... ...Employment Type: Full-Time Experience Level... ...will join the ML team to design,... ...across a platform serving hundreds of... ...accuracy and system scalability. Contribute to product... ...processing for real‑time inference. Strong backend... [Full time, H1b, Remote work, Relocation, Visa sponsorship]
$148.7k - $199.4k
...Machine Learning Engineer on Disney... ...Entertainment & ESPN’s News ML team, you will... ..., building scalable infrastructure for learning, inference, and monitoring... ...impact and most time‑sensitive outcomes... ...methods to solve real‑world... ...‑latency online serving Experience designing... [Work experience placement]
- ...appointments, and handle real customer interactions... ...the Role As an ML Research Engineer at Maple, you'll be... ...-ready voice agents, serving millions of... ...optimized production inference. Lead evaluations,... ...maintain robustness and scalability. Balance research... [Work at office, Local area]
- ...Development Full-Time Neurex AI is building... ...operate inside real healthcare... ...This role is for engineers who want to build... ...the foundational ML infrastructure that... ...Design and build scalable ML training and inference infrastructure Implement model serving systems for low-latency... [Remote job, Full time]
$160k - $170k
...Machine Learning Engineer to help build, ship, and scale ML‑powered products... ...businesses, and serve their own users.... ..., measurable, scalable, and valuable inside a real product. Whether... ...experiences improve over time. Platform &... ...trust, faster inference, better...
- ...technology company seeks a Machine Learning Engineer to design and operate cloud-native data... .... This hands-on role involves building scalable data pipelines and integrating various event... ...and 3-5 years of experience with data or ML systems. The ideal applicant possesses... [Remote job]
$150k - $300k
...Goldman Sachs, our Engineers don't just make... ...that build massively scalable software and... ...Performance: Optimize inference latency and manage... ...scale deployments serving thousands of internal... ...Java processes to real-time, event-driven AI... ...years focused on AI/ML integration in... [Full time, Temporary work, Part time, Immediate start]
$244k - $320k
A leading AI marketing platform in New York is seeking a Senior Machine Learning Engineer to design and operate scalable ML systems that personalize experiences for millions of customers. This role involves end-to-end management of ML projects, collaboration with cross...
- ...Voice AI economy, providing real-time APIs for speech-to-text (... ...to a production API serving millions of requests is one... ...hardest problems in AI. As an ML Ops Infrastructure Engineer at Deepgram, you will own... ...frameworks such as NVIDIA Triton Inference Server, TensorRT, or ONNX... [Home office, Flexible hours]
$180k - $225k
...vision is to build an ML/AI-first network of advertisers... ...shoppers. As our Engineering Manager, Data Science,... ...modeling, and real-time segmentation. Own Value... ...engineering of high-scale inference pipelines, including... ...stores, and low-latency serving infrastructure such as... [Remote job, Work at office, Shift work]
- entry level machine learning engineer New York, NY
- machine learning ai engineer New York, NY
- junior machine learning research engineer New York, NY
- ai ml engineer New York, NY
- senior ml engineer New York, NY
- machine learning engineer New York, NY
- graduate machine learning engineer New York, NY
- data scientist machine learning engineer New York, NY
- computer vision machine learning engineer New York, NY
- machine learning software engineer New York, NY