Remote Model Serving Engineer - AI Inference Platform

Bright Vision Technologies

Remote job

Bright Vision Technologies is seeking a Model Serving Engineer to design and operate high-performance inference platforms. This role focuses on optimizing AI deployment systems, balancing latency and throughput while ensuring reliability. Applicants should have a strong background in distributed systems and ML platform engineering, with proficiency in Python and systems languages. The position offers the flexibility of 100% remote work across continental United States. #J-18808-Ljbffr Bright Vision Technologies

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Remote Model Serving Engineer - AI Inference Platform in Bartlett, IL vacancy

Senior Model Serving Engineer - Remote AI Inference
Bright Vision Technologies is hiring a Model Serving Engineer to design scalable AI inference platforms. This role demands experience with high-throughput services... ...and systems language skills. The position is fully remote for candidates in the Continental United States....
Remote job
Platform
Bright Vision Technologies
New York, NY
4 days ago
Senior Model Serving Engineer - High-Performance AI (Remote)
$100k - $150k
Bright Vision Technologies is looking for a Model Serving Engineer to join its team remotely. This position will focus on designing, building, and operating high-performance inference platforms for machine learning models. Responsibilities include optimizing performance...
Remote job
Platform
Bright Vision Technologies
Newark, CA
2 days ago
Senior Engineer 2: AI Inference Engine Systems
$167.2k - $209k
...is expanding its AI Infrastructure... ...seeking a Senior Engineer 2 to join our AI Inference Data Plane team.... ...and scale their models with industry-leading... ...ensure superior platform health. What... ...inference serving frameworks such... ...,000 This is a remote role Why You’ll...
Remote work
Platform
Local area
Worldwide
Flexible hours
DigitalOcean
San Francisco, CA
5 days ago
Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)
$184.94k - $305.13k
The vLLM and LLM-D Engineering team at Red Hat... ...our cutting‑edge inference platform (LLM-D ( and vLLM... ...distributed Large Language Model (LLM) inference... ...disaggregated serving, KV‑cache aware... ...CNI failures. AI Inference... ...For positions with Remote-US locations, the...
Remote work
Platform
Permanent employment
Full time
Contract work
Work experience placement
Work at office
Flexible hours
Red Hat, Inc.
Boston, MA
3 days ago
Product Manager - AI Inference & Model Serving
...is the Kubernetes-native AI infrastructure company, enabling... ..., Mirantis empowers platform engineering teams to deliver composable... ...Product Manager to own AI inference and model serving for k0rdent AI, our control... ...job opportunities. #remote We are a Leader for Container...
Remote work
Platform
Mirantis
Austin, TX
a month ago
Remote Quantitative AI Evaluator & Model Validator
$60 per hour
...developing cutting-edge AI systems, while... ...flexibility of remote work and... .... AI models are increasingly... ...assessment (this serves as our version... ...available on our platform. Benefits... ...and statistical inference. Write clear... ..., Mathematics, Engineering, or similar); a...
Remote work
Platform
Hourly pay
Full time
Flexible hours
DataAnnotation
Sioux Falls, SD
1 day ago
Real-Time Inference & Model Serving Engineer (Equity)
$220k - $320k
ML Model Serving Engineer Want to build the layer that actually makes AI usable in real time? You’ll join a team focused on inference, where performance is the product. This is about delivering low... .... You’ll sit at the core of the platform, working across model serving,...
Platform
3 days per week
Trades Workforce Solutions
San Francisco, CA
4 days ago
Remote Model Serving Architect for High-Performance AI
...Technologies is seeking a Model Serving Engineer responsible for building and... ...operating high-performance platforms for serving machine learning... .... This role will optimize inference performance and implement observability... ...strategies in a fully remote setting. The ideal...
Remote job
Platform
Bright Vision Technologies
Plano, TX
4 days ago
Model Serving Engineer
$100k - $150k
...we’re looking for a skilled Model Serving Engineer to join our dynamic team... ...Engineer Location: 100% Remote (Continental United States)... ...performance, highly reliable inference platforms for serving large machine... ...systems engineering side of AI deployment, including request...
Remote work
Platform
Full time
H1b
Local area
Immediate start
Visa sponsorship
Work visa
Bright Vision Technologies
United States
1 hour ago
ML Engineer - Inference & Model Deployment
...100x better job search engine: fast, comprehensive, honest... ...help us turn powerful AI and ML models into fast, reliable... ...deploying models, optimizing inference latency and throughput, scaling serving systems, and making... ...comfortable with cloud platforms, distributed systems,...
Platform
Relocation package
HiringCafe
Cupertino, CA
1 day ago
Inference Engineer
Machine Learning Engineer, Inference Want to solve realtime... ...-growing voice AI company building the... ...generic AI platform role focused on wrapping... ...of-the-art speech models actually behave... ...Runtime, and custom serving systems Managing KV... .... Location: Remote across the US or Europe...
Remote work
Platform
Flexible hours
Trades Workforce Solutions
San Francisco, CA
4 days ago
Senior Distinguished Engineer, AI Compute (Remote Eligible)
$314.8k - $359.3k
...Senior Distinguished Engineer, AI Compute (Remote Eligible) At... ...to reimagine how we serve our customers and businesses... ...machine learning platform organization manages... ...running large foundation models Work cross‑... ...or high‑throughput inference Hands‑on experience...
Remote work
Platform
Full time
Part time
Local area
Capital One
Cambridge, MA
1 day ago
Software Engineer - Hosted Model Infrastructure
$145k - $200k
...who need it, our platforms empower our... ...are a software engineering team with expertise... ...in enabling ML models in production. We deploy AI models to run... ...full stack, from inference engines, GPU... ...and Go Model serving engines for GPU... ...that allow for “Remote” work on an exceptional...
Remote work
Platform
Work experience placement
Work at office
Work from home
Relocation package
Palantir Technologies
Palo Alto, CA
2 days ago
AI Engineer - Model Performance
...Hybrid Department AI ABOUT FATHOM We... ...We're hiring a Model Performance Engineer to own the speed,... ...reliability of our model inference stack, and to... ...real systems serving millions of meetings... ...serverless GPU platforms. Understanding of... ...being fully remote. We schedule meetings...
Remote work
Platform
Full time
Pantera Capital
San Francisco, CA
4 days ago
Software Engineer, Model Routing & Inference Engineering · · New York; San Francisco Apply →
...inventive research, design, and engineering. Our organization is very... ...Software Engineer on the Model Routing & Inference team at Cursor, you'll build the inference platform that powers every AI interaction in the... ...especially in inference serving, traffic routing, or real...
Platform
Anysphere
New York, NY
2 days ago
Inference Optimization Engineer United States - Remote · Remote
$198k - $286k
...to revolutionize AI infrastructure by... ...Modular, we optimize inference from kernel to... ...differentiated cloud platform that delivers state... ..., the inference engine, and distributed systems... ...Los Altos, CA or remotely from home.... ...performance of LLMs served on Modular Cloud to...
Remote job
Platform
Work experience placement
Work at office
Local area
Flexible hours
Modular Mailing Systems, Inc.
Los Altos, CA
3 days ago
WW Consulting Engineer - AI/ML
$182.9k - $274.9k
...global Field Sales Engineers, customers, and... ..., and edge AI/ML architectures... ...deployment model is most appropriate... ...for Apple platforms across enterprise... ...AFM), on-device inference, and Apple Intelligence... ...and model serving architecture... ...model serving, and remote client access....
Remote work
Platform
Relocation
Shift work
Apple Inc.
Chicago, IL
5 days ago
Senior Software Engineer, Distributed Systems - NIM Factory
$168k - $270.25k
Senior Software Engineer, Distributed Systems... ...Clara: US, TX, Remote: US, NY, Remote:... ...45NVIDIA is the platform upon which every new AI-powered... ...automation for NVIDIA Inference Microservices (NIMs... ...optimizes and serves performant... ...inferencing for every AI model in a...
Remote work
Platform
NVIDIA Corporation
Santa Clara, CA
3 days ago
Quality Engineer III (AI and ML Platforms)
$95.38k - $160.85k
Quality Engineer III (AI and ML Platforms)Skip to main contentWe use cookies... ...across data, model, and infrastructure... ...layers. The position serves as a key contributor... ...and model validation, inference testing, integration... ...with geographically remote and culturally diverse...
Remote work
Platform
Work experience placement
Work at office
Local area
Flexible hours
ICW Group
San Diego, CA
2 days ago
Senior AI Research Engineer Model Inference Remote
...Job We are looking for an experienced AI Model Engineer with deep expertise in kernel development... .... The engineer will extend the inference framework to support inference and fine... ...functional teams to integrate optimized serving and inference frameworks into production...
Remote job
Framework Ventures
New York, NY
4 days ago
Lead Traffic Engineer
...25 applicants Get AI-powered advice on this... ...: California (Remote) | Department: Transportation Engineering We are seeking a Lead... ...Traffic Engineer to serve as a senior technical... ...at talisman by 2x Inferred from the... ...L5) - Open Connect Platform United States $100,...
Remote work
Platform
Full time
For contractors
Work at office
talisman
California, MO
4 days ago
AI Field Engineer
$176k - $228k
AI Field Engineer We're a well-funded AI infrastructure company building the platform that lets teams run, tune, and scale AI on open models in production. Our infrastructure... ...Experience with inference serving frameworks (vLLM,... ...Mateo, CA; open to remote within the US. Regular...
Remote work
Platform
Full time
H1b
Visa sponsorship
David Joseph & Company
New York, NY
4 days ago
Senior Specialist Field Engineer - HPC/AI/ML
...client running AI workloads at scale... ...of our cloud platform. The CX... ...internal and customer engineering teams, offering... ...role, you will: Serve as the primary... ...on AI/ML inference. Fluency in cloud... ...optimization, or advanced model‑server... ...work environment, remote work may be...
Remote work
Platform
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
Neura Market
Livingston, NJ
5 days ago
Senior ML Inference Platform Engineer (Remote)
...Senior ML Infrastructure Engineer in Mountain View,... ...build and scale robust platforms for ML inference workflows supporting GM’s AI efforts. You will collaborate... ...to implement model serving strategies and handle backend... ...skills. The role offers a remote work setup with...
Remote job
Platform
Israelvcforum
Mountain View, CA
4 days ago
Senior AI Systems Engineer
...operational support of AI platforms, tools, and services,... ...cloud, and platform engineering teams.... ...MLOps pipelines for model packaging, testing, deployment... ...Experience with model serving, inference optimization, or AI platform... ...experience. REMOTE WORK NOTICE This position...
Remote work
Platform
Work at office
Applied Research Associates, Inc.
Raleigh, NC
3 days ago
Senior Machine Learning Engineer, Trust & Safety
$186k - $245k
...Machine Learning Engineer, where you'll... ...application of AI and machine learning... ...safety on the platform. Working... ...machine learning models to identify and... ...preprocess data, run inference, and manage... ..., model serving environment, observability... ...Parents Remote #J-18808-Ljbffr...
Remote work
Platform
Work experience placement
Hinge
New York, NY
3 days ago
Senior Site Reliability Engineer AI Infrastructure
Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco •... ...’s needed most. Our platform routes training and inference jobs across global... ...Technical Partnership: Serve as the primary... ...t need to write the models, but you need to understand...
Remote work
Platform
Full time
Cortes 23
San Francisco, CA
4 days ago
Field Application Engineer
$140k - $180k
...Field Applications Engineer (FAE) Who We Are... ...challenges in large scale AI data center... ...operate across a mix of remote and on-site... ...customer environments Serve as the technical... ..., SRE, and platform engineering teams... ...job scheduling, and inference/training workloads...
Remote work
Platform
Delos Data Inc
Palo Alto, CA
5 days ago
Sr. Site Reliability Engineer
...Site Reliability Engineer Job type: Full Time · Department: Platform · Work type: On-Site... ..., United States (Remote) Optura is healthcare’s AI orchestration platform... ...supports multiple model providers,... ...scheduling, model‑serving stacks, inference autoscaling OSS contributions...
Remote work
Platform
Full time
Neara
San Francisco, CA
5 days ago
Senior Site Reliability Engineer (Remote Poland)
...reliability and AI operations foundation... ...intelligence platform that runs the most... ...Site Reliability Engineer who wants to own... ...This role is a remote position for candidates... ...Enablement Serve as the primary reliability... ...provisioning, model serving, inference latency, and...
Remote job
Platform
Flexible hours
Tech Insights
New York, NY
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Remote Model Serving Engineer - AI Inference Platform. Be the first to apply!