Remote Model Serving Engineer - AI Inference Platform

Bright Vision Technologies

Bartlett, IL
  • Remote job

Bright Vision Technologies is seeking a Model Serving Engineer to design and operate high-performance inference platforms. This role focuses on optimizing AI deployment systems, balancing latency and throughput while ensuring reliability. Applicants should have a strong background in distributed systems and ML platform engineering, with proficiency in Python and systems languages. The position is 100% remote for candidates within the continental United States.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Remote Model Serving Engineer - AI Inference Platform vacancy in Bartlett, IL
  • Bright Vision Technologies is hiring a Model Serving Engineer to design scalable AI inference platforms. This role demands experience with high-throughput services...  ...and systems language skills. The position is fully remote for candidates in the Continental United States.... 
    Remote job
    Platform

    Bright Vision Technologies

    New York, NY
    4 days ago
  • $100k - $150k

    Bright Vision Technologies is looking for a Model Serving Engineer to join its team remotely. This position will focus on designing, building, and operating high-performance inference platforms for machine learning models. Responsibilities include optimizing performance... 
    Remote job
    Platform

    Bright Vision Technologies

    Newark, CA
    2 days ago
  • $167.2k - $209k

     ...is expanding its AI Infrastructure...  ...seeking a Senior Engineer 2 to join our AI Inference Data Plane team....  ...and scale their models with industry-leading...  ...ensure superior platform health. What...  ...inference serving frameworks such...  ...,000 This is a remote role Why You’ll... 
    Remote work
    Platform
    Local area
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    5 days ago
  • $184.94k - $305.13k

    The vLLM and LLM-D Engineering team at Red Hat...  ...our cutting‑edge inference platform (LLM-D and vLLM...  ...distributed Large Language Model (LLM) inference...  ...disaggregated serving, KV‑cache aware...  ...CNI failures. AI Inference...  ...For positions with Remote-US locations, the... 
    Remote work
    Platform
    Permanent employment
    Full time
    Contract work
    Work experience placement
    Work at office
    Flexible hours

    Red Hat, Inc.

    Boston, MA
    3 days ago
  •  ...is the Kubernetes-native AI infrastructure company, enabling...  ..., Mirantis empowers platform engineering teams to deliver composable...  ...Product Manager to own AI inference and model serving for k0rdent AI, our control...  ...job opportunities. #remote We are a Leader for Container... 
    Remote work
    Platform

    Mirantis

    Austin, TX
    a month ago
  • $60 per hour

     ...developing cutting-edge AI systems, while...  ...flexibility of remote work and...  .... AI models are increasingly...  ...assessment (this serves as our version...  ...available on our platform. Benefits...  ...and statistical inference. Write clear...  ..., Mathematics, Engineering, or similar); a... 
    Remote work
    Platform
    Hourly pay
    Full time
    Flexible hours

    DataAnnotation

    Sioux Falls, SD
    1 day ago
  • $220k - $320k

    ML Model Serving Engineer Want to build the layer that actually makes AI usable in real time? You’ll join a team focused on inference, where performance is the product. This is about delivering low...  .... You’ll sit at the core of the platform, working across model serving,... 
    Platform
    3 days per week

    Trades Workforce Solutions

    San Francisco, CA
    4 days ago
  •  ...Technologies is seeking a Model Serving Engineer responsible for building and...  ...operating high-performance platforms for serving machine learning...  .... This role will optimize inference performance and implement observability...  ...strategies in a fully remote setting. The ideal... 
    Remote job
    Platform

    Bright Vision Technologies

    Plano, TX
    4 days ago
  • $100k - $150k

     ...we’re looking for a skilled Model Serving Engineer to join our dynamic team...  ...Engineer Location: 100% Remote (Continental United States)...  ...performance, highly reliable inference platforms for serving large machine...  ...systems engineering side of AI deployment, including request... 
    Remote work
    Platform
    Full time
    H1b
    Local area
    Immediate start
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    United States
    1 hour ago
  •  ...100x better job search engine: fast, comprehensive, honest...  ...help us turn powerful AI and ML models into fast, reliable...  ...deploying models, optimizing inference latency and throughput, scaling serving systems, and making...  ...comfortable with cloud platforms, distributed systems,... 
    Platform
    Relocation package

    HiringCafe

    Cupertino, CA
    1 day ago
  • Machine Learning Engineer, Inference Want to solve realtime...  ...-growing voice AI company building the...  ...generic AI platform role focused on wrapping...  ...of-the-art speech models actually behave...  ...Runtime, and custom serving systems Managing KV...  .... Location: Remote across the US or Europe... 
    Remote work
    Platform
    Flexible hours

    Trades Workforce Solutions

    San Francisco, CA
    4 days ago
  • $314.8k - $359.3k

     ...Senior Distinguished Engineer, AI Compute (Remote Eligible) At...  ...to reimagine how we serve our customers and businesses...  ...machine learning platform organization manages...  ...running large foundation models Work cross‑...  ...or high‑throughput inference Hands‑on experience... 
    Remote work
    Platform
    Full time
    Part time
    Local area

    Capital One

    Cambridge, MA
    1 day ago
  • $145k - $200k

     ...who need it, our platforms empower our...  ...are a software engineering team with expertise...  ...in enabling ML models in production. We deploy AI models to run...  ...full stack, from inference engines, GPU...  ...and Go Model serving engines for GPU...  ...that allow for “Remote” work on an exceptional... 
    Remote work
    Platform
    Work experience placement
    Work at office
    Work from home
    Relocation package

    Palantir Technologies

    Palo Alto, CA
    2 days ago
  •  ...Hybrid Department AI ABOUT FATHOM We...  ...We're hiring a Model Performance Engineer to own the speed,...  ...reliability of our model inference stack, and to...  ...real systems serving millions of meetings...  ...serverless GPU platforms. Understanding of...  ...being fully remote. We schedule meetings... 
    Remote work
    Platform
    Full time

    Pantera Capital

    San Francisco, CA
    4 days ago
  •  ...inventive research, design, and engineering. Our organization is very...  ...Software Engineer on the Model Routing & Inference team at Cursor, you'll build the inference platform that powers every AI interaction in the...  ...especially in inference serving, traffic routing, or real... 
    Platform

    Anysphere

    New York, NY
    2 days ago
  • $198k - $286k

     ...to revolutionize AI infrastructure by...  ...Modular, we optimize inference from kernel to...  ...differentiated cloud platform that delivers state...  ..., the inference engine, and distributed systems...  ...Los Altos, CA or remotely from home....  ...performance of LLMs served on Modular Cloud to... 
    Remote job
    Platform
    Work experience placement
    Work at office
    Local area
    Flexible hours

    Modular Mailing Systems, Inc.

    Los Altos, CA
    3 days ago
  • $182.9k - $274.9k

     ...global Field Sales Engineers, customers, and...  ..., and edge AI/ML architectures...  ...deployment model is most appropriate...  ...for Apple platforms across enterprise...  ...AFM), on-device inference, and Apple Intelligence...  ...and model serving architecture...  ...model serving, and remote client access.... 
    Remote work
    Platform
    Relocation
    Shift work

    Apple Inc.

    Chicago, IL
    5 days ago
  • $168k - $270.25k

    Senior Software Engineer, Distributed Systems...  ...Clara: US, TX, Remote: US, NY, Remote:...  ...45NVIDIA is the platform upon which every new AI-powered...  ...automation for NVIDIA Inference Microservices (NIMs...  ...optimizes and serves performant...  ...inferencing for every AI model in a... 
    Remote work
    Platform

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $95.38k - $160.85k

    Quality Engineer III (AI and ML Platforms)...  ...across data, model, and infrastructure...  ...layers. The position serves as a key contributor...  ...and model validation, inference testing, integration...  ...with geographically remote and culturally diverse... 
    Remote work
    Platform
    Work experience placement
    Work at office
    Local area
    Flexible hours

    ICW Group

    San Diego, CA
    2 days ago
  •  ...Job We are looking for an experienced AI Model Engineer with deep expertise in kernel development...  .... The engineer will extend the inference framework to support inference and fine...  ...functional teams to integrate optimized serving and inference frameworks into production... 
    Remote job

    Framework Ventures

    New York, NY
    4 days ago
  •  ...25 applicants Get AI-powered advice on this...  ...: California (Remote) | Department: Transportation Engineering We are seeking a Lead...  ...Traffic Engineer to serve as a senior technical...  ...at talisman by 2x Inferred from the...  ...L5) - Open Connect Platform United States $100,... 
    Remote work
    Platform
    Full time
    For contractors
    Work at office

    talisman

    California, MO
    4 days ago
  • $176k - $228k

    AI Field Engineer We're a well-funded AI infrastructure company building the platform that lets teams run, tune, and scale AI on open models in production. Our infrastructure...  ...Experience with inference serving frameworks (vLLM,...  ...Mateo, CA; open to remote within the US. Regular... 
    Remote work
    Platform
    Full time
    H1b
    Visa sponsorship

    David Joseph & Company

    New York, NY
    4 days ago
  •  ...client running AI workloads at scale...  ...of our cloud platform. The CX...  ...internal and customer engineering teams, offering...  ...role, you will: Serve as the primary...  ...on AI/ML inference. Fluency in cloud...  ...optimization, or advanced model‑server...  ...work environment, remote work may be... 
    Remote work
    Platform
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    Neura Market

    Livingston, NJ
    5 days ago
  •  ...Senior ML Infrastructure Engineer in Mountain View,...  ...build and scale robust platforms for ML inference workflows supporting GM’s AI efforts. You will collaborate...  ...to implement model serving strategies and handle backend...  ...skills. The role offers a remote work setup with... 
    Remote job
    Platform

    Israelvcforum

    Mountain View, CA
    4 days ago
  •  ...operational support of AI platforms, tools, and services,...  ...cloud, and platform engineering teams....  ...MLOps pipelines for model packaging, testing, deployment...  ...Experience with model serving, inference optimization, or AI platform...  ...experience. REMOTE WORK NOTICE This position... 
    Remote work
    Platform
    Work at office

    Applied Research Associates, Inc.

    Raleigh, NC
    3 days ago
  • $186k - $245k

     ...Machine Learning Engineer, where you'll...  ...application of AI and machine learning...  ...safety on the platform. Working...  ...machine learning models to identify and...  ...preprocess data, run inference, and manage...  ..., model serving environment, observability...  ...Parents Remote... 
    Remote work
    Platform
    Work experience placement

    Hinge

    New York, NY
    3 days ago
  • Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco •...  ...’s needed most. Our platform routes training and inference jobs across global...  ...Technical Partnership: Serve as the primary...  ...t need to write the models, but you need to understand... 
    Remote work
    Platform
    Full time

    Cortes 23

    San Francisco, CA
    4 days ago
  • $140k - $180k

     ...Field Applications Engineer (FAE) Who We Are...  ...challenges in large scale AI data center...  ...operate across a mix of remote and on-site...  ...customer environments Serve as the technical...  ..., SRE, and platform engineering teams...  ...job scheduling, and inference/training workloads... 
    Remote work
    Platform

    Delos Data Inc

    Palo Alto, CA
    5 days ago
  •  ...Site Reliability Engineer Job type: Full Time · Department: Platform · Work type: On-Site...  ..., United States (Remote) Optura is healthcare’s AI orchestration platform...  ...supports multiple model providers,...  ...scheduling, model‑serving stacks, inference autoscaling OSS contributions... 
    Remote work
    Platform
    Full time

    Neara

    San Francisco, CA
    5 days ago
  •  ...reliability and AI operations foundation...  ...intelligence platform that runs the most...  ...Site Reliability Engineer who wants to own...  ...This role is a remote position for candidates...  ...Enablement Serve as the primary reliability...  ...provisioning, model serving, inference latency, and... 
    Remote job
    Platform
    Flexible hours

    Tech Insights

    New York, NY
    4 days ago
