Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

LLMOps Platform Engineer for GPU AI Inference

Cloud Analytics Technologies, LLC

A leading AI infrastructure company located in New Jersey is seeking an experienced AI Operations Platform Consultant to lead and optimize large-scale GPU-accelerated AI platforms. The ideal candidate will have a strong background in deploying and managing LLM inference systems on Kubernetes, with expertise in TensorRT-LLM and Triton Inference Server. Responsibilities include managing production-grade LLM pipelines and ensuring operational reliability. This position is part of a team committed to diversity and equal opportunity. #J-18808-Ljbffr

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the LLMOps Platform Engineer for GPU AI Inference in Jersey City, NJ vacancy
  •  ...technology consulting firm seeks an AI Operations Platform Consultant with extensive...  ...in deploying and managing GPU-accelerated AI platforms. The role includes leading LLMOps processes, optimizing production...  ...utilizing tools like Triton Inference Server and TensorRT-LLM.... 
    Suggested

    ETHEREUM TECHNOLOGIES LLC

    Jersey City, NJ
    3 days ago
  •  ...technology firm based in Jersey City, NJ, seeks an AI Operations Platform Consultant to lead the deployment and management of large-scale GPU-accelerated AI systems. The role demands...  ...on the operational reliability of AI inference services. Ideal candidates will have... 
    Suggested

    Quantum Technologies USA

    Jersey City, NJ
    4 days ago
  •  ...firm in Jersey City seeks an AI Operations Platform Consultant with extensive...  ...deploying and managing large-scale GPU-accelerated AI platforms....  ...the deployment of LLM inference systems using Kubernetes, optimizing...  ..., and leading MLOps/LLMOps pipelines. The ideal candidate... 
    Suggested

    Robotics Technologies LLC

    Jersey City, NJ
    4 days ago
  •  ...Alumni Ventures is hiring for a Platform Engineering role in New York City, focused on developing an ultrafast AI inference platform. This position involves interesting challenges...  ...-level systems development, and efficient GPU workload management. Successful candidates... 
    Suggested
    Remote work

    Alumni Ventures

    New York, NY
    4 days ago
  •  ...Labs Apply: ****@*****.***.ai About Yotta Labs Yotta Labs...  ...for AI training and inference on a wide spectrum of hardware...  ...commodity to high-end GPUs. Our platform supports major large...  ...Overview We are seeking a GPU Cloud Platform Engineer to join our core infrastructure... 
    Suggested
    Full time
    Remote work
    Flexible hours

    Yotta Labs

    New York, NY
    2 days ago
  •  ...A pioneering AI infrastructure company is seeking a GPU Cloud Platform Engineer to design and operate large-scale GPU clusters. This remote position aims to ensure high availability and performance of containerized AI workloads across cloud environments. The ideal candidate... 
    Remote work

    Yotta Labs

    New York, NY
    2 days ago
  • $170k - $210k

     ...Utilidata is seeking an experienced AI Infrastructure Engineer responsible for designing and building the end-to-end infrastructure for AI and ML models. This role involves optimizing GPU utilization and ensuring high reliability of AI models deployed across edge, cloud... 
    Remote work

    Utilidata

    New York, NY
    2 days ago
  • $160k - $240k

     ...Bloomberg L.P. in New York is seeking a Senior Software Engineer for AI Inference to design and build scalable infrastructure for machine learning applications. The ideal candidate will have over 5 years of software engineering experience, expertise in distributed systems... 

    Bloomberg

    New York, NY
    3 days ago
  • $200k - $250k

     ...At Wizard AI, we’re building the top-performing...  ...power the core of our platform, and we’re seeking an experienced...  ...Senior MLOps Engineer to take ownership of how...  ...scaling – for a custom-built inference platform powering a...  ...latency, availability, GPU utilization, TTFT, ITL... 
    Remote work
    Flexible hours

    Wizard

    New York, NY
    2 days ago
  •  ...A technology solutions company in Jersey City seeks an AI Operations Platform Consultant to oversee the deployment and management of GPU-accelerated AI platforms and LLM inference systems. Candidates should have strong expertise in Kubernetes, TensorRT-LLM, and Triton... 

    Robotics Prcocess Automation, LLC

    Jersey City, NJ
    3 days ago
  • $171k - $260k

     ...Senior Lead Software Engineer - AI Platform engineer Jersey City, NJ, United States Job Information...  ...transformer architecture, ML training, and inference. Experience in solutions design and...  ...Foundational understanding of NVIDIA GPU Infrastructure software (e.g., NVIDIA... 
    Full time
    For contractors

    Aumni

    Jersey City, NJ
    3 days ago
  •  ...About Nomic Nomic builds AI agents and developer...  ...primarily in architecture, engineering, and construction,...  ...and project files. Our platform combines embedding models...  ...pioneered on-device LLM inference with GPT4All in 2023,...  ...infrastructure inference services, GPU workloads, model... 

    Nomic

    New York, NY
    4 days ago
  •  ...Senior Platform Engineer Who We Are MOXFIVE is building technologies that leverage AI to streamline response, recovery, and resilience from...  ...managing production model inference across hosted providers such...  ...Together AI or Fireworks.ai, GPU platforms such as RunPod or Lambda... 
    Local area

    MOXFIVE

    New York, NY
    9 hours ago
  • $128.7k - $261.3k

     ...Team The Model Deployment & Inference Solutions team in GM AV deploys...  ...: build the ML deployment platform that makes model rollouts fast...  ...currently performed manually by engineers. Build the developer...  ...Familiarity with the NVIDIA GPU stack at the integration level... 
    Flexible hours
    Shift work

    General Motors

    New York, NY
    2 days ago
  • $155k - $215k

     ...100,000 clients nationwide. Our ML and AI capabilities are expanding rapidly—powering...  ...them. As our first dedicated ML Platform Engineer, you'll define the technical direction...  ...production today and are investing in hosted GPU inference to support the next generation of our... 
    Full time
    Work at office
    Local area

    Charlie Health Engineering, Product & Design

    New York, NY
    5 days ago
  • $140k - $200k

     ...investment firm building a proprietary AI and data platform that powers our investment...  ...and structured finance. We are engineers and investors working together to...  ...model training, retraining, and inference (batch and real-time), including GPU compute provisioning and... 
    Flexible hours

    Anthelion Capital

    New York, NY
    4 days ago
  • $128.7k - $261.3k

     ...General Motors seeks a skilled professional to develop its ML deployment platform within the autonomous vehicle sector. This role involves automating model deployment from training to on-vehicle inference and enhancing developer experience through robust tooling. Candidates... 

    General Motors

    New York, NY
    2 days ago
  •  ...8+ years of experience as a Platform Engineer (Site Reliability / DevOps), with at least 3+ years in AI/ML platform development (MLOps...  ...distributed orchestration & inference frameworks. Experience with developing...  ...scaling. Understanding of LLMOps patterns — model registry,... 

    TechWize

    New York, NY
    3 days ago
  • Radimal is a veterinary radiology and AI diagnostics platform delivering 24/7 imaging insights to...  ...high-throughput medical imaging, GPU-backed inference, global distribution, and...  ..., more predictable, and easier for engineers to build on. Why This Role Exists Radimal... 
    Remote job
    Local area

    Radimal

    New York, NY
    2 days ago
  • $130.2k - $195.3k

     ...Overview We are seeking a Senior Lead / Lead ML Platform Engineer to architect and own the technical direction for our Training and Inference infrastructure. This is a high-leverage...  ...day, optimizing for both p99 latency and GPU utilization. Operational Excellence:... 
    Full time
    Shift work

    Paramount

    New York, NY
    1 day ago
  • $110k - $140k

     ...accessible for enterprises and AI innovators around the world....  ...global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage...  ...skilled and experienced AI Platform Engineer to own the strategy and execution...  ...‑on experience deploying LLM inference infrastructure and a genuine... 
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Vultr

    New York, NY
    2 days ago
  • $120k - $160k

     ...Description Uncountable Engineering is seeking driven software engineers with a focus on Generative AI deployment in software. Uncountable’s software platform is used by scientists in leading...  ...infrastructure for fine‑tuning and inference, etc) Uncountable has a... 

    Uncountable Inc.

    New York, NY
    4 days ago
  • $170k - $235k

     ...engaged to conduct a search for a Senior Platform Engineer for a rapidly growing, venture-backed...  ...development, data systems, and AI-driven services. This is an opportunity...  ...performance across APIs, data pipelines, and AI inference systems Champion security,... 
    Temporary work
    Interim role
    Immediate start

    Scion Staffing

    New York, NY
    2 days ago
  •  ...Beam is an ultrafast AI inference platform. We built a serverless runtime that launches GPU-backed containers in less than 1 second and quickly scales out to thousands...  ...to hire someone to help us with Platform Engineering work. We’re working on lots of fun problems: Low... 
    Work at office
    Remote work

    Alumni Ventures

    New York, NY
    4 days ago
  •  ...This role spans backend product engineering and infrastructure. You'll build backend...  ...keeps them running in production. The platform processes millions of clinical...  ...as Triomics cloud environments, with GPU infrastructure serving AI extraction models. We need someone who... 
    Day shift

    triomics inc.

    New York, NY
    9 hours ago
  • $122k - $200k

     ...responsible for defining and leading the engineering approach for complex features to...  ...layer for the GenAI platform across AWS, Azure, and GCP, enabling...  ...Experience supporting AI/ML or GenAI workloads, including: Model inference endpoints and API gateways High-... 
    Shift work
    Day shift

    Hobbsnews

    New York, NY
    9 hours ago
  • $133.9k - $154.5k

     ...Every day we work toward transforming global markets. The AI Platform Engineering Lead drives the AI Platform Operations team, guiding...  ...standards, and governance for AI/ML infrastructure, including GPU cluster design, compute resource planning, security controls... 
    Full time
    Contract work
    Temporary work
    Flexible hours

    Intercontinental Exchange

    New York, NY
    4 days ago
  •  ...A client of Innova Solutions is immediately hiring for a BI Platform Engineer. Position type: Contract Duration: 12 + Months Locations...  .... By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from Innova... 
    Contract work
    Temporary work
    Work experience placement
    Immediate start
    Worldwide
    Flexible hours

    Innova Solutions

    Jersey City, NJ
    1 day ago
  • $195k - $265k

     ...Security Platform Engineer San Francisco or New York About Pallet Pallet is building AI Agents to transform logistics — a $12 trillion global industry. We've raised...  ...to support model training, evaluation, and inference infrastructure Identify systemic risks and... 
    Full time
    Temporary work
    Work at office
    Local area
    Remote work
    Flexible hours

    Pallet Service Corporation

    New York, NY
    9 days ago
  • $175k - $250k

     ...Platform Engineer - AI Core We are looking for a platform infrastructure engineer to design, build, and operate cloud-native infrastructure...  ...agents, model-serving pipelines, and long-running inference workloads - present unique infrastructure challenges: multi... 
    Immediate start

    Millennium Management Corp

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to LLMOps Platform Engineer for GPU AI Inference. Be the first to apply!