LLMOps Platform Engineer for GPU AI Inference

Cloud Analytics Technologies, LLC

A leading AI infrastructure company located in New Jersey is seeking an experienced AI Operations Platform Consultant to lead and optimize large-scale GPU-accelerated AI platforms. The ideal candidate will have a strong background in deploying and managing LLM inference systems on Kubernetes, with expertise in TensorRT-LLM and Triton Inference Server. Responsibilities include managing production-grade LLM pipelines and ensuring operational reliability. This position is part of a team committed to diversity and equal opportunity. #J-18808-Ljbffr

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the LLMOps Platform Engineer for GPU AI Inference in Jersey City, NJ vacancy

LLMOps Platform Engineer - GPU AI on Kubernetes
...technology consulting firm seeks an AI Operations Platform Consultant with extensive... ...in deploying and managing GPU-accelerated AI platforms. The role includes leading LLMOps processes, optimizing production... ...utilizing tools like Triton Inference Server and TensorRT-LLM....
Suggested
ETHEREUM TECHNOLOGIES LLC
Jersey City, NJ
3 days ago
LLMOps Platform Engineer: GPU AI on Kubernetes
...technology firm based in Jersey City, NJ, seeks an AI Operations Platform Consultant to lead the deployment and management of large-scale GPU-accelerated AI systems. The role demands... ...on the operational reliability of AI inference services. Ideal candidates will have...
Suggested
Quantum Technologies USA
Jersey City, NJ
4 days ago
Senior AI Ops Platform Engineer (LLMOps)
...firm in Jersey City seeks an AI Operations Platform Consultant with extensive... ...deploying and managing large-scale GPU-accelerated AI platforms.... ...the deployment of LLM inference systems using Kubernetes, optimizing... ..., and leading MLOps/LLMOps pipelines. The ideal candidate...
Suggested
Robotics Technologies LLC
Jersey City, NJ
4 days ago
Remote Platform Engineer - GPU Cloud & Kubernetes
...Alumni Ventures is hiring for a Platform Engineering role in New York City, focused on developing an ultrafast AI inference platform. This position involves interesting challenges... ...-level systems development, and efficient GPU workload management. Successful candidates...
Suggested
Remote work
Alumni Ventures
New York, NY
4 days ago
GPU Cloud Platform Engineer
...Labs Apply: ****@*****.***.ai About Yotta Labs Yotta Labs... ...for AI training and inference on a wide spectrum of hardware... ...commodity to high-end GPUs. Our platform supports major large... ...Overview We are seeking a GPU Cloud Platform Engineer to join our core infrastructure...
Suggested
Full time
Remote work
Flexible hours
Yotta Labs
New York, NY
2 days ago
Remote GPU Cloud Platform Engineer: Scale AI Compute
...A pioneering AI infrastructure company is seeking a GPU Cloud Platform Engineer to design and operate large-scale GPU clusters. This remote position aims to ensure high availability and performance of containerized AI workloads across cloud environments. The ideal candidate...
Remote work
Yotta Labs
New York, NY
2 days ago
AI Inference Platform Engineer Remote
$170k - $210k
...Utilidata is seeking an experienced AI Infrastructure Engineer responsible for designing and building the end-to-end infrastructure for AI and ML models. This role involves optimizing GPU utilization and ensuring high reliability of AI models deployed across edge, cloud...
Remote work
Utilidata
New York, NY
2 days ago
Senior AI Inference Platform Engineer
$160k - $240k
...Bloomberg L.P. in New York is seeking a Senior Software Engineer for AI Inference to design and build scalable infrastructure for machine learning applications. The ideal candidate will have over 5 years of software engineering experience, expertise in distributed systems...
Bloomberg
New York, NY
3 days ago
Senior Machine Learning Engineer (Inference Platform)
$200k - $250k
...At Wizard AI, we’re building the top-performing... ...power the core of our platform, and we’re seeking an experienced... ...Senior MLOps Engineer to take ownership of how... ...scaling – for a custom-built inference platform powering a... ...latency, availability, GPU utilization, TTFT, ITL...
Remote work
Flexible hours
Wizard
New York, NY
2 days ago
AI Ops Platform Engineer - LLM/Kubernetes Expert
...A technology solutions company in Jersey City seeks an AI Operations Platform Consultant to oversee the deployment and management of GPU-accelerated AI platforms and LLM inference systems. Candidates should have strong expertise in Kubernetes, TensorRT-LLM, and Triton...
Robotics Prcocess Automation, LLC
Jersey City, NJ
3 days ago
Senior Lead Software Engineer- AI Platform engineer
$171k - $260k
...Senior Lead Software Engineer - AI Platform engineer Jersey City, NJ, United States Job Information... ...transformer architecture, ML training, and inference. Experience in solutions design and... ...Foundational understanding of NVIDIA GPU Infrastructure software (e.g., NVIDIA...
Full time
For contractors
Aumni
Jersey City, NJ
3 days ago
Senior Platform Engineer
...About Nomic Nomic builds AI agents and developer... ...primarily in architecture, engineering, and construction,... ...and project files. Our platform combines embedding models... ...pioneered on-device LLM inference with GPT4All in 2023,... ...infrastructure inference services, GPU workloads, model...
Nomic
New York, NY
4 days ago
Senior Platform Engineer
...Senior Platform Engineer Who We Are MOXFIVE is building technologies that leverage AI to streamline response, recovery, and resilience from... ...managing production model inference across hosted providers such... ...Together AI or Fireworks.ai, GPU platforms such as RunPod or Lambda...
Local area
MOXFIVE
New York, NY
9 hours ago
Senior ML Inference Engineer - Platform
$128.7k - $261.3k
...Team The Model Deployment & Inference Solutions team in GM AV deploys... ...: build the ML deployment platform that makes model rollouts fast... ...currently performed manually by engineers. Build the developer... ...Familiarity with the NVIDIA GPU stack at the integration level...
Flexible hours
Shift work
General Motors
New York, NY
2 days ago
Senior Machine Learning Platform Engineer
$155k - $215k
...100,000 clients nationwide. Our ML and AI capabilities are expanding rapidly—powering... ...them. As our first dedicated ML Platform Engineer, you'll define the technical direction... ...production today and are investing in hosted GPU inference to support the next generation of our...
Full time
Work at office
Local area
Charlie Health Engineering, Product & Design
New York, NY
5 days ago
AI Infrastructure / Platform Engineer
$140k - $200k
...investment firm building a proprietary AI and data platform that powers our investment... ...and structured finance. We are engineers and investors working together to... ...model training, retraining, and inference (batch and real-time), including GPU compute provisioning and...
Flexible hours
Anthelion Capital
New York, NY
4 days ago
Senior ML Inference Platform Engineer - Real-Time AV
$128.7k - $261.3k
...General Motors seeks a skilled professional to develop its ML deployment platform within the autonomous vehicle sector. This role involves automating model deployment from training to on-vehicle inference and enhancing developer experience through robust tooling. Candidates...
General Motors
New York, NY
2 days ago
Sr. AI Platform Engineer
...8+ years of experience as a Platform Engineer (Site Reliability / DevOps), with at least 3+ years in AI/ML platform development (MLOps... ...distributed orchestration & inference frameworks. Experience with developing... ...scaling. Understanding of LLMOps patterns — model registry,...
TechWize
New York, NY
3 days ago
Staff Platform Engineer (Remote)
Radimal is a veterinary radiology and AI diagnostics platform delivering 24/7 imaging insights to... ...high-throughput medical imaging, GPU-backed inference, global distribution, and... ..., more predictable, and easier for engineers to build on. Why This Role Exists Radimal...
Remote job
Local area
Radimal
New York, NY
2 days ago
Machine Learning Platform Lead Engineer, Training and Inference
$130.2k - $195.3k
...Overview We are seeking a Senior Lead / Lead ML Platform Engineer to architect and own the technical direction for our Training and Inference infrastructure. This is a high-leverage... ...day, optimizing for both p99 latency and GPU utilization. Operational Excellence:...
Full time
Shift work
Paramount
New York, NY
1 day ago
Senior AI Platform Engineer, Core Cloud Engineering
$110k - $140k
...accessible for enterprises and AI innovators around the world.... ...global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage... ...skilled and experienced AI Platform Engineer to own the strategy and execution... ...‑on experience deploying LLM inference infrastructure and a genuine...
Work at office
Immediate start
Remote work
Flexible hours
Vultr
New York, NY
2 days ago
Platform Engineer - Generative AI
$120k - $160k
...Description Uncountable Engineering is seeking driven software engineers with a focus on Generative AI deployment in software. Uncountable’s software platform is used by scientists in leading... ...infrastructure for fine‑tuning and inference, etc) Uncountable has a...
Uncountable Inc.
New York, NY
4 days ago
Senior Platform Engineer
$170k - $235k
...engaged to conduct a search for a Senior Platform Engineer for a rapidly growing, venture-backed... ...development, data systems, and AI-driven services. This is an opportunity... ...performance across APIs, data pipelines, and AI inference systems Champion security,...
Temporary work
Interim role
Immediate start
Scion Staffing
New York, NY
2 days ago
Software Engineer, Platform
...Beam is an ultrafast AI inference platform. We built a serverless runtime that launches GPU-backed containers in less than 1 second and quickly scales out to thousands... ...to hire someone to help us with Platform Engineering work. We’re working on lots of fun problems: Low...
Work at office
Remote work
Alumni Ventures
New York, NY
4 days ago
Platform Engineer
...This role spans backend product engineering and infrastructure. You'll build backend... ...keeps them running in production. The platform processes millions of clinical... ...as Triomics cloud environments, with GPU infrastructure serving AI extraction models. We need someone who...
Day shift
triomics inc.
New York, NY
9 hours ago
Senior Cloud Engineer - GenAI Platform Engineering
$122k - $200k
...responsible for defining and leading the engineering approach for complex features to... ...layer for the GenAI platform across AWS, Azure, and GCP, enabling... ...Experience supporting AI/ML or GenAI workloads, including: Model inference endpoints and API gateways High-...
Shift work
Day shift
Hobbsnews
New York, NY
9 hours ago
Lead Engineer, Platform Engineering - AI
$133.9k - $154.5k
...Every day we work toward transforming global markets. The AI Platform Engineering Lead drives the AI Platform Operations team, guiding... ...standards, and governance for AI/ML infrastructure, including GPU cluster design, compute resource planning, security controls...
Full time
Contract work
Temporary work
Flexible hours
Intercontinental Exchange
New York, NY
4 days ago
BI Platform Engineer
...A client of Innova Solutions is immediately hiring for a BI Platform Engineer. Position type: Contract Duration: 12 + Months Locations... .... By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from Innova...
Contract work
Temporary work
Work experience placement
Immediate start
Worldwide
Flexible hours
Innova Solutions
Jersey City, NJ
1 day ago
Security Platform Engineer
$195k - $265k
...Security Platform Engineer San Francisco or New York About Pallet Pallet is building AI Agents to transform logistics — a $12 trillion global industry. We've raised... ...to support model training, evaluation, and inference infrastructure Identify systemic risks and...
Full time
Temporary work
Work at office
Local area
Remote work
Flexible hours
Pallet Service Corporation
New York, NY
9 days ago
Platform Engineer - AI Core
$175k - $250k
...Platform Engineer - AI Core We are looking for a platform infrastructure engineer to design, build, and operate cloud-native infrastructure... ...agents, model-serving pipelines, and long-running inference workloads - present unique infrastructure challenges: multi...
Immediate start
Millennium Management Corp
New York, NY
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to LLMOps Platform Engineer for GPU AI Inference. Be the first to apply!