GPU Platform Infrastructure Engineer
Optimal
Job Title: GPU Platform Infrastructure Engineer Job Summary Support the GM ARC RTD team by building and maintaining the foundational GPU cluster platform infrastructure supporting shared AI/ML, simulation, and validation workloads. This role focuses on GPU access governance, resource allocation, scheduling policies, observability, and operational support for multi-tenant GPU environments including RTX 6000, A100, B200, and future systems. Required Experience 3+ years of experience in Platform Engineering, Infrastructure Engineering, DevOps, or related field Bachelor's or Master's degree in Systems Engineering, Computer Science, Computer Engineering, or related discipline Responsibilities Manage GPU cluster access provisioning, onboarding, permissions, and lifecycle management Design and maintain GPU resource allocation policies, quotas, namespace isolation, and scheduling configurations Develop GPU utilization dashboards, reporting, monitoring, and capacity tracking solutions Create reusable job submission templates and onboarding documentation for ML, Isaac Sim simulation, and validation workloads Support platform governance, operational continuity, infrastructure scalability, and CI/CD integration Design and develop GUI-based tools for streamlined Docker development workflows Collaborate with infrastructure, AI/ML, and engineering teams to support shared GPU operations Required Skills Experience with Linux, Kubernetes, Docker, and GPU infrastructure environments Knowledge of workload scheduling, resource management, and multi-tenant platform operations Experience supporting AI/ML, simulation, or GPU-intensive engineering workloads Experience with monitoring, observability, and reporting tools Strong scripting and automation skills using Python, Bash, or similar languages Familiarity with NVIDIA GPU platforms, containerized compute environments, and infrastructure automation tools Experience with CI/CD pipelines and cloud platforms such as AWS, Azure, or GCP is a plus Experience with GUI development frameworks is a plus Strong troubleshooting, documentation, and operational support skills #J-18808-Ljbffr Optimal
- Job Title: ML Platform Engineer - GPU Infrastructure Support team by designing, implementing, and maintaining the automation and ML workload enablement layer of the GPU cluster platform. This role focuses on optimizing GPU compute environments for AI/ML training and Isaac...Suggested
- Optimal is seeking a GPU Platform Infrastructure Engineer to support the GM ARC RTD team by building and maintaining the foundational GPU cluster platform infrastructure. This role focuses on GPU access governance, resource allocation, scheduling policies, and operational...Suggested
- A pioneering AI infrastructure company is seeking a GPU Cloud Platform Engineer to design and operate large-scale GPU clusters. This remote position aims to ensure high availability and performance of containerized AI workloads across cloud environments. The ideal candidate...SuggestedRemote job
- ...of hardware—from commodity to high-end GPUs. Our platform supports major large language models (LLMs) and offers... ...AI development. ️ Role Overview We are seeking a GPU Cloud Platform Engineer to join our core infrastructure team and help build the next-generation AI compute...SuggestedFull timeRemote workFlexible hours
- A leading tech company in the United States is seeking an experienced Infrastructure GPU Engineer to build and support high-performance cloud infrastructure. This role involves optimizing resource allocation for GPU workloads, ensuring system reliability, and collaborating...SuggestedRemote job
- A tech-focused company is seeking a Senior Infrastructure Platform Engineer to design and maintain robust infrastructure platforms. This role involves automating deployments, monitoring performance, and collaborating with development teams. The ideal candidate should have...Remote job
- Optimal is seeking an ML Platform Engineer focusing on GPU Infrastructure in Kentucky. You will be responsible for optimizing GPU compute environments for AI/ML workloads and integrating these into CI/CD pipelines. This role requires a strong background in ML Platform...
$200k - $250k
...Fluidstack At Fluidstack, we’re building the infrastructure for abundant intelligence. We... ...provider, is looking for a Software Engineer, Infrastructure Platform to build the foundational platforms... ...for rack operations, server/GPU deployment, OS installation, quality...Local area- ...landscape. The Opportunity We're growing our infrastructure team in Denver and looking for an engineer who has gotten their hands dirty with both... ...pipeline maintenance and help onboard teams to existing platform tooling Assist with Kubernetes cluster operations...Full timeInternshipLocal area
- triomics inc. in New York is seeking an experienced infrastructure engineer to build backend services and manage cloud infrastructure. The successful... .... Ideal candidates will have over 3 years' experience in platform engineering, proficiently using technologies like AWS or...
$164.45k - $234.93k
Spotify AB is seeking an experienced engineer to join the Platform Infrastructure team in New York, focusing on building and improving cloud-based developer tooling and infrastructure. Candidates should have over 5 years of experience in backend engineering, strong knowledge...Work from home- Discover exciting DevOps job opportunities and connect with 28,396 DevOps professionals. The Senior Infrastructure Platform Engineer position at Jobicy is an exciting opportunity for tech enthusiasts looking to take their DevOps skills to the next level. This role focuses...Remote work
$207k - $311k
Commerce.com US, Inc. is seeking a Director of Software Engineering for Platform & Infrastructure to shape the technical foundation of our global engineering organization. This role involves defining infrastructure strategy and driving developer productivity at scale. The...- ...DevOps Platform Engineer with our client in the financial industry located in Charlotte, NC and New York, NY. This is a... ...scripting, SQL, work scheduling tools ~ Setting up infrastructure monitoring & reporting for GPU/CPU & memory consumption, inference latency and...Contract workWork experience placementShift work
$300k - $350k
Framework Ventures is seeking a Director of Engineering for Infrastructure & Platform in the United States. This role is pivotal in owning the vision and execution of one of the most sophisticated distributed systems in the blockchain ecosystem. Candidates should have over...Flexible hours- A financial technology company is seeking a Senior Cloud and Platform Engineer to design and operate cloud-native infrastructure for AI development. The ideal candidate has over 8 years in cloud infrastructure and DevOps, with expertise in MLOps practices. You will lead...Remote jobFlexible hours
- Scribd, Inc. is hiring a Senior AI Data Engineer in New York City to lead AI engineering workstreams. This role involves building data infrastructure, supporting platform stakeholders, and mentoring other engineers. Candidates should have over 5 years of data engineering...Flexible hours
$140k - $160k
...re looking for a backend-leaning, Senior, Full Stack Engineer who will build AI-powered platforms, tools, and workflows that create value for our... ...tools, with a strong focus on Python and modern cloud infrastructure. You will be hands-on with integrating large language...Full timeWork at office- ...00s alike. We’re growing fast and just getting started. Come join us for a whale of a ride! Our Infrastructure Engineering team builds and operates the cloud-native platform that powers Docker’s suite of products. We design resilient services, automate where it helps most...Remote workHome officeShift work
$164.45k - $234.93k
...Platform Infrastructure Engineer The Platform team creates the technology that enables Spotify to learn quickly and scale easily, enabling rapid growth in our users and our business around the globe. Spanning many disciplines, we work to make the business work; creating...Work from homeFlexible hours$200k - $250k
...Software Engineer, Infrastructure Platform Fluidstack, a leading cloud provider, is looking for a Software Engineer, Infrastructure Platform to build the foundational platforms that enable our global infrastructure and data center operations. You'll develop comprehensive...Local area- ...Are We are building core financial infrastructure for the fastest-growing crypto derivatives exchange in the world - a platform generating over $1B in annualised revenue... ...Thrives Here This role is built for engineers who are energised by complexity, take pride...Work at office
- ...Senior Platform Engineer Who We Are MOXFIVE is building technologies that leverage AI to streamline... .... You are comfortable owning cloud infrastructure, Kubernetes workloads, CI/CD pipelines... ...such as Together AI or Fireworks.ai, GPU platforms such as RunPod or Lambda...Local area
- ...Nomic is the domain-specific AI platform for the Architecture, Engineering, and Construction (AEC) industry.... ...Senior Platform Engineer to own our infrastructure stack - multi-account AWS, Kubernetes... ...scale - serving infrastructure, GPU workloads, and the performance work...Remote workFlexible hours
$135k - $200k
...Software Engineer - Infrastructure, Foundry Platform Build scalable, secure, and high-performance data infrastructure for enterprise clients Location: New York. Compensation: $135,000 – 200,000 USD / year. About the Role Palantir builds the world's leading software for...Temporary workWork experience placementRelocation package- ...Description: This role spans backend product engineering and infrastructure. You'll build backend services and... ...them running in production. The platform processes millions of clinical documents... ...as Triomics cloud environments, with GPU infrastructure serving AI extraction...Day shift
- ...primarily in architecture, engineering, and construction, extract structured... ..., and project files. Our platform combines embedding models,... ...into our own processes. The infrastructure already reflects that: multi... ...inference services, GPU workloads, model serving, eval...
- ...partner is looking for an Architect - Platform Engineer based in the United States. This is a... ...designing and scaling next-generation infrastructure for GenAI and large language model (LLM... ...foundations that power distributed training, GPU-accelerated computing, and AI model...Remote work
- Nscale is seeking a candidate for a cloud infrastructure support role based in New York, NY. The ideal candidate will join a dynamic team, assisting with day-to-day operational tasks, troubleshooting, and automation of processes while delivering high-impact results. This...
$133.9k - $154.5k
...transforming global markets. The AI Platform Engineering Lead drives the AI Platform Operations... ..., and governance across AI/ML infrastructure, workflow automation, and agentic AI capabilities... ...for AI/ML infrastructure, including GPU cluster design, compute resource...Full timeContract workTemporary workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to GPU Platform Infrastructure Engineer. Be the first to apply!
- platform developer Brooklyn, NY
- senior platform engineer Brooklyn, NY
- platform engineer Brooklyn, NY
- security infrastructure engineer Brooklyn, NY
- infrastructure engineer Brooklyn, NY
- data infrastructure engineer Brooklyn, NY
- senior infrastructure engineer Brooklyn, NY
- remote infrastructure engineer Brooklyn, NY
- infrastructure developer Brooklyn, NY
- director of digital platform Brooklyn, NY

