AI Compute Platform Lead - Scale GPU Infra & Kubernetes
Roblox
Roblox Corporation is looking for a Senior Product Manager, Compute Platform, in San Mateo, CA. You will set the strategy for next-generation AI infrastructure and lead the execution of critical projects across the organization. The ideal candidate will have over 7 years of product management experience and deep knowledge of Kubernetes and GPU architecture. This role also requires collaboration with multiple teams to ensure reliability and innovation in Compute infrastructure.
$345.04k - $399.42k
...seeking a Principal Software Engineer specializing in GPU Compute to drive GPU strategy and ensure reliable AI workloads. The role requires deep expertise in GPU... ...management, work on infrastructure efficiency, and lead cross-functional initiatives to enhance performance... · Platform
$345.04k - $399.42k
...building the tools and platform that empower our... ...technical challenges at scale, and helping to create... ...Software Engineer on the Compute team, you will be the... ...technical anchor for Roblox's GPU and AI accelerator... ...team, partnering across Kubernetes, Machine Bootstrap, Networking... · Platform · Full time · Work experience placement · H1b · Work at office · Local area · Visa sponsorship · Monday to Friday
- ...scalable SaaS systems. This role involves leading a team, architecting robust systems,... ...ideal candidate has proven expertise in Kubernetes, cloud infrastructure, and strong... ...California to innovate and enhance our industry-leading Physics AI platform... · Platform
- ...About Obvio AI Each year, more than 40,000 people in... ...gracefully at high throughput. Scale the inference fleet. Build the compute layer that parallelizes... ...autoscaling strategy for GPU-bound workloads on ECS.... ...to build a world-class ML platform organization. Why Obvio... · Platform · Local area
- ...G&A As Genesis continues to scale the industry’s most advanced molecular AI platform, GEMS, our relationships with leading AI organizations as well as our high-performance compute capacity serve as critical engines... ..., including next‑generation GPU / TPU chip architectures,... · Platform · Full time · Contract work · Flexible hours
- ...About zaimler AI agents can't... ...agentic era: a platform that... ...with precision at scale. Imagine knowledge... ...Truera), a Data Infra veteran and former... ...Azure/GCP, using Kubernetes, Terraform, and... ...and optimize compute systems for distributed... .... GPU Orchestration:... · Platform · H1b · Visa sponsorship · Shift work
$176.75k - $252.5k
Lead Systems & Data Architect. Locations: Belmont... ..., Contact Center and AI-powered adjacencies. We... ...enabled technology and platforms meet or exceed the... ...ML and LLM workloads at scale. This is a highly visible... ...Azure, or GCP, including compute, storage, networking,... · Platform · Full time · Local area · Flexible hours
- ...fine tuning of both the compute hardware architecture... ...specifications. As a GPU performance software... ...of future compute platforms & resource allocation.... ...analyze performance at scale in CI/vehicle, and establish... ...intelligence (AI) tools to support parts... · Platform · Temporary work · Relocation package
- ...Strategic Partnership Manager to cultivate crucial strategic relationships within the AI ecosystem in Redwood City, California. The role focuses on AI infrastructure, cloud platforms, and energy, driving partnerships to enhance GridCARE’s market footprint. Candidates should... · Platform
$345.04k - $399.42k
...building the tools and platform that empower our... ...challenges at scale, and helping to create... ...Software Engineer leading Fleet Management,... ...all of Roblox's compute capacity end to end... ...of every Roblox Kubernetes cluster, and governs... ...backend services, AI, and edge... · Platform · Full time · Work experience placement · H1b · Work at office · Local area · Visa sponsorship · Monday to Friday
$130k - $230k
...infrastructure and LLM platform end-to-end from... ...operators building an AI platform to make... ...Infrastructure Engineers to lead the buildout of our... ...areas in LLM/ML Infra and IoT infra are... ...where applicable, GPU/CPU instance... ...OpenSearch, or pgvector at scale. Hands-on with... · Platform · Permanent employment · Full time · Remote work · 3 days per week
- ...Engineer. This role involves designing scalable ML training platforms, optimizing high-performance computing systems, and ensuring robust job scheduling and... ...model tuning and managing cloud environments. Join us to shape the future of AI-driven robotics... · Platform
$345.04k - $399.42k
...building the tools and platform that empower our community... ...challenges at scale, and helping to create... ...experiences for everyone. The AI Platform team is building... ..., Data team, you will lead the development of the... ...Qualifications A BS, MS, or PhD in Computer Science, or a related... · Platform · Full time · Work experience placement · Work at office · Local area · Monday to Friday
- Hammerhead AI in Redwood City is seeking a Product Manager for their orchestration platform. The role entails owning the product vision, translating technical requirements, and driving execution across teams. Ideal candidates have 5+ years of experience in product management... · Platform
$224k - $336k
...Stripe is seeking a Staff Engineer to lead the technical direction for the ML Platform. In this key role, you will influence strategy and architecture while collaborating with technical teams to enhance ML capabilities. The ideal candidate has over 10 years of software... · Platform
- ...What to Expect The ML & Robotics Infra team builds the foundational systems... ...Engineer on ML & Robotics Infra focused on GPU and accelerated compute, you'll own how every accelerated... ...model inference on embedded GPU platforms (e.g., Jetson) Experience with observability... · Platform
- ...About zaimler AI agents can't reason over... ...agentic era: a platform that automatically... ...with precision at scale. Imagine knowledge... ..., Truera), a Data Infra veteran and former... ...Ray, PostGres, and Kubernetes. In this role, you... ...Ray, or distributed compute runtimes... · Platform · H1b · Visa sponsorship · Shift work
$159k - $207k
...Responsibilities Scientific Computing & HPC Platform Engineering: Lead the architecture, build‑... ...optimization of on‑premise GPU clusters, hybrid cloud HPC... ..., structural biology, and AI model training. Implement... ...burst strategies for elastic scaling of peak HPC demand and ML... · Platform · Local area
- ...shape the next frontier of AI-driven robotics!... ...Software Engineer, Data Infra you are the architect of... ...that are critical to our scaling. If you enjoy solving... ...using Python, GCP/AWS, and Kubernetes) for the ingestion,... ...or "human-in-the-loop" platforms. Technical Stack: Proficiency... · Platform
$175k - $220k
...the future of generative AI infrastructure. Our platform delivers the highest-... ...the stack, from low-level GPU kernels to large-scale distributed systems. A key... ..., memory usage, and compute efficiency Profile system... ...tools (e.g., Kubernetes) Background in ML systems... · Platform
$226.19k - $292.71k
...The Director of Global AI Capability- Medical Affairs... ...teams. This role leads without a direct team.... ...Compliance. Your ability to scale solutions across MA is... ...for new AI tools and platforms, running structured... ...or advanced degree in Computer Science, Data Science,... · Platform · Full time · For contractors · Work at office · Local area · 3 days per week
$280.54k - $330.95k
...Product Manager, Compute Platform San Mateo, CA, United... ...challenges at scale, and helping to create... .... About the Role: AI models reshaping... ...products that turn raw GPU hosts into... ...spanning Managed Kubernetes (Roblox Kubernetes... ...build on top easily. Lead cross-functionally... · Platform · H1b · Work at office · Local area · Visa sponsorship · Monday to Friday
- ...next generation of large-scale AI systems. This role... ...across large distributed GPU environments Build... ...with distributed computing and large-scale infrastructure... ..., Cloud Computing, Kubernetes, Python, C++, Open... ...Learning Infrastructure, AI Platform Engineering, Systems... · Platform · Visa sponsorship · Relocation package · Flexible hours
$180k - $220k
...future of generative AI infrastructure. Our platform delivers the highest... ...including Kubernetes clusters, multi-cloud... ...and securing large-scale Kubernetes platforms... ..., Oracle Cloud, and GPU as service cloud providers... ...management, and confidential computing. Experience... · Platform
$200k - $350k
...design, optimize, and scale the systems that... ...frameworks (Kubernetes, Ray, SLURM) for distributed... ...BS/MS/PhD in Computer Science,... ...performance computing and GPU programming (CUDA)... ...and cloud computing platforms (AWS/GCP/Azure).... ...diffusion models and leading AI researchers... · Platform · Immediate start · Flexible hours
- ...United Cerebral Palsy of Georgia is seeking a Senior Applied AI Manager to lead the strategy and execution of AI science. This senior role... ...performing team and ensure the quality of AI features on our platform. The ideal candidate will have 5+ years in ML research with... · Platform
- ...brand excitement. With 5+ years of B2C social media experience in a high-growth environment, you will create engaging content and track performance across various platforms. Replit offers a full-time role with competitive benefits and flexible time off... · Platform · Full time · Flexible hours
- ...using Docker, Helm, and Kubernetes, including advanced pod... ...and collaboration with platform teams. 7+ years of... ...Excellence in the Age of AI As an... ...pioneering "Agentic AI @ Scale" work, cementing our position... ...delivery of industry-leading solutions and our deep,... · Platform
- ...A cutting-edge AI infrastructure company in San Mateo is seeking a Founding Cloud Infrastructure Engineer... ...building cloud infrastructure to power semantic platforms, involving strong hands-on experience with Kubernetes, Terraform, and distributed systems. The ideal candidate... · Platform
- ...Foster City seeks a skilled Copilot Architect to design and develop AI-powered components. The role involves working closely with... ...automation solutions using Microsoft Azure services and the Power Platform. The ideal candidate will have 5+ years of experience in software... · Platform