Principal Software Engineer, GPU Compute
$345.04k - $399.42k
Roblox
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
As a Principal Software Engineer on the Compute team, you will be the technical anchor for Roblox's GPU and AI accelerator capabilities. This is a role for a battle-tested GPU expert, focused on the machine management layer and above: how GPU hosts are made production-ready, kept healthy, and turned into reliable compute for the workloads that depend on them. You will own the hard problems that show up only at scale, from driver and firmware management to GPU health, reliability, and performance across a rapidly growing fleet of accelerators spanning Roblox data centers and cloud environments. You will set the technical direction for GPU compute and up-level the entire organization's GPU expertise.
You will:
- Serve as the GPU technical leader for the Compute team, partnering across Kubernetes, Machine Bootstrap, Networking, and Cloud to drive GPU strategy end to end.
- Own the GPU host lifecycle above raw fleet management: driver, firmware, and CUDA stack management, GPU health and telemetry, and remediation of GPU-specific failures (XID errors, ECC, thermal, NVLink and fabric faults).
- Architect how GPU capacity is exposed to compute platforms, including scheduling, isolation, and integration with Kubernetes for GPU and AI workloads.
- Drive GPU reliability and performance at fleet scale, defining the detection, diagnosis, and automated repair of unhealthy accelerators before they impact production.
- Evaluate and onboard new GPU and AI accelerator platforms, networking topologies (NVLink, InfiniBand, RoCE), and multi-node training and inference patterns.
- Establish the standards, tooling, and APIs that let other engineering teams consume GPU compute safely and efficiently, reducing toil and raising the bar for the org.
You have:
- 10+ years of experience building and operating large-scale distributed systems and infrastructure.
- Deep, hands-on GPU expertise at the machine management layer and above: GPU host provisioning, driver and firmware lifecycle, GPU health and reliability, and the realities of running accelerators in production.
- A track record as an expert for compute, not just fleet management, with the scars to prove you have scaled GPU or accelerator infrastructure that other teams depend on.
- Strong proficiency in Go or other well-structured programming languages.
- Experience operating GPU and AI workloads in production, including familiarity with CUDA, GPU scheduling, and high-performance networking (NVLink, InfiniBand, RoCE).
- Familiarity with Kubernetes for GPU workloads and with bare-metal concepts (firmware, BMC/IPMI/Redfish, OS imaging) is a strong plus.
- A history of being the anchor expert that an organization relies on for its hardest GPU and compute problems, and the leadership to up-level the engineers around you.
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.
Annual Salary Range: $345,040 - $399,420 USD
Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations to candidates with qualifying disabilities or religious beliefs during the recruiting process.
For US-based roles only, please note the Company may not be able to employ candidates for this role who have United States work authorization related to certain U.S. visa categories, or support future H-1B sponsorship at this time.


