Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Hardware Engineer - GPU & AI Infrastructure

$238.52k - $289.46k

Roblox

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.

At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.

A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

As a member of the Infrastructure Foundation Hardware Engineering team, you will play a key role in enabling our mission to deliver a reliable, high‑performing, and cost‑efficient infrastructure that powers the world’s play. In this specialized role, you will be the technical lead for our GPU and AI accelerator ecosystem. You will be responsible for the full lifecycle of GPU hardware, from initial architectural evaluation and firmware qualification to large‑scale fleet integration and performance tuning. You will ensure that Roblox’s massive‑scale rendering and ML workloads run on the most optimized and stable hardware possible.

You Will
  • Architect & Prototype: Prototype next‑generation GPU‑accelerated hardware platforms, ensuring seamless integration between high‑density compute nodes, high‑speed interconnects (NVLink/PCIe Gen5/6), and system firmware.
  • GPU Optimization: Drive the integration, performance testing, and debugging of GPUs in our fleet, focusing specifically on hardware‑level optimizations, driver tuning, and thermal/power management.
  • Validation & Certification: Develop and execute rigorous evaluation and stress‑testing strategies for GPU‑heavy server platforms to ensure they meet Roblox’s unique demands for real‑time rendering and low‑latency AI inference.
  • Firmware & Systems: Lead firmware qualification (BIOS/BMC) and troubleshooting, implementing automation systems to manage GPU health, firmware updates.
  • Vendor Collaboration: Provide technical guidance and deep‑dive feedback to hardware vendors. Lead critical investigations into component‑level failures, triaging issues across the hardware, driver, and kernel layers.
  • Observability: Build and maintain advanced monitoring stacks (Grafana/Prometheus) to track GPU metrics like HBM utilization, thermal throttling events, and PCIe bandwidth saturation.
You Have
  • Education: BA/BS Degree in Electrical Engineering, Computer Engineering, or related field with equivalent practical experience.
  • GPU Expertise: 5+ years of hardware engineering experience with a specific focus on GPU architecture (NVIDIA HGX/MGX platforms preferred), AI accelerators, or high‑performance compute (HPC) systems.
  • Deep Technical Knowledge: In‑depth understanding of modern data center technologies, including PCIe fabric, NVLink, InfiniBand, and liquid cooling systems for high‑TDP hardware.
  • Testing Skills: Hands‑on experience testing and validating CPU, Memory (HBM/DDR5), Storage (NVMe), and high‑speed networking subsystems in a Linux environment.
  • Programming: Proficiency in Python, Go, or C++ for developing hardware validation tools and automation scripts.
  • Systemic Debugging: Expert‑level skills in debugging complex server issues remotely, with the ability to analyze kernel logs, hardware registers, and bus‑level captures.
You Are
  • A Problem Solver: Decisive and effective at tracking hardware issues from identification through to fleet‑wide resolution.
  • A Communicator: Excellent oral and written communication skills; able to translate complex hardware constraints into actionable insights for software teams.
  • Collaborative: Strong interpersonal skills with the ability to lead cross‑functional projects with Data Center Ops, SRE, and external vendors.
  • Adaptable: Willing to travel occasionally to data centers or vendor sites to oversee hardware deployments or "first‑of‑a‑kind" builds.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job‑related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full‑time employees are also eligible for equity compensation and for benefits as described on this page.

Annual Salary Range

$238,520 — $289,460 USD

Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).

Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations to candidates with qualifying disabilities or religious beliefs during the recruiting process.

#J-18808-Ljbffr
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior Hardware Engineer - GPU & AI Infrastructure in San Mateo, CA vacancy
  • $238.52k - $289.46k

     ...A leading gaming platform in San Mateo, CA is seeking an experienced GPU and AI Hardware Engineer to lead the GPU and AI accelerator ecosystem. In this role, you will handle the complete lifecycle of GPU hardware, focus on performance tuning, and develop rigorous testing... 
    Suggested

    Roblox

    San Mateo, CA
    2 days ago
  •  ...Gravity Engineering Services Pvt Ltd. is seeking a Software Engineer focused on Performance Optimization to enhance the efficiency of advanced AI infrastructures. The role requires optimizing performance from GPU kernels to distributed systems, dealing with high-throughput... 
    Senior

    Gravity Engineering Services Pvt Ltd.

    San Mateo, CA
    6 days ago
  •  ...in California is looking for an experienced Machine Learning Infrastructure Engineer. This role involves designing scalable ML training platforms...  ...in ML model tuning and managing cloud environments. Join us to shape the future of AI-driven robotics. #J-18808-Ljbffr... 
    Senior

    Dyna Robotics

    Redwood City, CA
    2 days ago
  • $345.04k - $399.42k

     ...Corporation is seeking a Principal Software Engineer specializing in GPU Compute to drive GPU strategy and ensure reliable AI workloads. The role requires deep expertise...  ...define GPU capacity management, work on infrastructure efficiency, and lead cross-functional initiatives... 
    Suggested

    Roblox

    San Mateo, CA
    2 days ago
  •  ...About Obvio AI Each year, more than 40,000 people in the...  ...and autoscaling strategy for GPU-bound workloads on ECS. Design...  ...layer. Stand up the infrastructure that loads versioned CV models...  ...pipeline downtime. Set the engineering standard. This is an early hire... 
    Senior
    Local area

    Obvio

    San Carlos, CA
    3 days ago
  • $170k - $236.5k

     ...intelligence, best-in-class hardware and software product development...  .... We leverage breakthrough AI to create the world's most...  ...As a deep learning infrastructure engineer, you will be responsible for...  ...training efficiency Implement GPU kernels for custom architectures... 
    Full time
    Local area
    Relocation package

    Skydio

    San Mateo, CA
    1 day ago
  •  ...Quadric's co-optimized software and hardware is targeted to run neural network...  ...new processor architecture. As a senior member of our chip design team,...  ...Ph.D. in Electrical or Computer Engineering with a minimum of five years of CPU/GPU/ASIC front-end design Proficiency... 
    Senior
    Immediate start

    quadric.io, Inc

    Burlingame, CA
    7 days ago
  • $200k - $300k

     ...Company Overview At Skild AI, we are building the world's first general purpose robotic intelligence that is...  ...innovative projects. Position Overview Skild AI, Inc. seeks a Senior Software Engineer, AI Training & Infrastructure in San Mateo, CA. You will be responsible for... 
    Senior

    Menlo Ventures

    San Mateo, CA
    2 days ago
  • $200k - $275k

     ...intelligence, best-in-class hardware and software product development...  ...regulated environments, our infrastructure must also meet stringent...  ...are seeking an experienced Engineering Manager to grow and lead our...  ...accountability. Comfortable leveraging AI‑assisted development... 
    Senior
    Full time
    Local area
    Relocation package

    Skydio

    San Mateo, CA
    2 days ago
  • $109.2k - $223.4k

     ...Senior Principal Technical Program Manager Oracle Cloud Infrastructure (OCI) is seeking a Senior Principal Technical Program Manager...  ...data center infrastructure, AI-driven operational improvements...  ...delivery Coordinate across engineering, operations, networking, capacity... 
    Senior
    Temporary work
    Worldwide
    Flexible hours

    Oracle

    Redwood City, CA
    1 day ago
  • $170k - $226.25k

     ...artificial intelligence, best-in-class hardware and software product development,...  ...hardware, sensing, and real-time AI software. The complexity of the systems...  ...are seeking a driven and versatile Senior Hardware Test and Reliability Engineer to help drive safety and... 
    Senior
    Full time
    Local area
    Relocation package

    Skydio

    San Mateo, CA
    7 days ago
  • $200k - $250k

     ...About 1X We're an AI and robotics company based in San...  ...functional Firmware or Embedded Engineer to develop and maintain...  ...supports and enables system-level hardware architecture. In this role,...  ...Develop diagnostic and telemetry infrastructure: logging, error counters,... 
    Senior
    Local area

    1X Technologies AS

    San Carlos, CA
    5 days ago
  • $180k - $250k

     ...A tech-driven AI company in Redwood City is seeking an Infrastructure Engineer to develop core infrastructure and support multi-cloud environments. The ideal candidate has experience in large-scale infrastructure, proficiency with tools such as Kubernetes, and a passion... 
    Senior

    Datology

    Redwood City, CA
    3 days ago
  • $272k - $327k

     ...experienced and strategic Senior Manager of Network Engineering to lead the team...  ...position manages our network infrastructure across multiple global sites...  ...carrier circuits, networking hardware, and third-party vendor...  ...artificial intelligence (AI) tools to support parts... 
    Senior
    Temporary work
    Relocation package

    Zoox

    Foster, CA
    3 days ago
  • $196.75k - $243.29k

     ...Roblox Corporation is looking for a Senior Software Engineer focused on Data Infrastructure for Safety in San Mateo, California. The role entails ownership of full-stack features, collaboration with multiple teams, and the design of scalable architecture. Candidates should... 
    Senior
    Work at office
    3 days per week

    Roblox

    San Mateo, CA
    2 days ago
  • $215k - $290k

     ...A leading Voice AI startup in Redwood City is seeking a Founding Senior Software Engineer for Infrastructure. This full-time role involves owning and designing deployment pipelines in cloud and on-prem environments. Candidates should have at least 3 years of experience... 
    Senior
    Full time

    Retell AI

    Redwood City, CA
    2 days ago
  • $196.75k - $243.29k

     ...Senior Software Engineer - Data Infrastructure, Safety Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with...  ...experimentation, automation, detection workflows, and AI-powered text filters. Aligned and partnering with product... 
    Senior
    H1b
    Work at office
    Local area
    Visa sponsorship
    Monday to Friday

    Roblox

    San Mateo, CA
    2 days ago
  • $130k - $280k

     ...with an integrated, privacy-sensitive AI-powered platform that includes...  ...countries. About the Role As a Senior Software Engineer on this team, you will help architect...  ...targeted at making Verkada's infrastructure the most reliable and cost efficient... 
    Senior
    Hourly pay
    Full time
    Work at office
    Work visa
    Flexible hours
    Shift work

    Verkada

    San Mateo, CA
    4 days ago
  •  ...mission to make frontier AI truly open for all. We...  ...We're looking for a Senior Applied AI Manager to own...  ...with a growing team of ML engineers and applied researchers...  ...: Partner with infrastructure and product teams to ensure...  ...infrastructure (Kubernetes, GPU clusters, orchestration... 
    Senior

    United Cerebral Palsy of Georgia

    San Mateo, CA
    2 days ago
  • $160k - $200k

     ...Python. Additionally, familiarity with cloud infrastructure to build training and serving pipelines...  .... Our systems are software-driven, hardware-agnostic, and have already picked over 1...  ...We may use artificial intelligence (AI) tools to support parts of the hiring process... 
    Senior
    Work experience placement

    Dexterity

    Redwood City, CA
    3 days ago
  • $239k - $333k

     ...looking for 3D Machine Learning engineers to simulate sensors (cameras, lidar...  ...in the world and an incredible infrastructure for testing and validating your...  ...for training and testing AV AI, as well as real-time sensor data for hardware-in-the-loop simulation. In this... 
    Senior
    Temporary work
    Relocation package

    Zoox

    Foster, CA
    5 days ago
  •  ...the ML Training and Inference Infrastructure team that enables autonomous...  ..., etc. and our Advanced Hardware Engineering group and have the opportunity...  ...productionization of cutting-edge AI innovation. This team has a...  ...distributed multi-node GPU model training and/or high throughput... 
    Senior
    Temporary work
    Relocation package

    Zoox

    Foster, CA
    2 days ago
  •  ...the ML Training and Inference Infrastructure team that enables autonomous...  ..., etc. and our Advanced Hardware Engineering group and have the opportunity...  ...productionization of cutting-edge AI innovation. This team has a...  ...distributed multi-node GPU model training and/or high throughput... 
    Senior
    Temporary work
    Relocation package

    Zoox

    Foster, CA
    2 days ago
  • $214k - $290k

     ...Senior Software Engineer, ML Core Build and optimize the ML tooling...  ...innovations in ML and AI to make autonomous...  ...as with our Advanced Hardware Engineering group specifying...  ...libraries, and ML infrastructure used by our applied...  ...JAX. Familiarity with GPU‑accelerated inference... 
    Senior
    Temporary work
    Relocation package

    jobs.frontdoordefense.com - Jobboard

    Foster, CA
    2 days ago
  • $137.26k - $193.87k

     ...work every day. Job Description: Job title : Senior Software Engineer, Core Infrastructure The Core Infrastructure team drives developer experience...  ...best practices every team should be following Use AI as a force multiplier in day-to-day engineering work... 
    Senior

    Poshmark

    Redwood City, CA
    2 days ago
  • $238.52k - $289.46k

     ..., we are building large scale ads machine learning infrastructure to deliver effective performance ads to our users,...  ...and more business values to our advertisers. As a Senior Machine Learning Infrastructure Engineer in our Ads ML Infra team, you’ll build scalable,... 
    Senior
    Full time
    Work experience placement
    Work at office
    Local area
    Monday to Friday

    Roblox

    San Mateo, CA
    2 days ago
  • $150k - $200k

     ...busy creating the energy infrastructure that will one day...  ...a key part of Reach's engineering team to develop firmware...  ...product innovation in hardware and software. Qualifications...  ...experience, skills, seniority, and how each...  ...artificial intelligence (AI) tools to support... 
    Senior
    Work at office

    Reach

    Redwood City, CA
    3 days ago
  • $96.8k - $251.6k

     ...Skydance/SDA acceptance, GPU and 5K scale readiness,...  ...OCI is building cloud infrastructure for demanding media, creative, AI, and high‑performance workloads...  ...while improving the engineering systems, operational...  ...planning, and examples of senior‑level ownership in ambiguous... 
    Senior
    Temporary work
    Flexible hours

    Oracle

    Redwood City, CA
    2 days ago
  • $125k - $175k

     ...Senior Firmware Engineer Cala is seeking a Senior Firmware Engineer to join our growing team. The...  ...You will work on the full stack, from hardware support to the application itself, on...  ...optimize code for low-power applications. AI Integration & Process Improvement:... 
    Senior
    Full time
    Visa sponsorship
    Work visa

    Cala Health

    San Mateo, CA
    4 days ago
  • $170k - $258k

     ...artificial intelligence, best-in-class hardware and software product development,...  ...About the team: Skydio’s Cloud infrastructure team is here to ensure that the Skydio...  ...the role About the role: As an Senior Infrastructure Engineer on a semi-new product, you’ll not just... 
    Senior
    Full time
    Local area
    Relocation package

    Booster

    San Mateo, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Hardware Engineer - GPU & AI Infrastructure. Be the first to apply!