System Engineer
$200k - $300k
Fluidstack
System Engineer, GPU Fleet
As a System Engineer, GPU Fleet, you will manage, operate, and optimize hyperscale GPU compute infrastructure supporting AI/ML training and inference workloads. You will ensure high availability, performance, and reliability of the GPU server fleet through automation, monitoring, troubleshooting, and collaboration with hardware engineering, platform teams, and datacenter operations.
Focus
- Operate and maintain a large-scale GPU server fleet (H100, B200, GB200) supporting AI/ML workloads; monitor system health, performance, and utilization to maximize uptime and ensure SLA compliance
- Perform hands-on troubleshooting and root cause analysis of complex hardware, firmware, OS, and application issues across GPU clusters; coordinate with vendors and hardware teams to resolve systemic failures
- Develop and maintain automation scripts for provisioning, configuration management, monitoring, and remediation at scale
- Build and improve tooling for GPU health checks, performance diagnostics, driver validation, and automated recovery
- Execute server provisioning, configuration, firmware updates, and OS installation using automation frameworks; manage lifecycle operations including deployment, maintenance, and decommissioning
- Participate in a 24x7 on-call rotation; respond to production incidents and coordinate resolution with cross-functional teams including datacenter operations, network engineering, and application teams
- Lead post-incident reviews, document root causes, and drive continuous improvement initiatives focused on automation, reliability, monitoring, and operational efficiency
Basic Qualifications
- Bachelor's degree in Computer Science, Engineering, or related technical field (or equivalent practical experience)
- 3+ years (System Engineer) or 5+ years (Senior System Engineer) in Linux system administration, datacenter operations, or infrastructure engineering
- Strong Linux/Unix fundamentals including system administration, scripting (Bash, Python), troubleshooting, and performance tuning
- Experience with server hardware architecture and troubleshooting techniques, and an understanding of compute, memory, storage, and networking components
- Experience with automation and configuration management tools (Ansible, Puppet, Chef, Terraform)
- Strong analytical and problem-solving skills with the ability to diagnose complex technical issues under pressure
- Excellent communication and collaboration skills; ability to work effectively with cross-functional teams
Preferred Qualifications
- Experience managing large-scale GPU infrastructure (NVIDIA H100, A100, B200, GB200) in production environments supporting AI/ML workloads
- Deep knowledge of GPU architecture, the CUDA toolkit, GPU drivers, and monitoring tools (nvidia-smi, DCGM)
- Experience with HPC cluster management, job schedulers (Slurm, PBS, LSF), and container orchestration (Kubernetes, Docker)
- Proficiency in out-of-band management (BMC, IPMI, Redfish) and firmware management for server hardware
- Experience with high-performance networking (InfiniBand, RoCE, RDMA) and network troubleshooting in GPU cluster environments
- Familiarity with datacenter operations including rack installations, cabling, power management, and thermal considerations
Salary & Benefits
- Competitive total compensation package (salary + equity).
- Retirement or pension plan, in line with local norms.
- Health, dental, and vision insurance.
- Generous PTO policy, in line with local norms.
The base salary range for this position is $200,000 - $300,000 per year, depending on experience, skills, qualifications, and location. This range represents our good faith estimate of the compensation for this role at the time of posting. Total compensation may also include equity in the form of stock options.
We are committed to pay equity and transparency.
Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Fluidstack will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

