Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

System Engineer

$200k - $300k

Fluidstack

System Engineer, GPU Fleet

As a System Engineer, GPU Fleet, you will manage, operate, and optimize hyperscale GPU compute infrastructure supporting AI/ML training and inference workloads. Ensure high availability, performance, and reliability of GPU server fleet through automation, monitoring, troubleshooting, and collaboration with hardware engineering, platform teams, and datacenter operations.

Focus
  • Operate and maintain large-scale GPU server fleet (H100, B200, GB200) supporting AI/ML workloads; monitor system health, performance, and utilization to maximize uptime and ensure SLA compliance
  • Perform hands-on troubleshooting and root cause analysis of complex hardware, firmware, OS, and application issues across GPU clusters; coordinate with vendors and hardware teams to resolve systemic failures
  • Develop and maintain automation scripts for provisioning, configuration management, monitoring, and remediation at scale.
  • Build and improve tooling for GPU health checks, performance diagnostics, driver validation, and automated recovery
  • Execute server provisioning, configuration, firmware updates, and OS installation using automation frameworks; manage lifecycle operations including deployment, maintenance, and decommissioning
  • Participate in 24x7 on-call rotation; respond to production incidents and coordinate resolution with cross-functional teams including datacenter operations, network engineering, and application teams
  • Lead post-incident reviews, document root causes, and drive continuous improvement initiatives focused on automation, reliability, monitoring, and operational efficiency
Basic Qualifications
  • Bachelor's degree in Computer Science, Engineering, or related technical field (or equivalent practical experience)
  • 3+ years (System Engineer) or 5+ years (Senior System Engineer) in Linux system administration, datacenter operations, or infrastructure engineering
  • Strong Linux/Unix fundamentals including system administration, shell scripting (Bash, Python), troubleshooting, and performance tuning
  • Experience with server hardware architecture, troubleshooting techniques, and understanding of compute, memory, storage, and networking components
  • Experience in automation and configuration management tools (Ansible, Puppet, Chef, Terraform).
  • Strong analytical and problem-solving skills with ability to diagnose complex technical issues under pressure
  • Excellent communication and collaboration skills; ability to work effectively with cross-functional teams
Preferred Qualifications
  • Experience managing large-scale GPU infrastructure (NVIDIA H100, A100, B200, GB200) in production environments supporting AI/ML workloads
  • Deep knowledge of GPU architecture, CUDA toolkit, GPU drivers, monitoring tools (nvidia-smi, DCGM)
  • Experience with HPC cluster management, job schedulers (Slurm, PBS, LSF), and container orchestration (Kubernetes, Docker)
  • Proficiency in out-of-band management protocols (IPMI, Redfish, BMC) and firmware management for server hardware
  • Experience with high-performance networking (InfiniBand, RoCE, RDMA) and network troubleshooting in GPU cluster environments
  • Familiarity with datacenter operations including rack installations, cabling, power management, and thermal considerations
Salary & Benefits
  • Competitive total compensation package (salary + equity).
  • Retirement or pension plan, in line with local norms.
  • Health, dental, and vision insurance.
  • Generous PTO policy, in line with local norms.

The base salary range for this position is $200,000 - $300,000 per year, depending on experience, skills, qualifications, and location. This range represents our good faith estimate of the compensation for this role at the time of posting. Total compensation may also include equity in the form of stock options.

We are committed to pay equity and transparency.

Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Fluidstack will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the System Engineer in Austin, TX vacancy
  •  ...their unique business. About The Role: We are looking for engineers to support webAI's Public Sector initiatives focused on secure,...  ...infrastructure. Ultimately for this team we're looking for a Systems Engineer that knows Rust, or a Systems-ish Engineer that knows... 
    Suggested
    Full time
    Casual work
    Live out
    Work at office
    Local area
    Remote work
    Flexible hours

    webAI Inc

    Austin, TX
    1 day ago
  •  ...System Engineer-IMS Location: Austin, TX Duration: 12 Months Skills: Windows 10 Basics Description: Install hardware and peripheral components like disk drives, printers, keyboards and monitors. Load software packages such as operating systems and office applications... 
    Suggested
    Work at office

    Keylent Inc

    Austin, TX
    18 hours ago
  •  ...understands their unique business. About The Role: We are looking for engineers to support webAI's Public Sector initiatives focused on secure,...  ...resilient AI infrastructure. Ultimately we're looking for a Systems Engineer that knows Rust, or a Systems-ish Engineer that knows... 
    Suggested
    Full time
    Live out
    Work at office

    Navstar

    Austin, TX
    18 hours ago
  •  ...Distributed Systems Engineer Distributed systems engineer position available. Focus on building scalable and reliable distributed systems. Responsibilities include designing, implementing, and maintaining distributed systems architecture. Strong experience in distributed... 
    Suggested

    Adapt Talent

    Austin, TX
    18 hours ago
  •  ...Kwil Distributed Systems Engineer Kwil is a decentralized SQL database, built to enable advanced dApps and scalable composability. Backed by FTX Ventures, Blockchange, AlleyCorp, Amplify Partners, DCG, and other leading VCs, the KwilDB decentralized SQL solution... 
    Suggested
    Immediate start
    Flexible hours

    Kwil

    Austin, TX
    2 days ago
  •  ...BOXX Technologies is Hiring. BOXX Technologies Engineering department is looking for a Systems II - III located in Austin, TX. BOXX offers Benefits (Medical, Dental and Vision plans), 401-K Matching, and Paid Time Off. Position Summary: System... 
    Work experience placement
    Work at office
    Local area
    Worldwide

    BOXX Technologies

    Austin, TX
    2 days ago
  •  ...Company Overview: Allen Control Systems (ACS) is a cutting-edge defense startup founded by two former Navy electrical engineers with a proven track record in robotics and software. We are developing an autonomous gun turret using advanced computer vision and control... 
    Local area

    Allen Control Systems

    Austin, TX
    2 days ago
  •  ...Systems Engineer The Systems Engineer is responsible for managing and maintaining Teza's systems and storage infrastructure. The System engineer will work closely with the development and trading operations teams to ensure the smooth functioning of all systems and... 
    Flexible hours

    Teza Technologies

    Austin, TX
    4 days ago
  • $15 per week

     ...Overhead Cantenary Systems Engineer (OCS) Modern Railway Systems is seeking a specialized OCS professional responsible for the engineering, design development, construction support, estimating and commissioning of OCS and Traction Power Feeder Systems associated with... 
    For contractors
    For subcontractor
    Work at office
    Flexible hours

    Stacy Witbeck

    Austin, TX
    3 days ago
  • $97.01k - $164.91k

     ...Job Description About BAE Systems Our employees work on the world’s most advanced electronics. Spanning air, land, sea, and...  ...and national security services. When you join our engineering group, you will be an integrated team member collaborating with... 
    Full time
    Contract work
    Local area
    Flexible hours

    BAE Systems USA

    Austin, TX
    4 days ago
  •  ...graphics processors, motherboard chip sets, and a variety of components used in consumer electronics goods. Job Title: Systems Engineer 3 Duration: 12 Months Location: Austin, TX Job Type: Temporary assignment Work Type: Hybrid onsite (3 days a week... 
    Temporary work
    3 days per week

    Tekwissen

    Austin, TX
    2 days ago
  •  ...Systems Engineer The Systems Engineer works on on-premise server and cloud related projects, both for new and existing infrastructures. Working closely with the managing consultant, project plans are designed and implemented with a high degree of quality and with... 
    Local area
    Remote work

    Sigma Information Group Inc

    Austin, TX
    3 days ago
  •  ...the world moves earth for construction. Founded by former SpaceX engineers and backed by Bain Capital Ventures, TerraFirma is automating...  ...Overview In this role, you'll take hands-on ownership of various systems spanning hardware and software, and analog and digital domains.... 
    Worldwide
    Relocation
    Weekend work

    Terra Firma

    Austin, TX
    1 day ago
  • $120k - $165k

     ...We Are Applied Materials is a global leader in materials engineering solutions used to produce virtually every new chip and advanced...  ...you may go. Learn more about our benefits. As a Systems Engineer, you'll design, integrate, and optimize complex systems... 
    Full time
    Remote work
    Relocation

    Applied Materials

    Austin, TX
    3 days ago
  •  ...Job Description: JOB DESCRIPTION: The system engineer is the owner of engineering requirement translation, specification and implementation of projects across entire system design for products. Responsible for implementation and design of custom software/... 
    Work at office

    HRM INFO LLC

    Austin, TX
    2 days ago
  •  ...Role: System Engineer Preferred Location: Onsite (Austin) Key responsibilities: 1. Dashboard Development and Maintenance: - Design and implement monitoring dashboards for SAP HANA and SAP NetWeaver using Splunk and Grafana. - Create custom visualizations to... 

    Info Way Solutions

    Austin, TX
    4 days ago
  • $54.4k - $57.99k

     ...traditional call center responsibilities, requiring strong analytical skills, attention to detail, and the ability to work across multiple systems and processes. Maintains end-to-end responsibility for customer’s support needs providing timely, reliable, and courteous... 
    Contract work
    Work at office

    ASM Research, An Accenture Federal Services Company

    Austin, TX
    4 days ago
  •  ...Systems Design Engineer The Role Data Center Platform Engineering Group (DPEG) is looking for an on-site (mandatory) 3rd shift Systems Design Engineer that can work complex issues/problems as they arise in the lab or Data Center. Person will be responsible for going... 
    Work at office
    Night shift

    LanceSoft

    Austin, TX
    1 hour ago
  •  ...What to Expect Tesla is seeking a highly motivated Engineer to develop functional test equipment (such as dynamometers, electrical testers...  ...vendors to develop and deploy new Drive Unit and Actuator test systems. You will assume full ownership of the design, development, and... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Austin, TX
    18 hours ago
  •  ...Applifecycle Systems Engineer - UCCE and UCM Location: RTP, NC / Austin, TX / San Jose, CA Duration: Fulltime Job Description: Skills Desired: Designing, Managing Cisco Unified Contact Center Enterprise technologies (UCCE), Cisco Customer Voice Portal... 
    Full time
    Work experience placement

    JConnect Infotech

    Austin, TX
    18 hours ago
  • $80.31 - $85.31 per hour

     ...us for significant technical transformation in Broker-Dealer Systems and the modernization of our core technology infrastructure. As...  ...experience with, GenAI coding assistants used across day-to-day engineering workflows in the SDLC-implementation, refactoring, unit... 
    Hourly pay
    Contract work
    Temporary work
    Work experience placement

    Randstad

    Austin, TX
    2 days ago
  •  ...Systems Engineer, MAPS Hybrid At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers... 
    Local area

    Cloudflare Inc

    Austin, TX
    4 days ago
  •  ...Systems Engineer Atlas Technica's mission is to shoulder IT management, user support, and cybersecurity for our clients, who are hedge funds and other investment firms. Founded in 2016, we have grown year over year through our uncompromising focus on service. We... 
    Work at office

    Atlas Technica

    Austin, TX
    18 hours ago
  •  ...Position Description & Qualifications Are you a Systems Engineer looking for a place where you can make an impact every day? Serco is the place for you! Join our Defense team supporting our CNIC program in this exciting role based out of Naples, Italy. CNIC... 
    Full time
    Contract work
    Part time
    For contractors
    Local area
    Flexible hours

    Serco

    Austin, TX
    3 days ago
  •  ...Overview Our Software Engineers build proprietary trading systems which directly impact the financial markets. Our software engineering teams leverage technology to solve a variety of difficult problems. Our trading strategies must respond to market events in microseconds... 
    Work at office

    Optiver

    Austin, TX
    18 hours ago
  • $136k - $184k

     ...that will provide low-latency, high-speed broadband connectivity to unserved and underserved communities around the world. As a Systems Engineer, this role is primarily responsible for the design, development and integration of communication payload and customer terminal... 
    Permanent employment
    Local area
    Flexible hours

    Amazon

    Austin, TX
    2 days ago
  •  ...Systems Engineer Sage Integration Holdings, LLC protects the people, facilities, and reputation of enterprise clients by advancing the intelligence and integration of security technology. Innovation at SAGE is foundational—not a department or an afterthought. Our culture... 
    Night shift

    SAGE Fence

    Austin, TX
    4 days ago
  •  ...challenge the status quo" and transform the finance industry together. Join us for significant technical transformation in Broker-Dealer Systems and the modernization of our core technology infrastructure. This role will be a leader in AI focused workflows across the system.... 

    Randstad

    Austin, TX
    4 days ago
  • $68k - $105k

     ...companies. With its comprehensive ITS (Intelligent Transportation Systems) portfolio, Kapsch is actively addressing the challenges of the...  .... We are looking for a motivated Junior Systems Engineer in Austin, TX to support the design, integration, testing, and... 
    Casual work
    Internship
    Work at office
    Local area
    Flexible hours

    Kapsch TrafficCom AG

    Austin, TX
    1 day ago
  •  ...Commvault Systems Engineer (Data Protection / Backup) Employment Type: Full-Time, Experienced CGS is seeking an experienced Commvault Data Protection Engineer with extensive knowledge and experience in designing, developing, configuring, implementing, testing, troubleshooting... 
    Full time
    Flexible hours

    Contact Government Services LLC

    Austin, TX
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to System Engineer. Be the first to apply!