Infrastructure Software Engineer, Fleet & Automation
$150k - $215kNscale
Infrastructure Software Engineer, Fleet & Automation US . About Nscale Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility. We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future. Overview As an Infrastructure Software Engineer for Fleet & Automation, you will be a critical member of the AI Infrastructure Operations team, responsible for ensuring the acceptance, performance, and scalability of our cutting-edge AI and High-Performance Computing (HPC) environments. Leveraging software engineering principles, you will focus on building and maintaining the control plane, tooling, and automation that supports Fleet Operations, Network Operations, and Observability functions. Your work will directly translate into higher system availability and reduced operational costs. Key Responsibilities Perform technical architecture, roadmap and implementation for workflow automation systems, driving architecture decisions that balance automation complexity, reliability, and maintainability. Identify and resolve performance and scalability issues. Establish technology and product direction in collaboration with other tech leads, managers, and senior leadership. Own end-to-end delivery of device provisioning, validation, testing, and remediation workflows at scale. Design and build workflow orchestration systems for hardware lifecycle management, including GPU nodes and network switches. Partner with Infrastructure, Platform, and SRE teams to translate operational needs into robust, scalable automation. Establish engineering standards for reliability, observability, and operational excellence across all services. Help set up engineering best practices in collaboration with the broader engineering team. Build production-grade Python systems for hardware lifecycle automation, leveraging AI tools to accelerate delivery. Assess impact to team software stack from new hardware product programs and explore AI driven process improvement and automation. Collaborate with cross-functional teams (product, design, operations, infrastructure) to build efficient, interoperable, and maintainable automated systems. Required Qualifications Education: Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Experience: 5+ years relevant experience building large-scale infrastructure applications or similar experience. Programming: Experience in utilizing languages such as C, C++, Java, and scripting languages such as Python for API design and unit testing techniques. Systems Expertise: Deep understanding of Linux operating systems, networking fundamentals (TCP/IP, BGP), and familiarity with configuration management tools (e.g., Ansible, Terraform). Distributed Systems: Experience building, running and debugging large-scale infrastructure, stateful and stateless services for distributed systems or networks, and experience with compute technologies, storage, or hardware architecture. Experience integrating with infrastructure tooling such as: DCIMs, NetBox, OpenStack, bare metal APIs (MAAS, Ironic, IPMI). Preferred Qualifications Master’s degree or PhD in Engineering, Computer Science, or a related technical field. Experience designing, analyzing and improving efficiency, scalability, and performance of various system resources. Direct experience with AI/HPC infrastructure , including NVIDIA GPUs, InfiniBand or high-speed Ethernet fabrics, and related management software (e.g., NCCL, SLURM). Experience with advanced observability and monitoring systems (Prometheus, Grafana, OpenTelemetry) for complex, high-cardinality telemetry data. Familiarity with cloud-native technologies (Kubernetes, Docker) and infrastructure-as-code principles. Demonstrated ability to integrate AI tools to optimize/redesign workflows and drive measurable impact (e.g., efficiency gains, quality improvements). Familiarity with SLOs/metrics measurement, logs/telemetry/metrics integration with tools for enhanced operator experience. What We Can Offer You At Nscale, you'll find a collaborative, supportive, and innovative environment where your contributions spark real impact. We're building something extraordinary, and we want you at the core. Highly competitive package (base + equity) with reviews every 12 months. Join the fastest-growing tech startup, your chance to push boundaries, collaborate with brilliant minds, and make your mark on cutting-edge AI. Expect a dynamic progression plan tailored to your ambitions. Grow by trying new things, leading, challenging the status quo, and owning your impact, always with our full support. We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds. If there’s anything we can do to accommodate your specific situation, please let us know. The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role. The range below reflects the base salary for the position. Actual compensation may vary based on job-related factors such as skill set, experience, education, and location. In addition to base salary, this role may be eligible for bonus, equity, and/or commission programs. Nscale may offer a competitive benefits package including medical, dental, vision, flexible paid time off, parental leave, and retirement plan participation. Salary Range
$150,000 - $215,000 USD
For information on how Nscale handles candidate personal data, please see our Employee & Candidate Privacy Notice:Here. #J-18808-Ljbffr Nscale- A leading technology solutions provider is seeking an IT Systems Engineer to support various teams across the organization. This role involves developing workflow automation, designing internal tooling, and configuring company hardware. The ideal candidate should have...Suggested
- ...Stewart Title Guaranty Company in Houston is seeking a Systems Engineer to design and maintain IT systems supporting business... ...security, and enhancing performance through collaboration and automation. The ideal candidate will have extensive experience in system...Suggested
- Nscale is hiring for a support role focused on cloud infrastructure in Houston, Texas. You will ensure the reliability and efficiency of data center operations, collaborating with engineering teams and handling various technical support tasks. The position requires 2-4...Suggested
- Invesco Real Estate in Houston is looking for a Senior Cloud Engineer to enhance their AWS cloud platform. This role involves... ...product strategies, collaborating with stakeholders, and automating infrastructure using Terraform. A minimum of 5 years in AWS and strong problem...SuggestedFlexible hours
- Chevron Corporation seeks a software engineer to support the design and development of its cloud-native FinOps platform based in Houston,... ...backend services, and front-end applications, with a focus on automation and scalability. Ideal candidates will have programming...Suggested
- Hines in Houston, TX is seeking an Azure Infrastructure & Virtual Desktop Engineer to design and optimize their Azure environments. The ideal candidate should have seven or more years of experience in cloud engineering, focusing on Azure infrastructure and Virtual Desktop...
- ...Houston, TX, to design, implement, and maintain IT systems ensuring high performance and security. This role involves leading infrastructure projects and optimizing cloud solutions while collaborating across teams. Ideal candidates should have a Bachelor's degree or relevant...
- ...Platform Owner, you will lead the platform engineering "product," directly influencing... ...service bootstrapping), and frameworks for automating software testing (test automation frameworks).... .... With over 71,000 colleagues and a fleet of over 13,000 vehicles, Sysco operates...FleetWork experience placementLocal areaWorldwide3 days per week
- A leading technology solutions provider in Houston is seeking professionals for various roles, including Account Executives and Customer Success Managers. This company fosters a collaborative and innovative environment, encouraging employees to make an impact while providing...
- Gulf States Financial Services, Inc. seeks a DevSecOps Engineer in Houston, Texas, to enhance secure delivery and manage IAM across cloud environments. The role requires strong experience in DevOps, IAM design, Terraform, and Kubernetes operations. You'll implement security...
- Linux Engineer / DevOps Engineer (Python Automation / On-Prem Linux is a MUST) Location: Houston, Texas OR Boston, Massachusetts - on-site 4 days a week... ...companies, high‑technology companies, platform/infrastructure engineering teams, or compute‑intensive environments...Full timeRemote work
- Sysco Northeast Rdc is seeking a Cloud Engineer to design and manage their cloud infrastructure on Google Cloud Platform. This hybrid role requires 3-4 days of on-site presence per week and involves collaboration with cross-functional teams to assess and implement cloud...
$66k - $171.6k
...Roles and Responsibilities The Principal Automation Engineer - Data and Integration Platforms is a... ..., data centers, and associated site infrastructure. Collaborate with Lilly’s Quality... ...administration, including hardware design, software integration, and system procedures....Full timeLocal areaFlexible hours$66k - $171.6k
...operations. Responsibilities The Principal Automation Engineer - Data and Integration Platforms is... ..., data centers, and associated site infrastructure. Collaborate with Lilly’s Quality... ...including hardware design, software integration, and system procedures....Full timeH1bLocal areaVisa sponsorshipWork visaFlexible hours- Energy Transfer Partners, L.P. is seeking a skilled F5 Network Engineer to ensure the security and performance of enterprise-level applications and network infrastructure. The successful candidate will have hands-on experience with F5 platforms like BIG-IP, and will design...
$70k - $140k
The Huntington National Bank is seeking a Network Engineer 3 responsible for designing tailored solutions to meet client requests and providing high-level engineering support for the corporate network. Ideal candidates will have over 7 years of experience in Network Engineering...- ## Cloud Software EngineerApplylocations: Houston, TXtime type: Full timeposted on: Posted 4 Days Agojob requisition id: JR1509We’re looking for a **Cloud Engineer** to join our growing cloud infrastructure team in Houston. If you’ve got a solid technical foundation, a...For contractors3 days per week
- Nscale, located in Houston, Texas, is hiring an Infrastructure Software Engineer to join their AI Infrastructure Operations team. The role focuses... ...including the design and implementation of automation systems. The ideal candidate will have a bachelor’s degree...Fleet
- ...Overview: The Junior SCADA Engineer supports the implementation,... ...energy and microgrid fleet. Key Responsibilities:... ...secure remote communications. Infrastructure: Experience with Windows Server... ...diagnose hardware, software, and communication link issues...FleetRemote workFlexible hours
$90k
...Network Engineer – ExterNetworks, Houston, Texas Description: Rack & Stack Cisco/Adtran... ...and monitors all installed systems and infrastructure for ... Full-time Netflix IT Help Desk... ...sustainability for commercial vehicle fleets, by developing innovative hardware and...FleetFull timeContract workRemote work- ...Cloud Engineer We’re looking for a Cloud Engineer to join our growing cloud infrastructure team in Houston. This role offers the chance to build a career in a fast‑paced environment and to contribute to cloud projects. Responsibilities Support the design, build, and maintenance...For contractors3 days per week
- ...are seeking a Cloud Back-End Engineer to join our group. In this... ...application development and cloud infrastructure. You will be responsible for... ...or Engineer with focus on software with 4+ years of experience... ...DevOps Build & Release Automation Maven, Gradle, MSBuild, Shell...
- Senior Software Engineer, Salesforce (Global Payment Network) Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast‑paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you’ll...InternshipH1bLocal area
- Greenberg, Traurig, PA seeks a highly skilled Automation Engineer to join their Innovation Team, focusing on designing and maintaining automation solutions across various U.S. locations including Houston. The ideal candidate must possess strong Python development skills...Full time
$23.65 - $37.5 per hour
...Chevron Corporation is seeking a Software Engineer in Houston, TX. The role involves hands-on coding to create functional software solutions, employing CI/CD tooling for deployment, and collaborating with global teams. The ideal candidate is enrolled in a relevant degree...Hourly payInternship- ...Senior Kubernetes / VKS Platform Engineer Overview We are looking... ...and support AI/ML workload infrastructure. Responsibilities... ...Kubernetes platform with Aria Automation self-service catalog Enforce... ...across distributed cluster fleets Required Skills ~10+ years...FleetRemote work
- A leading technology and solutions provider in Houston seeks a Software Engineer to design and implement automated solutions. The role involves gathering requirements, programming with Python and .NET, and using AWS services to manage system integrations. Candidates must...
- Invesco Real Estate is looking for an Advanced Software Engineer in Houston, Texas. In this role, you will design, develop, and support enterprise applications and APIs in a modern, cloud-based environment. The ideal candidate has a minimum of 3 years of experience in application...
- Halliburton Energy Services is looking for experienced software engineers to join Landmark in Houston, Texas. You will be responsible for the full development cycle, working on software critical for high-stakes decisions in the energy industry. Strong proficiency in programming...
- ...SUMMARY**Delivers high quality software and technical solutions to... ...in the development of automated tests and environment management... ...configuration and containerization, infrastructure as code, and monitoring*... ...71,000 colleagues and a fleet of over 13,000 vehicles, Sysco...FleetWork experience placementLocal areaWorldwideFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Infrastructure Software Engineer, Fleet & Automation. Be the first to apply!
- entry level infrastructure engineer Houston, TX
- security infrastructure engineer Houston, TX
- infrastructure engineer Houston, TX
- lead infrastructure engineer Houston, TX
- data infrastructure engineer Houston, TX
- infrastructure engineering manager Houston, TX
- senior infrastructure engineer Houston, TX
- infrastructure automation engineer Houston, TX
- remote infrastructure engineer Houston, TX
- infrastructure developer Houston, TX

