Systems & Infrastructure Specialist - Carlisle-Rockledge
micro1
Job Title: Systems & Infrastructure Specialist
Any additional information you require for this job can be found in the below text Make sure to read thoroughly, then apply.Job Type: Contractor
Location: Remote
Job Summary:
Join our customer's team as a Systems & Infrastructure Specialist for a high-intensity, expert-level project focused on training and optimizing AI models within intricate, containerized environments. In this terminal-intensive role, you'll apply a systems-first mindset to solve complex infrastructure challenges in real time. This one-time project offers significant opportunities for extension or transition into future phases for those who demonstrate elite technical execution.
Key Responsibilities:
• Navigate, troubleshoot, and recover dynamic infrastructure and long-running processes in real-time using command-line tools.
• Master and manage highly containerized environments, including orchestrating Dockerized sandboxes and CI/CD workflows.
• Build, maintain, and optimize systems for AI model training and high-throughput compute environments.
• Respond swiftly to system errors, executing dynamic mid-operation replanning and recovery.
• Collaborate with engineering and AI teams to ensure seamless integration, reliability, and performance.
• Document system architectures, incident responses, and recovery protocols with meticulous clarity.
• Contribute expertise to evolving project needs, adapting to new technologies and scaling strategies as required.
Required Skills and Qualifications:
• Demonstrated expert proficiency working in terminal environments for system builds, server administration, and infrastructure management.
• Advanced problem-solving skills for multi-step troubleshooting, filesystem navigation, and process management within containerized settings.
• Hands-on experience with Python, Bash, JavaScript/TypeScript, Go, Rust, and/or C/C++.
• Deep familiarity with build systems, package managers, databases, web servers, ML frameworks, version control, and cryptography tools.
• Proven ability to execute dynamic infrastructure recovery and optimize long-running processes under pressure.
• Strong written and verbal communication skills, with a passion for precise technical documentation.
• Systems multilingualism: versatility across operating systems, languages, and emerging DevOps tools.
Preferred Qualifications:
• Prior experience in high-compute environments for AI/ML workloads.
• Background in Site Reliability Engineering or DevOps roles focused on mission-critical infrastructure. xywuqvp
• Familiarity with advanced container orchestration and distributed system design.
$133.56k - $200.34k
...love to have you join us. Description Reusable launch systems are the key to seamlessly connecting Earth and space.... ...Earth and space. A highly operable, reliable, and robust network infrastructure is an integral part of a fully reusable launch system. As...SuggestedPermanent employmentFull timeRemote work$108.4k - $203.4k
...moves missions and the government forward! Accenture is seeking a Cloud Systems Engineer specializing in AWS GovCloud who will be responsible for designing and implementing AWS infrastructure architecture to meet mission requirements. This person will join our team and...SuggestedLive inWork at officeLocal area$108.4k - $203.4k
...Cloud Systems Engineer (AWS GovCloud) At Accenture Federal Services, nothing matters more than helping the US federal government... ...who will be responsible for designing and implementing AWS infrastructure architecture to meet mission requirements. This person will join...SuggestedLocal area$122k - $167k
...major aspect of Terran R, from stages and payloads to ground systems, launch, landing, and refurbishment. The Cape is the only place... ...is where it happens. About the Role: As a Senior Infrastructure Engineer, you connect various internal technical teams to define...SuggestedFor contractorsFor subcontractorLocal areaRemote workShift work- ...operating, maintaining, and sustaining an AWS cloud operational system. Essential Functions: Monitor and implement updates from AWS... ...Experience with configuration management (Ansible, Chef, Puppet) and Infrastructure as Code tools (Terraform, CDK, OpenTofu, Cloud Formation)....SuggestedLocal area
- ...cloud environments with focus on containerized workloads, databases, and object storage. Develop scalable, highly-available enterprise systems, ideally for Cloud. Ability to obtain and maintain an FAA public trust clearance. Qualifications: Bachelor's Degree in Computer...Local area
- ...the tools necessary to support our large and growing network infrastructure. This employee will be a member of the IT Network Engineering... ...experience in network automation. Experience with Linux operating system Automation skills in Python, Go, shell, bash, and/or other...Permanent employmentWeekend work
- ...resilient telecommunications networks and information management systems. The team develops critical communication capabilities for... ...based solutions involving Cloud computing, Containers, VPC, Infrastructure as Code (IaC), and Security Groups. 3 years designing a scalable...Local areaImmediate start
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Systems & Infrastructure Specialist - Carlisle-Rockledge. Be the first to apply!

