Systems & Infrastructure Specialist - Yaphank
micro1
Job Title: Systems & Infrastructure Specialist
Job Type: Contractor
Location: Remote
Job Summary:
Join our customer's team as a Systems & Infrastructure Specialist for a high-intensity, expert-level project focused on training and optimizing AI models within intricate, containerized environments. In this terminal-intensive role, you'll apply a systems-first mindset to solve complex infrastructure challenges in real time. This one-time project offers significant opportunities for extension or transition into future phases for those who demonstrate elite technical execution.
Key Responsibilities:
• Navigate, troubleshoot, and recover dynamic infrastructure and long-running processes in real time using command-line tools.
• Master and manage highly containerized environments, including orchestrating Dockerized sandboxes and CI/CD workflows.
• Build, maintain, and optimize systems for AI model training and high-throughput compute environments.
• Respond swiftly to system errors, executing dynamic mid-operation replanning and recovery.
• Collaborate with engineering and AI teams to ensure seamless integration, reliability, and performance.
• Document system architectures, incident responses, and recovery protocols with meticulous clarity.
• Contribute expertise to evolving project needs, adapting to new technologies and scaling strategies as required.
Required Skills and Qualifications:
• Demonstrated expert proficiency working in terminal environments for system builds, server administration, and infrastructure management.
• Advanced problem-solving skills for multi-step troubleshooting, filesystem navigation, and process management within containerized settings.
• Hands-on experience with Python, Bash, JavaScript/TypeScript, Go, Rust, and/or C/C++.
• Deep familiarity with build systems, package managers, databases, web servers, ML frameworks, version control, and cryptography tools.
• Proven ability to execute dynamic infrastructure recovery and optimize long-running processes under pressure.
• Strong written and verbal communication skills, with a passion for precise technical documentation.
• Systems multilingualism: versatility across operating systems, languages, and emerging DevOps tools.
Preferred Qualifications:
• Prior experience in high-compute environments for AI/ML workloads.
• Background in Site Reliability Engineering or DevOps roles focused on mission-critical infrastructure.
• Familiarity with advanced container orchestration and distributed system design.

