Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Systems & Infrastructure Specialist - Truckee

micro1

Job Title: Systems & Infrastructure Specialist

Any additional information you require for this job can be found in the below text Make sure to read thoroughly, then apply.

Job Type: Contractor

Location: Remote

Job Summary:

Join our customer's team as a Systems & Infrastructure Specialist for a high-intensity, expert-level project focused on training and optimizing AI models within intricate, containerized environments. In this terminal-intensive role, you'll apply a systems-first mindset to solve complex infrastructure challenges in real time. This one-time project offers significant opportunities for extension or transition into future phases for those who demonstrate elite technical execution.

Key Responsibilities:

• Navigate, troubleshoot, and recover dynamic infrastructure and long-running processes in real-time using command-line tools.

• Master and manage highly containerized environments, including orchestrating Dockerized sandboxes and CI/CD workflows.

• Build, maintain, and optimize systems for AI model training and high-throughput compute environments.

• Respond swiftly to system errors, executing dynamic mid-operation replanning and recovery.

• Collaborate with engineering and AI teams to ensure seamless integration, reliability, and performance.

• Document system architectures, incident responses, and recovery protocols with meticulous clarity.

• Contribute expertise to evolving project needs, adapting to new technologies and scaling strategies as required.

Required Skills and Qualifications:

• Demonstrated expert proficiency working in terminal environments for system builds, server administration, and infrastructure management.

• Advanced problem-solving skills for multi-step troubleshooting, filesystem navigation, and process management within containerized settings.

• Hands-on experience with Python, Bash, JavaScript/TypeScript, Go, Rust, and/or C/C++.

• Deep familiarity with build systems, package managers, databases, web servers, ML frameworks, version control, and cryptography tools.

• Proven ability to execute dynamic infrastructure recovery and optimize long-running processes under pressure.

• Strong written and verbal communication skills, with a passion for precise technical documentation.

• Systems multilingualism: versatility across operating systems, languages, and emerging DevOps tools.

Preferred Qualifications:

• Prior experience in high-compute environments for AI/ML workloads.

• Background in Site Reliability Engineering or DevOps roles focused on mission-critical infrastructure. xywuqvp

• Familiarity with advanced container orchestration and distributed system design.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Systems & Infrastructure Specialist - Truckee in Truckee, CA vacancy
  • Tahoe Forest Health System is seeking a skilled network engineer responsible for developing and deploying network systems to ensure performance...  ...and Palo Alto Networks technologies. The position is based in Truckee, California, offering opportunities for professional growth in... 
    Suggested

    Tahoe Forest Health System

    Truckee, CA
    4 days ago
  • Truckee is seeking a Design Engineer responsible for creating infrastructure construction documents for public and private projects. You will use AutoCAD and Civil 3D software...  ...to design grading, drainage, and utility systems while ensuring compliance with project... 
    Suggested

    Truckee

    Truckee, CA
    4 days ago
  • Truckee is looking for a Civil/Project Engineer to join their team in Tahoe City, California. The role involves managing public works infrastructure projects, including transportation systems and water management. Candidates must have a Bachelor's degree in Civil Engineering... 
    Suggested
    Flexible hours

    Truckee

    Tahoe City, CA
    2 days ago
  •  ...A technology innovation platform located in Truckee, California, is seeking a DevOps Engineer Intern. This unpaid internship offers...  ...gain hands-on experience in building and maintaining DevOps infrastructure for an AI-powered platform. Responsibilities include designing... 
    Suggested
    Internship
    Immediate start
    Remote work

    Bridge IT

    Truckee, CA
    8 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Systems & Infrastructure Specialist - Truckee. Be the first to apply!