Senior Staff Network Operations Engineer - Hyperscale AI Reliability
Crusoe
Crusoe is seeking a Senior Staff Network Operations Engineer to ensure the reliability of their global network infrastructure. This role focuses on operational excellence, driving incident responses, and mentoring staff engineers. The ideal candidate should have over 12 years of experience in production network engineering, extensive knowledge of network protocols, and a strong ability in operational automation. Join Crusoe to enhance AI strategies and be part of a high-performing team. #J-18808-Ljbffr Crusoe
$150k - $250k
...things they had to do. Powerful AI will be the biggest lever... ...build data centers, and operate them - with teams spanning... ...Role Fluidstack is seeking a Network Engineer, Reliability & Observability to serve as... ...have experience building hyperscale platforms, demonstrating a...SeniorLocal area$245k - $295k
...vertically integrated AI infrastructure... ...ground up, we own and operate each layer of the... ...Crusoe is seeking a Senior Staff Network Automation Engineer to own how our network... ...and Infrastructure Reliability. You will partner... ...as code in hyperscale or internet-scale environments...SeniorTemporary work- Fluidstack is seeking a Network Engineer, Reliability & Observability to ensure the reliability of AI networks through robust data collection and metrics reporting. This... ...experience, with a strong background in operational troubleshooting and software development. A competitive...Senior
- Epoch Biodesign is looking for a Senior Staff Network Operations Engineer to ensure production reliability across its global network in San Francisco. This role drives incident... ...operational standards for Crusoe's extensive AI infrastructure, requiring strong technical...Senior
$261k - $326k
...technology company specializing in AI infrastructure is seeking a Principal Engineer to enhance reliability and scalability of cloud systems.... ...engineers, and ensuring operational excellence. Candidates should have strong networking expertise and systems fundamentals...Senior$225k - $275k
...vertically integrated AI infrastructure company... ...ground up, we own and operate each layer of the stack... ...Cloud is seeking a Senior Staff Network Operations Engineer to own production reliability across our global network... ...that keep our hyperscale AI infrastructure healthy...SeniorTemporary work$225k - $275k
...vertically integrated AI infrastructure... ...ground up, we own and operate each layer of the... ...Cloud is seeking a Senior Staff Network Deployment Engineer to serve as the technical... ...Ops, and Site Reliability leadership. You're... ...built and validated at hyperscale pace. Architect...SeniorTemporary workRemote work- ...Capital is looking for a Production Support Engineer in San Francisco. You'll play a key role in ensuring the reliability of the Agentforce Supply Chain platform and work... ...candidate has over 5 years of experience in operations and scaling, with a focus on cloud platforms...Senior
- ...Alembic is the pioneering Causal AI platform. We help the world'... ...under real-world scale, reliability, and security demands — and we're looking for an engineer who wants to own the... ...on" role. You'll design and operate the global network and reliability layer behind...SeniorFull time
- A cutting-edge AI startup in San Francisco is seeking a Senior Infrastructure Engineer to build platforms for AI agents. Your role will involve creating systems that other engineers rely on, ensuring reliability and fast deployment. You'll work with technologies like Python...Senior
- Algora Public Benefit Corporation is looking for an AI Cloud Infra Engineer to join their team in San Francisco. You will ensure the reliability of backend systems and work closely with engineers to plan for future growth. The ideal candidate has strong cloud infrastructure...Senior
$151.3k - $271.15k
## Senior Manager - Platform Engineering and OperationsApplylocations: San Francisco... ..., Intelligent Operations & Platform... ...infrastructure, data centers, networking, developer... ...accelerating automation, AI-assisted operations... ..., improve reliability and scalability, enhance...Senior- ...technology firm based in San Francisco is seeking a DevOps Engineer to enhance the reliability of their production systems. You will collaborate with... ...Join us in our mission to revolutionize hardware design through innovative AI solutions. #J-18808-Ljbffr Flux EnterpriseSenior
- A tech company focused on AI is seeking a Site Reliability Engineer to ensure the reliability and performance of its GPU marketplace. This role involves maintaining service level objectives, managing capacity, and implementing secure systems. The ideal candidate has strong...Senior
- 53 Stations is seeking a DevOps Engineer to enhance the systems powering Flux's platform. You’ll tackle operations from billing to onboarding while ensuring high system reliability and performance. With a focus on collaboration and ownership, you will develop internal...Senior
- A leading AI research company in San Francisco is seeking a software engineer for its Fleet High Performance Computing team. In this role, you'll ensure the reliability and uptime of the compute fleet, working with automation systems and monitoring tools. Ideal candidates...Senior
- OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure. The role emphasizes building and operating production systems while collaborating with product engineers to ensure platform scalability...Senior
$200k - $250k
...combines modern web tooling with AI-powered workflows. Our stack... .... We’re hiring a senior owner of stability and infrastructure... ...infrastructure to ensure the platform is reliable, fast, and resilient as we... ...execution). Reliability operating system: Own observability quality...SeniorPermanent employment- ...identity security, delivering an AI-powered platform that... ...cloud-native systems. As a Staff Platform Engineer, you will play a critical role... ...role. You will own reliability for major platform domains,... ...and leverage to deploy and operate their applications Architect...Senior
$190k - $250k
...Senior Network Engineer San Francisco About the Role As a Senior Network... ...networking principles, operational discipline, and advanced automation... ...for availability, reliability, and scalability. You have... ...a plus Understanding of AI training workloads and the...SeniorFull time$250k - $320k
Gimlet Labs, Inc. is seeking a Network Engineer to design and build network infrastructure for AI workloads at scale. This role involves ensuring robust and reliable networking for production systems across distributed environments, focusing on performance and efficiency...Senior$325k
...mission is to create reliable, interpretable, and steerable AI systems. We want AI... ...researchers, engineers, policy experts, and... ...the SDK through our network, API layers, serving... .... Have experience operating large-scale model serving... ..., we expect all staff to be in one of our...SeniorVisa sponsorship$137k - $215k
...unprecedented speed and accuracy. Our AI-enabled platform turns... ...and disconnected data into operational intelligence — instantly... ...We are looking to add a Network Engineer to expand our small but exceptional... ...they are performant and reliable * Have familiarity with network...SeniorLocal area$230k - $342k
OpenAI is seeking a Software Engineer for its Core Network Engineering team in San Francisco. This role involves designing and operating networking systems for large-scale AI training, focusing on improving performance and reliability. Candidates should have experience...Senior- Overview Senior Platform & Reliability Engineer OpenArt is an AI Storytelling and Visual Creation Platform used by millions worldwide. We’re building the next... ...Looking For Core Requirements 5+ years building and operating production systems where reliability and scaling...SeniorRemote workWorldwideVisa sponsorship
- A pioneering AI infrastructure company is looking for a Senior Staff Software Engineer to lead initiatives in cloud software. This role requires over 10 years in software engineering with expertise in systems engineering and Kubernetes. Key responsibilities include setting...Senior
- A leading tech company in San Francisco is seeking a Senior Staff Software Engineer who specializes in hypervisor virtualization. You will be responsible... ...for optimizing virtualization technologies tailored for an AI cloud infrastructure. Proven expertise in hypervisor...SeniorFull time
$232k - $319k
...Every Identity, from AI to Human Identity is... ...builders and owners who operate with speed and urgency... ...with great people and reliable, cost-effective, and... ...teams focused on Edge networking, K8s platform, CI/CD,... ...architects and product engineering Build a world-class...SeniorPermanent employmentLocal areaWorldwideFlexible hours$320k
...Staff + Sr. Software Engineer, Cloud Inference San Francisco, CA About... ...mission is to create reliable, interpretable, and steerable AI systems. We want AI to... ..., and day-to-day operations. Our engineers are... ...with different hardware, networking stacks, and operational...SeniorWork at officeVisa sponsorshipFlexible hours- ...ROLE Software Engineers at Bright Machines... ...the manufacturing operations for some of the biggest... .... As a Senior Software Engineer,... ...Make the system more reliable, observable, and... ...across process and network boundaries,... ...next-generation, AI-enabled manufacturer...SeniorWork at officeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Staff Network Operations Engineer - Hyperscale AI Reliability. Be the first to apply!
- software engineer staff San Francisco, CA
- staff devops engineer San Francisco, CA
- assistant engineer San Francisco, CA
- assistant engineering manager San Francisco, CA
- staff design engineer San Francisco, CA
- project engineer assistant project manager San Francisco, CA
- technology administrator San Francisco, CA
- staff data engineer San Francisco, CA
- assistant chief engineer San Francisco, CA
- senior staff systems engineer San Francisco, CA


