
Senior Staff Network Operations Engineer - Hyperscale AI Reliability

Crusoe

Crusoe is seeking a Senior Staff Network Operations Engineer to ensure the reliability of their global network infrastructure. This role focuses on operational excellence, driving incident responses, and mentoring staff engineers. The ideal candidate should have over 12 years of experience in production network engineering, extensive knowledge of network protocols, and a strong ability in operational automation. Join Crusoe to enhance AI strategies and be part of a high-performing team.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Senior Staff Network Operations Engineer - Hyperscale AI Reliability in San Francisco, CA vacancy
  • $150k - $250k

     ...things they had to do. Powerful AI will be the biggest lever...  ...build data centers, and operate them - with teams spanning...  ...Role Fluidstack is seeking a Network Engineer, Reliability & Observability to serve as...  ...have experience building hyperscale platforms, demonstrating a... 
    Senior
    Local area

    Fluidstack

    San Francisco, CA
    3 days ago
  • $245k - $295k

     ...vertically integrated AI infrastructure...  ...ground up, we own and operate each layer of the...  ...Crusoe is seeking a Senior Staff Network Automation Engineer to own how our network...  ...and Infrastructure Reliability. You will partner...  ...as code in hyperscale or internet-scale environments... 
    Senior
    Temporary work

    Crusoe Energy Systems LLC

    San Francisco, CA
    2 days ago
  • Fluidstack is seeking a Network Engineer, Reliability & Observability to ensure the reliability of AI networks through robust data collection and metrics reporting. This...  ...experience, with a strong background in operational troubleshooting and software development. A competitive... 
    Senior

    Fluidstack

    San Francisco, CA
    3 days ago
  • Epoch Biodesign is looking for a Senior Staff Network Operations Engineer to ensure production reliability across its global network in San Francisco. This role drives incident...  ...operational standards for Crusoe's extensive AI infrastructure, requiring strong technical... 
    Senior

    Epoch Biodesign

    San Francisco, CA
    4 days ago
  • $261k - $326k

     ...technology company specializing in AI infrastructure is seeking a Principal Engineer to enhance reliability and scalability of cloud systems....  ...engineers, and ensuring operational excellence. Candidates should have strong networking expertise and systems fundamentals... 
    Senior

    Crusoe

    San Francisco, CA
    3 days ago
  • $225k - $275k

     ...vertically integrated AI infrastructure company...  ...ground up, we own and operate each layer of the stack...  ...Cloud is seeking a Senior Staff Network Operations Engineer to own production reliability across our global network...  ...that keep our hyperscale AI infrastructure healthy... 
    Senior
    Temporary work

    Crusoe

    San Francisco, CA
    10 days ago
  • $225k - $275k

     ...vertically integrated AI infrastructure...  ...ground up, we own and operate each layer of the...  ...Cloud is seeking a Senior Staff Network Deployment Engineer to serve as the technical...  ...Ops, and Site Reliability leadership. You're...  ...built and validated at hyperscale pace. Architect... 
    Senior
    Temporary work
    Remote work

    Crusoe

    San Francisco, CA
    22 days ago
  •  ...Capital is looking for a Production Support Engineer in San Francisco. You'll play a key role in ensuring the reliability of the Agentforce Supply Chain platform and work...  ...candidate has over 5 years of experience in operations and scaling, with a focus on cloud platforms... 
    Senior

    B Capital

    San Francisco, CA
    13 hours ago
  •  ...Alembic is the pioneering Causal AI platform. We help the world'...  ...under real-world scale, reliability, and security demands — and we're looking for an engineer who wants to own the...  ...on" role. You'll design and operate the global network and reliability layer behind... 
    Senior
    Full time

    Alembic

    San Francisco, CA
    6 hours ago
  • A cutting-edge AI startup in San Francisco is seeking a Senior Infrastructure Engineer to build platforms for AI agents. Your role will involve creating systems that other engineers rely on, ensuring reliability and fast deployment. You'll work with technologies like Python... 
    Senior

    Giga

    San Francisco, CA
    1 day ago
  • Algora Public Benefit Corporation is looking for an AI Cloud Infra Engineer to join their team in San Francisco. You will ensure the reliability of backend systems and work closely with engineers to plan for future growth. The ideal candidate has strong cloud infrastructure... 
    Senior

    Algora Public Benefit Corporation

    San Francisco, CA
    4 days ago
  • $151.3k - $271.15k

    Senior Manager - Platform Engineering and Operations. Locations: San Francisco...  ..., Intelligent Operations & Platform...  ...infrastructure, data centers, networking, developer...  ...accelerating automation, AI-assisted operations...  ..., improve reliability and scalability, enhance... 
    Senior

    Autodesk, Inc.

    San Francisco, CA
    3 days ago
  •  ...technology firm based in San Francisco is seeking a DevOps Engineer to enhance the reliability of their production systems. You will collaborate with...  ...Join us in our mission to revolutionize hardware design through innovative AI solutions.
    Senior

    Flux Enterprise

    San Francisco, CA
    3 days ago
  • A tech company focused on AI is seeking a Site Reliability Engineer to ensure the reliability and performance of its GPU marketplace. This role involves maintaining service level objectives, managing capacity, and implementing secure systems. The ideal candidate has strong... 
    Senior

    Hyperbolic Labs

    San Francisco, CA
    13 hours ago
  • 53 Stations is seeking a DevOps Engineer to enhance the systems powering Flux's platform. You’ll tackle operations from billing to onboarding while ensuring high system reliability and performance. With a focus on collaboration and ownership, you will develop internal... 
    Senior

    53 Stations

    San Francisco, CA
    4 days ago
  • A leading AI research company in San Francisco is seeking a software engineer for its Fleet High Performance Computing team. In this role, you'll ensure the reliability and uptime of the compute fleet, working with automation systems and monitoring tools. Ideal candidates... 
    Senior

    OpenAI

    San Francisco, CA
    4 days ago
  • OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure. The role emphasizes building and operating production systems while collaborating with product engineers to ensure platform scalability... 
    Senior

    OpenArt AI

    San Francisco, CA
    2 days ago
  • $200k - $250k

     ...combines modern web tooling with AI-powered workflows. Our stack...  .... We’re hiring a senior owner of stability and infrastructure...  ...infrastructure to ensure the platform is reliable, fast, and resilient as we...  ...execution). Reliability operating system: Own observability quality... 
    Senior
    Permanent employment

    Vizcom

    San Francisco, CA
    3 days ago
  •  ...identity security, delivering an AI-powered platform that...  ...cloud-native systems. As a Staff Platform Engineer, you will play a critical role...  ...role. You will own reliability for major platform domains,...  ...and leverage to deploy and operate their applications Architect... 
    Senior

    Saviynt

    San Francisco, CA
    3 days ago
  • $190k - $250k

     ...Senior Network Engineer San Francisco About the Role As a Senior Network...  ...networking principles, operational discipline, and advanced automation...  ...for availability, reliability, and scalability. You have...  ...a plus Understanding of AI training workloads and the... 
    Senior
    Full time

    Together AI

    San Francisco, CA
    3 days ago
  • $250k - $320k

    Gimlet Labs, Inc. is seeking a Network Engineer to design and build network infrastructure for AI workloads at scale. This role involves ensuring robust and reliable networking for production systems across distributed environments, focusing on performance and efficiency... 
    Senior

    Gimlet Labs, Inc.

    San Francisco, CA
    2 days ago
  • $325k

     ...mission is to create reliable, interpretable, and steerable AI systems. We want AI...  ...researchers, engineers, policy experts, and...  ...the SDK through our network, API layers, serving...  .... Have experience operating large-scale model serving...  ..., we expect all staff to be in one of our... 
    Senior
    Visa sponsorship

    Menlo Ventures

    San Francisco, CA
    13 hours ago
  • $137k - $215k

     ...unprecedented speed and accuracy. Our AI-enabled platform turns...  ...and disconnected data into operational intelligence — instantly...  ...We are looking to add a Network Engineer to expand our small but exceptional...  ...they are performant and reliable * Have familiarity with network... 
    Senior
    Local area

    Peregrine Technologies

    San Francisco, CA
    2 days ago
  • $230k - $342k

    OpenAI is seeking a Software Engineer for its Core Network Engineering team in San Francisco. This role involves designing and operating networking systems for large-scale AI training, focusing on improving performance and reliability. Candidates should have experience... 
    Senior

    OpenAI

    San Francisco, CA
    1 day ago
  • Overview Senior Platform & Reliability Engineer OpenArt is an AI Storytelling and Visual Creation Platform used by millions worldwide. We’re building the next...  ...Looking For Core Requirements 5+ years building and operating production systems where reliability and scaling... 
    Senior
    Remote work
    Worldwide
    Visa sponsorship

    OpenArt AI

    San Francisco, CA
    2 days ago
  • A pioneering AI infrastructure company is looking for a Senior Staff Software Engineer to lead initiatives in cloud software. This role requires over 10 years in software engineering with expertise in systems engineering and Kubernetes. Key responsibilities include setting... 
    Senior

    Crusoe Energy Systems LLC

    San Francisco, CA
    1 day ago
  • A leading tech company in San Francisco is seeking a Senior Staff Software Engineer who specializes in hypervisor virtualization. You will be responsible...  ...for optimizing virtualization technologies tailored for an AI cloud infrastructure. Proven expertise in hypervisor... 
    Senior
    Full time

    Crusoe Energy Systems LLC

    San Francisco, CA
    2 days ago
  • $232k - $319k

     ...Every Identity, from AI to Human Identity is...  ...builders and owners who operate with speed and urgency...  ...with great people and reliable, cost-effective, and...  ...teams focused on Edge networking, K8s platform, CI/CD,...  ...architects and product engineering Build a world-class... 
    Senior
    Permanent employment
    Local area
    Worldwide
    Flexible hours

    Okta, Inc.

    San Francisco, CA
    3 days ago
  • $320k

     ...Staff + Sr. Software Engineer, Cloud Inference San Francisco, CA About...  ...mission is to create reliable, interpretable, and steerable AI systems. We want AI to...  ..., and day-to-day operations. Our engineers are...  ...with different hardware, networking stacks, and operational... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...ROLE Software Engineers at Bright Machines...  ...the manufacturing operations for some of the biggest...  ....   As a Senior Software Engineer,...  ...Make the system more reliable, observable, and...  ...across process and network boundaries,...  ...next-generation, AI-enabled manufacturer... 
    Senior
    Work at office
    Flexible hours

    Bright Machines

    San Francisco, CA
    a month ago
