Senior Staff Network Engineer, Operations

$225k - $275k

Crusoe

Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack, from electrons to tokens, to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that — with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved — people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.

If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About this Role

Crusoe Cloud is seeking a Senior Staff Network Operations Engineer to own production reliability across our global network, including edge, backbone, data center fabric, and GPU cluster interconnects. You will drive incident response, root cause analysis, and the operational excellence initiatives that keep our hyperscale AI infrastructure healthy at scale.

This is a senior production ownership role, not architecture, not pre-sales, not purely automation. You will set operational standards, define SLIs and SLOs, mentor Staff and Senior engineers, and serve as the senior escalation point during high-severity events. This is the role that keeps the network up.

What You'll Be Working On

  • Own Production Reliability: Serve as the senior technical owner for uptime of Crusoe's global edge, backbone, data center, and GPU cluster network, directly affecting the availability of AI workloads running on hundreds of thousands of GPUs.

  • Lead Incident Response: Own end-to-end response for high-severity network events, including rapid mitigation, stakeholder communication, and postmortem documentation that prevents recurrence.

  • Drive Root Cause Analysis: Lead RCAs for production incidents, identify systemic issues, author remediation plans, and track them to closure.

  • Define SLIs and SLOs: Partner with Architecture and Site Reliability to define network reliability metrics and service level objectives, backed by real-time dashboards and alerting.

  • Set Operational Standards: Author and maintain runbooks, escalation playbooks, and SOPs used by the broader operations team.

  • Improve Observability: Drive continuous improvement of Crusoe's network monitoring stack including streaming telemetry, SNMP, NetFlow, and tools such as Kentik, Grafana, Prometheus, and ThousandEyes.

  • Build Operational Automation: Write Python-based auto-remediation tooling that reduces toil and accelerates mean time to resolution for known failure modes.

  • Mentor and Multiply: Provide technical guidance to Staff and Senior engineers. Drive post-incident learning and build a culture of operational excellence across the team.

What You'll Bring to the Team

  • 12+ years of production network engineering experience with a demonstrated focus on large-scale operations, incident response, and reliability in hyperscale or internet-scale environments.

  • Observability and Monitoring: Hands-on experience with streaming telemetry, SNMP, NetFlow, sFlow, and tools such as Kentik, Grafana, Prometheus, ThousandEyes, and Arbor.

  • GPU Cluster and RDMA Networking: Hands-on experience operating RDMA/RoCE (v1 and v2) lossless fabrics for GPU and HPC workloads, including PFC, ECN, and DCQCN tuning. Required at this level.

  • Demonstrated Technical Leadership: Proven track record owning production reliability at scale, leading RCAs that drove systemic change, and setting operational standards the broader org executes against.

  • Hyperscale Operational Depth: Comfort operating 10K+ device fleets across multi-region environments with 24/7 on-call responsibility. You have been the senior escalation point during critical network events.

  • Protocol Fluency: Expert hands-on knowledge of BGP, EVPN-VXLAN, IS-IS, OSPF, MPLS, QoS, and TCP/IP across production DC fabric environments at scale.

  • Hardware Platform Depth: Expert knowledge of Arista (EOS), Juniper (Junos), and NVIDIA/Mellanox platforms in leaf-spine Clos architectures across multi-vendor environments.

  • Operational Automation: Proficiency in Python for auto-remediation scripts, diagnostic tooling, and operational workflows that reduce toil and accelerate incident resolution.

  • SLI and SLO Ownership: Experience defining and owning network reliability metrics and service level objectives in partnership with engineering and product leadership.

  • Education: Bachelor's degree in Computer Science, Electrical Engineering, or a related field, or equivalent practical experience in hyperscale or internet-scale environments.

Benefits:

  • Competitive compensation

  • Restricted Stock Units

  • Paid time off & paid holidays

  • Comprehensive health, dental & vision insurance

  • Employer contributions to HSA

  • Paid parental leave

  • Paid life insurance, short-term and long-term disability

  • Professional development & tuition reimbursement

  • Mental health & wellness support

  • Commuter benefits (parking & transit)

  • Cell phone stipend

  • 401(k) Retirement plan with company match up to 4% of salary

  • Volunteer time off

Compensation:

Compensation will be paid in the range of $225,000 - $275,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

Vacancy posted 2 days ago