Senior Staff Network Reliability Engineer
Epoch Biodesign
Epoch Biodesign is looking for a Senior Staff Network Operations Engineer to ensure production reliability across its global network in San Francisco. This role drives incident response and sets operational standards for Crusoe's extensive AI infrastructure, requiring strong technical leadership and incident management skills. Applicants should have at least 12 years of experience in network engineering and possess expertise in observability tools and protocols. The position offers a competitive compensation package including stock options and comprehensive benefits. #J-18808-Ljbffr Epoch Biodesign
- Fluidstack is seeking a Network Engineer, Reliability & Observability to ensure the reliability of AI networks through robust data collection and metrics reporting. This role involves developing processes and systems while collaborating with cross-functional teams. The...Senior
$225k - $275k
Crusoe Energy Systems LLC in San Francisco is looking for a Senior Staff Network Operations Engineer to ensure production reliability across its global network. In this role, you will lead incident response and define key operational standards. Ideal candidates will bring...Senior$150k - $250k
...hire people who care deeply about this problem space. If that is you, please apply! About the Role Fluidstack is seeking a Network Engineer, Reliability & Observability to serve as a reliability engineer championing and building process, data collections, and reliability...SeniorLocal area- ...infrastructure that has to perform under real-world scale, reliability, and security demands — and we're looking for an engineer who wants to own the foundation it runs on. This... ...on" role. You'll design and operate the global network and reliability layer behind one of the world's...SeniorFull time
$261k - $326k
...specializing in AI infrastructure is seeking a Principal Engineer to enhance reliability and scalability of cloud systems. This role demands over... ...operational excellence. Candidates should have strong networking expertise and systems fundamentals, especially in high-scale...Senior- Epoch Biodesign in San Francisco is seeking a Senior Staff Cloud Support Engineer to lead technical escalations and improve cloud infrastructure. You will mentor engineers and influence architectural decisions while ensuring high availability for AI workloads. The ideal...Senior
- A healthcare technology company seeks a Senior Technical Support Engineer to manage and resolve technical issues. You will play a critical role in supporting clients, ensuring seamless operations and efficient issue resolution. Responsibilities include managing the lifecycle...Senior
- A tech company focused on AI is seeking a Site Reliability Engineer to ensure the reliability and performance of its GPU marketplace. This role involves maintaining service level objectives, managing capacity, and implementing secure systems. The ideal candidate has strong...Senior
- A leading AI research company in San Francisco is seeking a software engineer for its Fleet High Performance Computing team. In this role, you'll ensure the reliability and uptime of the compute fleet, working with automation systems and monitoring tools. Ideal candidates...Senior
$200k - $250k
A leading visual creation platform in San Francisco is seeking a Senior Owner of Stability and Infrastructure. This hands-on technical leadership role demands expertise in service reliability to ensure the platform's performance as it scales. Responsibilities include setting...Senior- Drata is seeking a Senior Site Reliability Engineer in San Francisco. In this role, you will engage in reliability architecture for product teams, lead production readiness reviews, and build automation around monitoring and alerting. The ideal candidate has at least 6...Senior
- OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure. The role emphasizes building and operating production systems while collaborating with product engineers to ensure platform scalability...Senior
- AngelList Venture in San Francisco is seeking a Senior Infrastructure Engineer to build and optimize platform infrastructure that supports billions... ...enhance developer productivity through automation and reliability practices. The ideal candidate has a solid background in...SeniorWork at office
- An innovative R&D company in San Francisco is seeking a Site Reliability Engineer to join its Platform Engineering team. This position focuses on ensuring the reliability and performance of an AI-powered code review platform. The ideal candidate will have 6-8 years of...Senior
$163k - $203k
GoTo Meeting is looking for a Senior Site Reliability Engineer in San Francisco. You will be responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This role requires expertise in Kubernetes, cloud platforms (preferably GCP),...Senior- ...runs on complex, distributed, cloud-native systems. As a Staff Platform Engineer, you will play a critical role in ensuring these systems remain... ...engineering and technical leadership role. You will own reliability for major platform domains, design scalable solutions on...Senior
$200k - $250k
...PostgreSQL, Redis, BullMQ queues, and Kubernetes-based production infrastructure. We’re hiring a senior owner of stability and infrastructure to ensure the platform is reliable, fast, and resilient as we scale. Role Mission Own service reliability end-to-end: prevent...SeniorPermanent employment- Airwallex- is seeking a Senior Site Reliability Engineer in San Francisco, California, to work with product teams to build and maintain robust cloud infrastructure. In this role, you will lead critical infrastructure projects, ensuring the reliability and performance of...Senior
- ...experience , 5+ years of experience in Site Reliability Engineering, DevOps, or a similar role focused on... ...(AWS, GCP, Azure), including compute, networking, storage, and database services ,... ...job involves As a Member of Technical Staff, Cluster Management at Fireworks AI, you...Senior
- A pioneering AI infrastructure company is looking for a Senior Staff Software Engineer to lead initiatives in cloud software. This role requires over 10 years in software engineering with expertise in systems engineering and Kubernetes. Key responsibilities include setting...Senior
- Overview Senior Platform & Reliability Engineer OpenArt is an AI Storytelling and Visual Creation Platform used by millions worldwide. We’re building the next generation of creative tools powered by cutting-edge AI, enabling anyone to create videos, visuals, characters...SeniorRemote workWorldwideVisa sponsorship
- A leading tech company in San Francisco is seeking a Senior Staff Software Engineer who specializes in hypervisor virtualization. You will be responsible for optimizing virtualization technologies tailored for an AI cloud infrastructure. Proven expertise in hypervisor...SeniorFull time
$151.5k - $252.5k
A leading technology firm is seeking a Senior Site Reliability Engineer to join their Data Cloud engineering team in San Francisco. The role requires expertise in Azure infrastructure and SaaS applications, focusing on building reliable, scalable systems. The ideal candidate...Senior$232k - $319k
...scale the service with great people and reliable, cost-effective, and efficient... ...oversee multiple teams focused on Edge networking, K8s platform, CI/CD, Observability, automation... ...partnership with architects and product engineering Build a world-class observability...SeniorPermanent employmentLocal areaWorldwideFlexible hours- ...Job Title: Reliability Test Technician Location: San Francisco, CA (Onsite) Pay Rate: *65/hr W2 Duration: 2 months+ About the... ...ensure an organized laboratory environment. Collaborate with engineering teams to improve product reliability and test methodologies....Senior
$150k - $180k
...isn’t it. The Role As we continue to develop and deploy cutting-edge autonomous technologies, we are seeking a Senior Reliability Engineer (REL) to lead efforts in ensuring the long-term performance, durability, and robustness of critical hardware systems. This...SeniorFull timeImmediate startWorldwideFlexible hoursNight shift- Hudson Manpower is seeking a Mechanical Engineer - Offshore Reliability for a role involving the improvement of offshore mechanical equipment reliability and performance. This position requires a Bachelor's Degree in Mechanical Engineering and a minimum of 12 years of experience...Senior
- A leading AI research organization in San Francisco is seeking a cross-stack engineer to ensure reliability in next-generation AI systems. This hands-on position requires extensive experience in reliability modeling and DFX architecture to enhance the durability and performance...Senior
- Fluidstack is seeking a Network Engineer in San Francisco, California to oversee the health and operation of our extensive network. This role involves building active debugging tools, developing monitoring frameworks, and implementing automation for seamless network repair...
- scribehow.com is seeking a Senior Database Reliability Engineer based in San Francisco (hybrid model). You will own the reliability, performance, and scalability of our data tier and work with a growing engineering team. Your expertise will ensure smooth operations across...SeniorRemote job
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Staff Network Reliability Engineer. Be the first to apply!
- software engineer staff San Francisco, CA
- staff devops engineer San Francisco, CA
- assistant engineer San Francisco, CA
- assistant engineering manager San Francisco, CA
- staff design engineer San Francisco, CA
- project engineer assistant project manager San Francisco, CA
- technology administrator San Francisco, CA
- staff data engineer San Francisco, CA
- assistant chief engineer San Francisco, CA
- senior staff systems engineer San Francisco, CA



