Staff SRE & Tech Lead: Scale Reliability & Data Infra

Unify

A leading tech company in San Francisco is hiring a Staff SRE Tech Lead to enhance the reliability and scalability of its platform. The role involves overseeing a team of SREs, optimizing backend services for performance, and architecting data infrastructure such as ClickHouse and PostgreSQL to handle terabytes of data. The ideal candidate has more than 8 years of software engineering experience, including team leadership and database expertise. The position offers a high-energy work environment focused on innovation and reliability.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Staff SRE & Tech Lead: Scale Reliability & Data Infra in San Francisco, CA vacancy
  • A cutting-edge AI firm in San Francisco seeks an experienced Infrastructure Tech Lead to oversee the scaling of its platform. You will enhance infrastructure reliability and performance as customer demand grows, working hands-on with systems. Ideal candidates have substantial... 
    Suggested
    Flexible hours

    Lightfield

    San Francisco, CA
    5 days ago
  • $182k - $249k

    Okta, Inc. is seeking an experienced Staff Site Reliability Engineer to join their Infrastructure Platform AGILE SRE team in San Francisco, CA. This role involves resolving infrastructure challenges through strong technical guidance, mentoring, and improvements to monitoring... 
    Suggested

    Okta, Inc.

    San Francisco, CA
    5 days ago
  •  ...machine learning research engineer at Scale AI. The rest of our team comes from companies...  ...with state-of-the-art AI. As our Staff SRE Tech Lead, you'll own the reliability and scalability of our platform as we add terabytes of data monthly and onboard customers with... 
    Data

    Unify

    San Francisco, CA
    1 day ago
  •  ...CloudDevs: Senior Site Reliability Engineer (SRE) CloudDevs works with fast-moving...  ...pleasure in designing for scale and improving how teams...  ...testing and monitor benchmarks. Lead resilience work like...  ...Terraform, and Kubernetes. Deep knowledge of observability tools... 
    Data

    The10minutecareersolution

    San Francisco, CA
    1 day ago
  •  ...cryptocurrency project, patiently leading the asset-backed currency...  ...it's L2s, custom APIs, data pipelines, Docker,...  ...developer operations are as reliable and scalable as possible. Users...  ...experience, specialization as an SRE, and a love of scaling. They should also have the... 
    Data
    Work at office
    Flexible hours

    ABC Labs

    San Francisco, CA
    2 hours ago
  • $251k - $310k

     ...Technical Lead Manager, Software Infra/Query Storage Waymo...  ...and reports to Sr. Staff TLM You will:...  ...technical roadmap of the data pipeline, storage...  ...highly scalable, reliable, and efficient...  ...for massive-scale data processing and...  ...management or tech leadership, including... 
    Data
    Full time
    Remote work

    Waymo

    San Francisco, CA
    2 days ago
  • A leading language learning platform is seeking an experienced SRE Engineer to ensure the reliability and resilience of their infrastructure. Responsibilities include leading incident response, improving observability, and collaborating with various teams to enhance platform... 

    Speak

    San Francisco, CA
    4 days ago
  •  ...technology company in New York is seeking a Senior Site Reliability Engineer to tackle scaling and reliability challenges in a high-intensity environment...  ...systems and database management. You will optimize data infrastructures, improve system performance, and build automation... 
    Data

    Unify

    San Francisco, CA
    1 day ago
  • $145k - $195k

    Platform Engineer - Reliability & Scale at LangChain - San Francisco, CA About LangChain At LangChain, our mission is to make intelligent agents...  ...critical systems: Design and implement high throughput data-intensive systems supporting our flagship SaaS products (LangSmith... 
    Data

    Victrays

    San Francisco, CA
    3 days ago
  •  ...a world-class Site Reliability Engineer to ensure...  ...power agentic AI at scale. Your mission: keep...  ...the founders, the infra team, and the dev team...  ...reliability. Lead incident response with...  ...precision, and ownership. Data-driven engineer:...  ...3+ years in SRE, DevOps, or infrastructure... 
    Data

    Blaxel

    San Francisco, CA
    2 days ago
  • $160k - $225k

    A leading cybersecurity firm in San Francisco is looking for a Platform / Infrastructure Engineer to build and scale core systems for its data workflows. This role involves developing reliable and scalable backend systems, optimizing performance, and collaborating with... 
    Data

    Fable Security, LLC

    San Francisco, CA
    4 days ago
  •  ...building AI agents that can reliably do everyday digital tasks. We...  ...a member of the AI technical staff to join the founding team. Someone...  ...etc. Responsibilities: Scale infra for post-training of...  ...performance of multimodal LLMs (data/tensor/pipeline/context/expert... 
    Data
    Work at office
    Relocation
    Visa sponsorship

    Yutori

    San Francisco, CA
    1 day ago
  • $170k - $250k

     ...Senior Infra Software Engineer Title of Role...  ...and implement internal data and AI applications using...  ...and improve system reliability. Contribute to the development...  ...and maintaining large-scale distributed systems....  ...to groundbreaking projects in the tech industry.
    Data
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    6 hours ago
  • A tech-driven company in San Francisco is seeking an experienced engineer dedicated to building...  ...should have a strong background in scaling internal systems, incident response, and cloud...  ...multi-cloud deployments, improving data security, and developing CI/CD systems. Candidates... 
    Data

    Sieve

    San Francisco, CA
    4 days ago
  • $180k - $300k

     ...the Software Engineer (Infra) role at Numeral ....  ...compliance. Tomorrow, we're scaling that impact even...  ..., improve service reliability and observability, and...  ...APIs, services, and data pipelines. Lead infrastructure architecture...  ..., or regulatory tech. Infrastructure or platform... 
    Data
    Full time
    Immediate start
    Remote work
    Flexible hours

    Numeral

    San Francisco, CA
    3 days ago
  • A leading data and AI company is seeking a Senior Manager, Infrastructure Data Science to lead a team focused on optimizing infrastructure and reliability. You will work closely with engineering leaders, promote data-driven strategies, and implement solutions to enhance... 
    Data

    Databricks Inc.

    San Francisco, CA
    3 days ago
  • $325k

     ...s mission is to create reliable, interpretable, and steerable...  ...and cloud providers. Lead incident response for...  ...built product stacks, scaled databases, run massive...  ...also: Have been an SRE, Production Engineer, or...  ...Currently, we expect all staff to be in one of our... 
    Visa sponsorship

    Menlo Ventures

    San Francisco, CA
    3 days ago
  • $200k - $240k

    A leading AI company in San Francisco is seeking a Backend Tech Lead to architect and scale their core API infrastructure. This position requires 5+ years of backend engineering...  ...and expertise in Node.js and large-scale data systems. You'll collaborate with various teams... 
    Data

    Hockeystack

    San Francisco, CA
    4 days ago
  •  ...a machine learning research engineer at Scale AI. The rest of our team comes from companies...  ...with state-of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and supporting enterprise customers... 
    Data

    Unify

    San Francisco, CA
    1 day ago
  • $175k - $250k

     ...supported the Regular Toilet is seeking a Site Reliability Engineer to enhance the reliability and...  ...at WorkOS. As a key member of the SRE team, you will handle critical responsibilities...  ...ensure our platform runs reliably at scale.
    Remote job
    Flexible hours

    I did my part and supported the Regular Toilet

    San Francisco, CA
    1 day ago
  •  ...Kivu Ventures, and other leading investors, we’re...  ...role We’re hiring an SRE to join our engineering...  ...take ownership of the reliability and performance of the...  ...services and Postgres based data layer. This role reports...  ...influence how we build, scale and operate our platform... 
    Data
    Work at office
    Remote work
    Flexible hours
    2 days per week

    Plenful

    San Francisco, CA
    4 days ago
  •  ...seeking a hands‑on Infrastructure Tech Lead to help scale the platform through a...  ...ways. As customer scale and data volumes grow beyond initial...  ...the platform remains fast, reliable, and resilient as Lightfield...  ...infrastructure‑heavy engineering (SRE, reliability, performance,... 
    Data
    Immediate start
    Work from home

    Lightfield

    San Francisco, CA
    5 days ago
  • $200k - $275k

     ...HealthLeap processes billions of data points from hospital EHRs....  ...services, data pipelines Build reliable data infrastructure that...  ...transforms, and serves data at scale (S3, Iceberg, Spark, Dagster,...  ...stage startup where you owned infra end-to-end This Role Is Not... 
    Data
    Home office
    Day shift

    Healthleap AI

    San Francisco, CA
    6 hours ago
  • $180k - $240k

    A growing technology company is seeking a Site Reliability Engineer (SRE) who is passionate about leveraging data and automation. This role focuses on scaling infrastructure and enhancing customer experience. The ideal candidate should have expertise in Infrastructure as... 
    Data
    Remote job

    Pantera Capital

    San Francisco, CA
    1 day ago
  • $202.5k - $247.5k

     ...production traffic at scale. A few things you...  ...time. About the Infra Platform Team The...  ...abstractions, automation, reliability, and developer...  ...production load. SRE and DevOps problems...  ...security-focused systems Tech Stack ngrok runs...  ...members, market data, and specific work... 
    Data
    Permanent employment
    Full time
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    ngrok Inc.

    San Francisco, CA
    3 days ago
  • $240k

     ...developers to create fast, reliable, and dynamic apps...  ...also create products that scale and remain simple over...  ...world, with exabytes of data, millions of...  ...looking for exceptional staff or principal-level engineers...  ...operating large-scale infra, we’d love to talk! This... 
    Data
    Full time
    Work at office
    Remote work
    Shift work
    Night shift

    Convex

    San Francisco, CA
    3 days ago
  •  ...We are looking for an infra god. You are the person...  ...Make sure our service can scale as we add thousands of...  ...keep our customers' data private and secure within...  ...system stays fast and reliable. Own everything related...  ...comfortable being the lead on technical... 
    Data

    Manufact, Inc.

    San Francisco, CA
    6 hours ago
  • $250k - $300k

     ...evolved into a full‑scale Work AI ecosystem,...  ...broadest range of data: enterprise and world...  ...50, and Gartner’s Tech Innovators in Agentic...  ...the Role The Tech Lead Manager of the...  ...builds the low‑latency, reliable, and secure...  ...observability, and ML infra integrations to deliver... 
    Data
    Home office
    Flexible hours

    aijoblist

    San Francisco, CA
    3 days ago
  • $281k - $356k

     ...experienced TLM to lead the Vehicle Understanding...  ...to our Senior Staff TLM of Semantics...  ...production area to scale Waymo's business...  ...large-scale 3rd party data, and partner teams...  ...Collaborate with Platform/Infra partners to...  ...performance, efficiency, and reliability. Develop the... 
    Data
    Full time
    Temporary work
    Immediate start
    Remote work

    Waymo

    San Francisco, CA
    6 hours ago
  •  ...the bar on safety, reliability, and velocity...  ...be at the heart of scaling and hardening the...  ...collaborating closely with infra, product, and...  ...experience, with 2+ years leading large scale,...  ...as an engineer or tech lead A passion for...  ...(including the data contained therein)... 
    Data

    Slope

    San Francisco, CA
    5 days ago
