Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Member of Technical Staff - Distributed Systems

Gimlet Labs

About Us

Gimlet Labs is building the first heterogeneous neocloud for AI workloads.

As AI systems scale, the industry is hitting fundamental limits in power, capacity, and cost with today's homogeneous, vertically integrated infrastructure. Gimlet addresses this by decoupling AI workloads from the underlying hardware. Our platform intelligently partitions workloads into components and orchestrates each component to hardware that best fits its performance and efficiency needs. This approach enables heterogeneous systems across multi-vendor and multi-generation hardware, including the latest emerging accelerators. These systems unlock step-function improvements in performance and cost efficiency at scale.

On top of this foundation, Gimlet is building a production-grade neocloud for agentic workloads. Customers use Gimlet to deploy and manage their workloads through stable, production-ready APIs, without having to reason about hardware selection, placement, or low-level performance optimization.

Gimlet works with foundation labs, hyperscalers, and AI native companies to power real production workloads built to scale to gigawatt-class AI datacenters.
Mission

Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform that schedules, routes, and operates AI workloads reliably at production scale. You will work on systems that coordinate execution across thousands of nodes, expose stable production APIs, and ensure workloads run predictably under real-world load and failure conditions.

This role is well-suited for engineers who enjoy building foundational infrastructure, understanding systems end-to-end, and operating at scale.
Responsibilities
  • Design and build distributed systems that orchestrate and operate AI workloads at large scale
  • Develop scheduling, routing, and resource management components that coordinate execution across many nodes and services
  • Build production-grade APIs and control planes for deploying and managing workloads
  • Implement mechanisms for reliability, availability, and fault tolerance in distributed environments
  • Instrument systems for observability and debugging at scale
  • Work closely with compilers, runtimes, and hardware to ensure end-to-end system correctness and performance
Qualifications
  • Strong software engineering fundamentals
  • Experience building or operating distributed systems in production environments
  • Comfort reasoning about concurrency, failure modes, and tradeoffs in large-scale systems
Preferred Qualifications
  • Experience with Kubernetes or Kubernetes-adjacent systems beyond basic usage
  • Experience designing service-oriented architectures using RPC or asynchronous messaging
  • Familiarity with scheduling, queues, or resource management systems
  • Experience building reliable APIs and operating systems under high load
  • Software development experience in languages commonly used for systems development (e.g., Go, C++, Python)
Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Member of Technical Staff - Distributed Systems in San Francisco, CA vacancy
  • $150k - $350k

    Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform that schedules, routes, and operates AI workloads reliably at production scale. You will work on systems that coordinate execution... 
    Suggested

    Gimlet Labs, Inc.

    San Francisco, CA
    2 days ago
  •  ...immensely competitive market. Build the systems that make AI inference fast, reliable, and...  ...notices. What we’re looking for Strong distributed systems fundamentals (concurrency,...  ...about your experience, and share as much technical detail about Sail as you want to hear. Come... 
    Suggested
    Work at office
    Immediate start

    Sail Research

    San Francisco, CA
    4 days ago
  •  ...power our research, training, and production environments. These systems form the foundational platform that multiple teams depend on...  ..., reproducible environments, multi-tenant isolation. Distributed Systems Architecture: Sharding, replication, coordination services... 
    Suggested
    Relocation package

    Reflection AI

    San Francisco, CA
    1 day ago
  •  ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference....  ...compilers, kernels, networking, and distributed systems to deliver end-to-end performance... 
    Suggested

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • $225k

     ...inference-time compute to achieve this goal. About the Role As a Software Engineer on the Pre-training Systems team, you will design and operate the distributed infrastructure that trains Magic's long-context models at scale. This role focuses on large-scale model... 
    Suggested
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    1 day ago
  • Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration across science and technology. Our first goal is to democratize frontier AI R&D across scientific disciplines. We believe... 

    Mirendil

    San Francisco, CA
    1 day ago
  • $350k

    Software Engineer ML Infra - Distributed Systems Series A AI Infrastructure Startup | Neocloud Platform | On-site (San Francisco) We’re hiring a Member of Technical Staff - Distributed Systems to join a next-generation AI infrastructure company building the first heterogeneous... 

    Acceler8 Talent

    San Francisco, CA
    3 days ago
  • Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical Staff As a founding...  ...science, or a related field 5+ years of systems engineering experience in an industry setting: distributed data systems, cloud infrastructure, or... 
    Full time
    Part time
    Work at office
    Work from home
    Flexible hours
    2 days per week

    Pixeltable, Inc.

    San Francisco, CA
    2 days ago
  • $150k - $220k

    # Founding Member of Technical Staff (MTS)Bay Area, CAFull-time$150k-$220k + equity## About UsVizopsAI...  ...(MTS), you'll build production-grade systems that power continuous optimization...  ...measurable improvements in production* •Scale distributed workloads for training/serving.... 

    VizopsAI

    San Francisco, CA
    2 days ago
  • $170k - $220k

    Member of Technical Staff - Infrastructure & LLMs Location: San Francisco, CA (Hybrid) Compensation...  ...development of performance-critical systems from day one, working directly on problems...  ...-GPU inference workloads Designing distributed job schedulers Experimenting with... 
    Full time
    Temporary work
    Immediate start
    Visa sponsorship
    Work visa

    Amadeus Search

    San Francisco, CA
    19 hours ago
  • $150k - $300k

     ...inference optimization and RL systems. You will be working on...  ...our RL training stack. Core Technical Responsibilities LLM Serving...  ...throughput/latency SLOs. Model Distribution: Optimize model distribution...  ...development and encourage team members to contribute to the broader... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • Job Description We’re looking for a Member of Technical Staff to build and deploy production-grade AI systems. In this role, you’ll work across modeling, systems, and...  ...workflow automation systems Experience with distributed systems or large-scale infrastructure Background... 

    ERAGON

    San Francisco, CA
    3 days ago
  • $300k

    Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...the systems layer for RL at scale: distributed rollouts, training orchestration, inference... 
    Work at office
    Local area

    Vmax

    San Francisco, CA
    19 hours ago
  • $300k

    Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...with scalable RL infrastructure, distributed training, experiment tracking, and large... 
    Work at office
    Local area

    Vmax

    San Francisco, CA
    3 days ago
  • $300k

    Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...optimization, exploration failures, and distribution shift. Collaborate with researchers... 
    Work at office
    Local area
    Shift work

    Vmax

    San Francisco, CA
    19 hours ago
  •  ...This Role We're looking for an engineer with deep Rust expertise and strong algorithmic fundamentals to work on performance‑critical systems. You'll build the low‑level infrastructure that powers real‑time robotic perception, planning, and control. Core Responsibilities... 

    Dimensional Inc.

    San Francisco, CA
    2 days ago
  •  ...attention, and an agentic operating system can lift that ceiling by an order of...  ...precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who...  ...defend it without raising your voice. Distributed systems intuition. You have lived... 

    Beacon Software

    San Francisco, CA
    19 hours ago
  •  ...Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics...  ...focused on building and deploying the technical systems that make biosecurity real. About the Role As a Member of Technical Staff, Biosecurity at Radical Numerics, you will... 
    Full time

    Radical Numerics

    San Francisco, CA
    2 days ago
  •  ...Member Of Technical Staff Atomic is the leading venture studio for company creation, partnering...  ...Miami, NYC, and San Francisco, and a distributed team across North America. Our latest...  ...contributor building high-impact systems from 0→1. You'll work closely with the... 
    Local area

    Atomic VC

    San Francisco, CA
    4 days ago
  • $150k - $280k

     ...Member of Technical Staff (Backend) San Francisco, CA Compensation: $150,000 – $280,000 + Competitive...  ...and selling successful AI and ML systems. Key company highlights: - Has...  ...The role covers backend engineering, distributed systems, ML pipelines, and agent... 
    Full time
    Temporary work
    H1b
    Work at office
    Visa sponsorship
    Relocation package

    Fuku

    San Francisco, CA
    3 days ago
  •  ...Member Of Technical Staff, Platform Engineer You'll design, build, and own distributed systems and core platform infrastructure end-to-end across the stack - from user-facing product surfaces and real-time interactions to evaluation pipelines, model orchestration, and... 

    Arcada Labs Incorporated

    San Francisco, CA
    3 days ago
  • $180k

     ...Member Of Technical Staff - Pre-Training Palo Alto, CA About XAI XAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit...  ...of scaling laws. Familiar with distributed training, multi-GPU neural network... 
    Temporary work

    Xai

    San Francisco, CA
    19 hours ago
  •  ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU performance...  ...performance analysis tools Familiarity with multi-GPU or distributed execution is a plus... 

    Gimlet Labs

    San Francisco, CA
    2 days ago
  •  ...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world...  ...Responsibilities Design, implement, and optimize distributed training systems that scale across thousands of... 
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    22 hours ago
  • $256k - $276k

     ...the bigger picture and our vision at Postman. The Opportunity As a Member of Technical Staff on AI Infrastructure, you will build and maintain the foundational systems and distributed infrastructure that power AI model post training, inference, and data pipelines... 
    Work at office
    Flexible hours
    3 days per week

    Postman

    San Francisco, CA
    2 days ago
  • $200k - $300k

     ...Member of Technical Staff (Platform) Title of Role: Member of Technical Staff (Platform)...  ...a focus on building robust AI-driven systems, this organization is poised for significant...  ...contributing to the architecture of distributed systems. Deliver features quickly... 
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    1 day ago
  •  ...heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting...  ...Mission Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern). As an...  ...of study Experience with AI/ML or distributed systems. Preferred Qualifications... 
    Internship

    Gimlet Labs

    San Francisco, CA
    4 days ago
  •  ...pioneering biologists, Phylo is building the next generation of AI systems for the life sciences. About the role We’re looking for an...  ...AI platform in production. You’ll design and implement the distributed systems, compute environments, and service architecture that allow... 

    deCircle

    San Francisco, CA
    3 days ago
  •  ...Member of Technical Staff, Model Efficiency Who are we? Our mission is to scale intelligence to serve...  ...and enterprises who are building AI systems to power magical experiences like content...  ...) Scaling performance‑critical distributed systems (e.g., computation, search, storage... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    4 days ago
  • Member of Technical Staff, Post-Training Location: SF Bay Area or Tokyo, Japan Type: Full-time About Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics research to the challenges of biology. We are... 
    Full time

    Radical Numerics

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Member of Technical Staff - Distributed Systems. Be the first to apply!