Member of Technical Staff - Distributed Systems

Sail Research

Sail is the foundation of useful, agentic AI. We are here to take a big swing at the most ambitious engineering challenge of our careers. Everyone working at Sail will become an expert; nothing less will do in our immensely competitive market.

Build the systems that make AI inference fast, reliable, and cost‑efficient at global scale. You’ll design the control plane that schedules a huge queue of tokens over a diverse fleet of machines, spread all over the world.

What you’ll do
  • Design and implement high‑performance schedulers (admission control, queuing, priority, fairness, preemption, bin packing).
  • Build global routing and traffic management (latency‑aware dispatch, predictive autoscaling, failover strategies).
  • LLM‑specific routing optimizations, e.g., KV caching that lets us trade memory for compute, across the pyramid of GPU RAM, CPU RAM, and NVMe flash.
  • Build deep observability: we want to trace every millisecond of our systems, and catch failures early enough that we can make things right before the customer even notices.

What we’re looking for
  • Strong distributed systems fundamentals (concurrency, networking, databases, performance engineering).
  • Eagerness to work with agents. Distributed systems are not easy to one‑shot; you’ll always have to think carefully about testing correctness and edge cases. Writing extremely clear plans and tests is a must.
  • Bonus: experience with ML inference stacks (vLLM/SGLang), GPUs/accelerators.

Interview process
  • Meet the CEO. This is the first step because we respect your time. Ask any question and get a definitive answer immediately.
  • Meet the CTO, who will ask about your experience, and share as much technical detail about Sail as you want to hear.
  • Come in to Sail's SF office for an interview day. Meet the whole team, then you'll have 3‑4 hours to work on a problem that closely simulates the work we do daily. It's an objectively scored task, so you'll have immediate feedback on how well your code is working – just like we do in production! AI assistance is highly encouraged, and we'll provide a laptop with all the best tools set up. Finish with a short presentation describing your process, learnings, and results.
  • Offer. Once the team decides we want to work with you, we make a strong offer quickly and will be quite persistent over email/text/calls :)

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Member of Technical Staff - Distributed Systems in San Francisco, CA vacancy
  •  ...first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in...  .... Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform... 
    Suggested

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • $150k - $350k

    Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform that schedules, routes, and operates AI workloads reliably at production scale. You will work on systems that coordinate execution... 
    Suggested

    Gimlet Labs, Inc.

    San Francisco, CA
    2 days ago
  •  ...power our research, training, and production environments. These systems form the foundational platform that multiple teams depend on...  ..., reproducible environments, multi-tenant isolation. Distributed Systems Architecture: Sharding, replication, coordination services... 
    Suggested
    Relocation package

    Reflection AI

    San Francisco, CA
    1 day ago
  •  ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference....  ...compilers, kernels, networking, and distributed systems to deliver end-to-end performance... 
    Suggested

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • $225k

     ...inference-time compute to achieve this goal. About the Role As a Software Engineer on the Pre-training Systems team, you will design and operate the distributed infrastructure that trains Magic's long-context models at scale. This role focuses on large-scale model... 
    Suggested
    Relocation
    Visa sponsorship

    Magic Inc

    San Francisco, CA
    1 day ago
  • Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration across science and technology. Our first goal is to democratize frontier AI R&D across scientific disciplines. We believe... 

    Mirendil

    San Francisco, CA
    1 day ago
  • $150k - $300k

     ...inference optimization and RL systems. You will be working on...  ...our RL training stack. Core Technical Responsibilities LLM Serving...  ...throughput/latency SLOs. Model Distribution: Optimize model distribution...  ...development and encourage team members to contribute to the broader... 
    Work at office
    Remote work
    Visa sponsorship
    Relocation package
    Flexible hours
    Shift work

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • $150k - $220k

    Founding Member of Technical Staff (MTS), Bay Area, CA, Full-time, $150k-$220k + equity. About Us: VizopsAI...  ...(MTS), you'll build production-grade systems that power continuous optimization...  ...measurable improvements in production. Scale distributed workloads for training/serving.... 

    VizopsAI

    San Francisco, CA
    2 days ago
  • Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical Staff As a founding...  ...science, or a related field 5+ years of systems engineering experience in an industry setting: distributed data systems, cloud infrastructure, or... 
    Full time
    Part time
    Work at office
    Work from home
    Flexible hours
    2 days per week

    Pixeltable, Inc.

    San Francisco, CA
    2 days ago
  • $170k - $220k

    Member of Technical Staff - Infrastructure & LLMs Location: San Francisco, CA (Hybrid) Compensation...  ...development of performance-critical systems from day one, working directly on problems...  ...-GPU inference workloads Designing distributed job schedulers Experimenting with... 
    Full time
    Temporary work
    Immediate start
    Visa sponsorship
    Work visa

    Amadeus Search

    San Francisco, CA
    20 hours ago
  • $350k

    Software Engineer ML Infra - Distributed Systems Series A AI Infrastructure Startup | Neocloud Platform | On-site (San Francisco) We’re hiring a Member of Technical Staff - Distributed Systems to join a next-generation AI infrastructure company building the first heterogeneous... 

    Acceler8 Talent

    San Francisco, CA
    3 days ago
  • Job Description We’re looking for a Member of Technical Staff to build and deploy production-grade AI systems. In this role, you’ll work across modeling, systems, and...  ...workflow automation systems Experience with distributed systems or large-scale infrastructure Background... 

    ERAGON

    San Francisco, CA
    3 days ago
  • $300k

    Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...with scalable RL infrastructure, distributed training, experiment tracking, and large... 
    Work at office
    Local area

    Vmax

    San Francisco, CA
    3 days ago
  • $300k

    Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...optimization, exploration failures, and distribution shift. Collaborate with researchers... 
    Work at office
    Local area
    Shift work

    Vmax

    San Francisco, CA
    20 hours ago
  • $300k

    Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing...  ...open-ended learning. We are building systems to exceed humans in all capacities by...  ...the systems layer for RL at scale: distributed rollouts, training orchestration, inference... 
    Work at office
    Local area

    Vmax

    San Francisco, CA
    20 hours ago
  •  ...attention, and an agentic operating system can lift that ceiling by an order of...  ...precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who...  ...defend it without raising your voice. Distributed systems intuition. You have lived... 

    Beacon Software

    San Francisco, CA
    20 hours ago
  •  ...Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics...  ...focused on building and deploying the technical systems that make biosecurity real. About the Role As a Member of Technical Staff, Biosecurity at Radical Numerics, you will... 
    Full time

    Radical Numerics

    San Francisco, CA
    2 days ago
  •  ...This Role We're looking for an engineer with deep Rust expertise and strong algorithmic fundamentals to work on performance‑critical systems. You'll build the low‑level infrastructure that powers real‑time robotic perception, planning, and control. Core Responsibilities... 

    Dimensional Inc.

    San Francisco, CA
    2 days ago
  • $150k - $280k

     ...Member of Technical Staff (Backend) San Francisco, CA Compensation: $150,000 – $280,000 + Competitive...  ...and selling successful AI and ML systems. Key company highlights: - Has...  ...The role covers backend engineering, distributed systems, ML pipelines, and agent... 
    Full time
    Temporary work
    H1b
    Work at office
    Visa sponsorship
    Relocation package

    Fuku

    San Francisco, CA
    3 days ago
  •  ...Member Of Technical Staff Atomic is the leading venture studio for company creation, partnering...  ...Miami, NYC, and San Francisco, and a distributed team across North America. Our latest...  ...contributor building high-impact systems from 0→1. You'll work closely with the... 
    Local area

    Atomic VC

    San Francisco, CA
    4 days ago
  •  ...Member Of Technical Staff, Platform Engineer You'll design, build, and own distributed systems and core platform infrastructure end-to-end across the stack - from user-facing product surfaces and real-time interactions to evaluation pipelines, model orchestration, and... 

    Arcada Labs Incorporated

    San Francisco, CA
    3 days ago
  •  ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental...  ...Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU performance...  ...performance analysis tools Familiarity with multi-GPU or distributed execution is a plus... 

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • $200k - $300k

     ...Member of Technical Staff (Platform) Title of Role: Member of Technical Staff (Platform)...  ...a focus on building robust AI-driven systems, this organization is poised for significant...  ...contributing to the architecture of distributed systems. Deliver features quickly... 
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    1 day ago
  •  ...heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting...  ...Mission Gimlet Labs is seeking a Member of Staff focused on AI Research (Intern). As an...  ...of study Experience with AI/ML or distributed systems. Preferred Qualifications... 
    Internship

    Gimlet Labs

    San Francisco, CA
    4 days ago
  •  ...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world...  ...Responsibilities Design, implement, and optimize distributed training systems that scale across thousands of... 
    Immediate start
    Flexible hours

    Inception LLC

    San Francisco, CA
    23 hours ago
  • $256k - $276k

     ...the bigger picture and our vision at Postman. The Opportunity As a Member of Technical Staff on AI Infrastructure, you will build and maintain the foundational systems and distributed infrastructure that power AI model post training, inference, and data pipelines... 
    Work at office
    Flexible hours
    3 days per week

    Postman

    San Francisco, CA
    2 days ago
  •  ...pioneering biologists, Phylo is building the next generation of AI systems for the life sciences. About the role We’re looking for an...  ...AI platform in production. You’ll design and implement the distributed systems, compute environments, and service architecture that allow... 

    deCircle

    San Francisco, CA
    3 days ago
  •  ...Member of Technical Staff, Model Efficiency Who are we? Our mission is to scale intelligence to serve...  ...and enterprises who are building AI systems to power magical experiences like content...  ...) Scaling performance‑critical distributed systems (e.g., computation, search, storage... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    4 days ago
  •  ...and enterprises who are building AI systems to power magical experiences like content...  ...matter, and join the team. As a Member of Technical Staff with a focus on Multimodal AI, you will...  ...capabilities. Knowledge of distributed training strategies, especially for large... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    3 days ago
  • Member of Technical Staff, Post-Training Location: SF Bay Area or Tokyo, Japan Type: Full-time About Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics research to the challenges of biology. We are... 
    Full time

    Radical Numerics

    San Francisco, CA
    2 days ago