Member of Technical Staff - Distributed Systems
Gimlet Labs
About Us
Gimlet Labs is building the first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in power, capacity, and cost with today's homogeneous, vertically integrated infrastructure. Gimlet addresses this by decoupling AI workloads from the underlying hardware. Our platform intelligently partitions workloads into components and orchestrates each component to hardware that best fits its performance and efficiency needs. This approach enables heterogeneous systems across multi-vendor and multi-generation hardware, including the latest emerging accelerators. These systems unlock step-function improvements in performance and cost efficiency at scale.
On top of this foundation, Gimlet is building a production-grade neocloud for agentic workloads. Customers use Gimlet to deploy and manage their workloads through stable, production-ready APIs, without having to reason about hardware selection, placement, or low-level performance optimization. Gimlet works with foundation labs, hyperscalers, and AI-native companies to power real production workloads built to scale to gigawatt-class AI datacenters.
Mission
Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform that schedules, routes, and operates AI workloads reliably at production scale. You will work on systems that coordinate execution across thousands of nodes, expose stable production APIs, and ensure workloads run predictably under real-world load and failure conditions. This role is well-suited for engineers who enjoy building foundational infrastructure, understanding systems end-to-end, and operating at scale.
Responsibilities
- Design and build distributed systems that orchestrate and operate AI workloads at large scale
- Develop scheduling, routing, and resource management components that coordinate execution across many nodes and services
- Build production-grade APIs and control planes for deploying and managing workloads
- Implement mechanisms for reliability, availability, and fault tolerance in distributed environments
- Instrument systems for observability and debugging at scale
- Work closely with compilers, runtimes, and hardware to ensure end-to-end system correctness and performance
Qualifications
- Strong software engineering fundamentals
- Experience building or operating distributed systems in production environments
- Comfort reasoning about concurrency, failure modes, and tradeoffs in large-scale systems
- Experience with Kubernetes or Kubernetes-adjacent systems beyond basic usage
- Experience designing service-oriented architectures using RPC or asynchronous messaging
- Familiarity with scheduling, queues, or resource management systems
- Experience building reliable APIs and operating systems under high load
- Software development experience in languages commonly used for systems development (e.g., Go, C++, Python)
Vacancy posted 2 days ago

