Member of Technical Staff - Distributed Systems
Sail Research
Sail is the foundation of useful, agentic AI. We are here to take a big swing at the most ambitious engineering challenge of our careers. Everyone working at Sail will become an expert; nothing less will do in our immensely competitive market. Build the systems that make AI inference fast, reliable, and cost‑efficient at global scale. You’ll design the control plane that schedules a huge queue of tokens over a diverse fleet of machines, spread all over the world. What you’ll do Design and implement high‑performance schedulers (admission control, queuing, priority, fairness, preemption, bin packing). Build global routing and traffic management (latency‑aware dispatch, predictive autoscaling, failover strategies). LLM‑specific routing optimizations, e.g., KV caching that lets us trade memory for compute, across the pyramid of GPU RAM, CPU RAM, and NVMe flash. Build deep observability: we want to trace every millisecond of our systems, and catch failures early enough that we can make things right before the customer even notices. What we’re looking for Strong distributed systems fundamentals (concurrency, networking, databases, performance engineering). Eagerness to work with agents. Distributed systems are not easy to one‑shot; you’ll always have to think carefully about testing correctness and edge cases. Writing extremely clear plans and tests is a must. Bonus: experience with ML inference stacks (vLLM/SGLang), GPUs/accelerators. Interview process Meet the CEO. This is the first step because we respect your time. Ask any question and get a definitive answer immediately. Meet the CTO, who will ask about your experience, and share as much technical detail about Sail as you want to hear. Come in to Sail's SF office for an interview day. Meet the whole team, then you'll have 3‑4 hours to work on a problem that closely simulates the work we do daily. It's an objectively scored task, so you'll have immediate feedback on how well your code is working – just like we do in production! AI assistance is highly encouraged, and we'll provide a laptop with all the best tools set up. Finish with a short presentation describing your process, learnings, and results. Offer. Once the team decides we want to work with you, we make a strong offer quickly and will be quite persistent over email/text/calls :) #J-18808-Ljbffr Sail Research
- ...first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in... .... Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform...Suggested
$150k - $350k
Mission Gimlet Labs is seeking a Member of Technical Staff focused on distributed systems. In this role, you will build the core platform that schedules, routes, and operates AI workloads reliably at production scale. You will work on systems that coordinate execution...Suggested- ...power our research, training, and production environments. These systems form the foundational platform that multiple teams depend on... ..., reproducible environments, multi-tenant isolation. Distributed Systems Architecture: Sharding, replication, coordination services...SuggestedRelocation package
- ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental... ...Gimlet Labs is seeking a Member of Technical Staff focused on ML systems and inference.... ...compilers, kernels, networking, and distributed systems to deliver end-to-end performance...Suggested
$225k
...inference-time compute to achieve this goal. About the Role As a Software Engineer on the Pre-training Systems team, you will design and operate the distributed infrastructure that trains Magic's long-context models at scale. This role focuses on large-scale model...SuggestedRelocationVisa sponsorship- Member of Technical Staff, ML Systems Mirendil Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration across science and technology. Our first goal is to democratize frontier AI R&D across scientific disciplines. We believe...
$150k - $300k
...inference optimization and RL systems. You will be working on... ...our RL training stack. Core Technical Responsibilities LLM Serving... ...throughput/latency SLOs. Model Distribution: Optimize model distribution... ...development and encourage team members to contribute to the broader...Work at officeRemote workVisa sponsorshipRelocation packageFlexible hoursShift work$150k - $220k
# Founding Member of Technical Staff (MTS)Bay Area, CAFull-time$150k-$220k + equity## About UsVizopsAI... ...(MTS), you'll build production-grade systems that power continuous optimization... ...measurable improvements in production* •Scale distributed workloads for training/serving....- Pixeltable Inc. Member of Technical Staff San Francisco, CA·Full time Apply for Member of Technical Staff As a founding... ...science, or a related field 5+ years of systems engineering experience in an industry setting: distributed data systems, cloud infrastructure, or...Full timePart timeWork at officeWork from homeFlexible hours2 days per week
$170k - $220k
Member of Technical Staff - Infrastructure & LLMs Location: San Francisco, CA (Hybrid) Compensation... ...development of performance-critical systems from day one, working directly on problems... ...-GPU inference workloads Designing distributed job schedulers Experimenting with...Full timeTemporary workImmediate startVisa sponsorshipWork visa$350k
Software Engineer ML Infra - Distributed Systems Series A AI Infrastructure Startup | Neocloud Platform | On-site (San Francisco) We’re hiring a Member of Technical Staff - Distributed Systems to join a next-generation AI infrastructure company building the first heterogeneous...- Job Description We’re looking for a Member of Technical Staff to build and deploy production-grade AI systems. In this role, you’ll work across modeling, systems, and... ...workflow automation systems Experience with distributed systems or large-scale infrastructure Background...
$300k
Member of Technical Staff - Mechanistic Interpretability About V max V max is an applied research lab... ...open-ended learning. We are building systems to exceed humans in all capacities by... ...with scalable RL infrastructure, distributed training, experiment tracking, and large...Work at officeLocal area$300k
Member of Technical Staff - RL Algorithms About V max V max is an applied research lab developing AI... ...open-ended learning. We are building systems to exceed humans in all capacities by... ...optimization, exploration failures, and distribution shift. Collaborate with researchers...Work at officeLocal areaShift work$300k
Member of Technical Staff - RL Infrastructure About V max V max is an applied research lab developing... ...open-ended learning. We are building systems to exceed humans in all capacities by... ...the systems layer for RL at scale: distributed rollouts, training orchestration, inference...Work at officeLocal area- ...attention, and an agentic operating system can lift that ceiling by an order of... ...precedents to copy from. About the Role Members of Technical Staff (MTS) are the senior engineers who... ...defend it without raising your voice. Distributed systems intuition. You have lived...
- ...Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics... ...focused on building and deploying the technical systems that make biosecurity real. About the Role As a Member of Technical Staff, Biosecurity at Radical Numerics, you will...Full time
- ...This Role We're looking for an engineer with deep Rust expertise and strong algorithmic fundamentals to work on performance‑critical systems. You'll build the low‑level infrastructure that powers real‑time robotic perception, planning, and control. Core Responsibilities...
$150k - $280k
...Member of Technical Staff (Backend) San Francisco, CA Compensation: $150,000 – $280,000 + Competitive... ...and selling successful AI and ML systems. Key company highlights: - Has... ...The role covers backend engineering, distributed systems, ML pipelines, and agent...Full timeTemporary workH1bWork at officeVisa sponsorshipRelocation package- ...Member Of Technical Staff Atomic is the leading venture studio for company creation, partnering... ...Miami, NYC, and San Francisco, and a distributed team across North America. Our latest... ...contributor building high-impact systems from 0→1. You'll work closely with the...Local area
- ...Member Of Technical Staff, Platform Engineer You'll design, build, and own distributed systems and core platform infrastructure end-to-end across the stack - from user-facing product surfaces and real-time interactions to evaluation pipelines, model orchestration, and...
- ...neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental... ...Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU performance... ...performance analysis tools Familiarity with multi-GPU or distributed execution is a plus...
$200k - $300k
...Member of Technical Staff (Platform) Title of Role: Member of Technical Staff (Platform)... ...a focus on building robust AI-driven systems, this organization is poised for significant... ...contributing to the architecture of distributed systems. Deliver features quickly...Work at office- ...heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting... ...Mission Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern). As an... ...of study Experience with AI/ML or distributed systems. Preferred Qualifications...Internship
- ...Member Of Technical Staff, Training Infra Bay Area Ai Systems Inception creates the world's fastest, most efficient AI models. Our Mercury model is the world... ...Responsibilities Design, implement, and optimize distributed training systems that scale across thousands of...Immediate startFlexible hours
$256k - $276k
...the bigger picture and our vision at Postman. The Opportunity As a Member of Technical Staff on AI Infrastructure, you will build and maintain the foundational systems and distributed infrastructure that power AI model post training, inference, and data pipelines...Work at officeFlexible hours3 days per week- ...pioneering biologists, Phylo is building the next generation of AI systems for the life sciences. About the role We’re looking for an... ...AI platform in production. You’ll design and implement the distributed systems, compute environments, and service architecture that allow...
- ...Member of Technical Staff, Model Efficiency Who are we? Our mission is to scale intelligence to serve... ...and enterprises who are building AI systems to power magical experiences like content... ...) Scaling performance‑critical distributed systems (e.g., computation, search, storage...Full timeWork at officeRemote workFlexible hours
- ...and enterprises who are building AI systems to power magical experiences like content... ...matter, and join the team. As a Member of Technical Staff with a focus on Multimodal AI, you will... ...capabilities. Knowledge of distributed training strategies, especially for large...Full timeWork at officeRemote workFlexible hours
- Member of Technical Staff, Post-Training Location: SF Bay Area or Tokyo, Japan Type: Full-time About Radical Numerics Radical Numerics is an AI lab bringing the rigor of distributed systems, model architecture, and numerics research to the challenges of biology. We are...Full time
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Member of Technical Staff - Distributed Systems. Be the first to apply!
- technical support associate San Francisco, CA
- decision support analyst San Francisco, CA
- desktop support analyst San Francisco, CA
- senior technical analyst San Francisco, CA
- user support analyst San Francisco, CA
- customer support technician San Francisco, CA
- technical support analyst San Francisco, CA
- support analyst San Francisco, CA
- tech assistant San Francisco, CA
- technical support specialist San Francisco, CA

