Principal Member of Technical Staff, Platform Infrastructure

$200k - $350k

Edison Scientific

About

Edison Scientific builds and commercializes AI agents for science. Scientific discovery moves too slowly, and autonomous AI agents are how we intend to fix that. We're assembling a team of top researchers and engineers across AI and biology to build an AI scientist.

Role

As a Principal MTS, you'll play a key role in designing, scaling, and operating the core platform infrastructure that powers autonomous scientific discovery. Your primary focus will be the orchestration for our agents at scale: building and managing clusters that orchestrate thousands of persistent, stateful workloads, developing custom resource definitions (CRDs) and operators, and ensuring the reliability and efficiency of our compute layer.

Our mission is to build an AI scientist, and you'll own the infrastructure foundation it runs on. AI agents performing long‑running scientific research demand resilient scheduling, lifecycle management, and resource orchestration far beyond typical cloud‑native workloads. This role will influence platform architecture, establish infrastructure best practices, and partner closely with backend engineers, ML engineers, and researchers to deliver a production‑grade environment that lets science move faster.

At Edison Scientific, engineering at the senior level is about technical ownership and leverage: understanding how complex systems interact, making sound architectural tradeoffs, and building foundations that allow teams and science to move faster.

This role is on‑site at our San Francisco office in the Dogpatch neighborhood. Our office is a converted warehouse with high ceilings, open space, and a team that genuinely believes in what they're building. This position is part of the Platform team.

Responsibilities

  • Architect, implement, and operate Kubernetes clusters that support thousands of concurrent, persistent resources (agents, jobs, services) with high availability and efficient resource utilization.
  • Design and develop custom resource definitions (CRDs) and Kubernetes operators to model and manage domain‑specific workloads such as AI agent lifecycles, research pipelines, and long‑running compute tasks.
  • Drive the strategy for cluster scaling, node pool management, autoscaling policies, and resource quota frameworks to handle rapid workload growth.
  • Build and maintain infrastructure‑as‑code (Terraform, Pulumi, or similar) for reproducible, version‑controlled environment management.
  • Design and implement robust scheduling, placement, and affinity strategies to optimize cost, performance, and fault tolerance for heterogeneous workloads (CPU, GPU, memory‑intensive).
  • Establish and uphold best practices around observability, monitoring, alerting, and incident response for infrastructure systems (Prometheus, Grafana, Datadog, or similar).
  • Own storage and networking strategy within Kubernetes, including persistent volume management, CSI drivers, service mesh, network policies, and ingress architecture.
  • Troubleshoot complex, cross‑system infrastructure issues and guide others through effective debugging and remediation in distributed environments.
  • Collaborate closely with backend, ML, and research teams to understand workload requirements and translate them into reliable infrastructure patterns.

Qualifications

  • Typically, 10+ years of professional infrastructure or platform engineering experience, with deep hands‑on Kubernetes expertise in production environments.
  • Experience designing and implementing custom resource definitions (CRDs) and Kubernetes operators (using frameworks such as Kubebuilder, Operator SDK, or controller‑runtime).
  • Track record of operating and scaling Kubernetes clusters supporting thousands of persistent or long‑lived resources (stateful workloads, persistent pods, long‑running jobs).
  • Deep understanding of Kubernetes internals (API server, etcd, scheduler, controller manager, kubelet) and how they behave at scale.
  • Expertise with cloud infrastructure (AWS EKS, GCP GKE, or Azure AKS) and associated networking, storage, and IAM primitives.
  • Proficiency in at least one systems or backend language for operator development and infrastructure tooling.
  • Hands‑on experience with infrastructure‑as‑code tools (Terraform, Pulumi, or Crossplane) and GitOps workflows.
  • Strong working knowledge of container networking (CNI plugins, service mesh, network policies), storage (CSI, persistent volumes, StatefulSets), and security (RBAC, Pod Security Standards, secrets management).
  • Ability to operate autonomously, make sound technical judgments, and drive projects from concept through production.

Bonus points for

  • Experience with data‑intensive platforms, scientific computing, or ML/AI infrastructure.
  • Prior experience in startups or small teams with significant architectural ownership and ambiguity.
  • Experience scaling systems, teams, or platforms through periods of rapid growth.

Salary

$200,000 - $350,000 • Offers equity

Why join us?

  • Competitive salary and equity
  • Full healthcare coverage: we pay 100% of premiums for you and your dependents
  • Support for growing families, including a yearly new parent stipend and fertility coverage through Carrot
  • 401(k) company matching
  • $300 health and wellness benefit
  • Lunch is on us every day you're in the office, and dinner is on us when you're working late
  • Regular team offsites and company events
  • A fast‑moving, mission‑driven culture where smart people do their best work and actually enjoy doing it

Vacancy posted 3 days ago