Principal Member of Technical Staff, Platform Infrastructure
$200k - $350k
Edison Scientific
About

Edison Scientific builds and commercializes AI agents for science. Scientific discovery moves too slowly, and autonomous AI agents are how we intend to fix that. We're assembling a team of top researchers and engineers across AI and biology to build an AI scientist.

Role

As a Principal MTS, you'll play a key role in designing, scaling, and operating the core platform infrastructure that powers autonomous scientific discovery. Your primary focus will be the orchestration for our agents at scale: building and managing clusters that orchestrate thousands of persistent, stateful workloads, developing custom resource definitions (CRDs) and operators, and ensuring the reliability and efficiency of our compute layer at scale.

Our mission is to build an AI scientist, and you'll own the infrastructure foundation it runs on. AI agents performing long‑running scientific research demand resilient scheduling, lifecycle management, and resource orchestration far beyond typical cloud‑native workloads. This role will influence platform architecture, establish infrastructure best practices, and partner closely with backend engineers, ML engineers, and researchers to deliver a production‑grade environment that lets science move faster.

At Edison Scientific, engineering at the senior level is about technical ownership and leverage: understanding how complex systems interact, making sound architectural tradeoffs, and building foundations that allow teams and science to move faster.

This role is on‑site at our San Francisco office in the Dogpatch neighborhood. Our office is a converted warehouse with high ceilings, open space, and a team that genuinely believes in what they're building. This position is part of the Platform team.

Responsibilities

- Architect, implement, and operate Kubernetes clusters that support thousands of concurrent, persistent resources (agents, jobs, services) with high availability and efficient resource utilization.
- Design and develop custom resource definitions (CRDs) and Kubernetes operators to model and manage domain‑specific workloads such as AI agent lifecycles, research pipelines, and long‑running compute tasks.
- Drive the strategy for cluster scaling, node pool management, autoscaling policies, and resource quota frameworks to handle rapid workload growth.
- Build and maintain infrastructure‑as‑code (Terraform, Pulumi, or similar) for reproducible, version‑controlled environment management.
- Design and implement robust scheduling, placement, and affinity strategies to optimize cost, performance, and fault tolerance for heterogeneous workloads (CPU, GPU, memory‑intensive).
- Establish and uphold best practices around observability, monitoring, alerting, and incident response for infrastructure systems (Prometheus, Grafana, Datadog, or similar).
- Own storage and networking strategy within Kubernetes, including persistent volume management, CSI drivers, service mesh, network policies, and ingress architecture.
- Troubleshoot complex, cross‑system infrastructure issues and guide others through effective debugging and remediation in distributed environments.
- Collaborate closely with backend, ML, and research teams to understand workload requirements and translate them into reliable infrastructure patterns.

Qualifications

- Typically, 10+ years of professional infrastructure or platform engineering experience, with deep hands‑on Kubernetes expertise in production environments.
- Experience designing and implementing custom resource definitions (CRDs) and Kubernetes operators (using frameworks such as Kubebuilder, Operator SDK, or controller‑runtime).
- Track record of operating and scaling Kubernetes clusters supporting thousands of persistent or long‑lived resources (stateful workloads, persistent pods, long‑running jobs).
- Deep understanding of Kubernetes internals (API server, etcd, scheduler, controller manager, kubelet) and how they behave at scale.
- Expertise with cloud infrastructure (AWS EKS, GCP GKE, or Azure AKS) and associated networking, storage, and IAM primitives.
- Proficiency in at least one systems or backend language for operator development and infrastructure tooling.
- Hands‑on experience with infrastructure‑as‑code tools (Terraform, Pulumi, or Crossplane) and GitOps workflows.
- Strong working knowledge of container networking (CNI plugins, service mesh, network policies), storage (CSI, persistent volumes, StatefulSets), and security (RBAC, Pod Security Standards, secrets management).
- Ability to operate autonomously, make sound technical judgments, and drive projects from concept through production.

Bonus points for

- Experience with data‑intensive platforms, scientific computing, or ML/AI infrastructure.
- Prior experience in startups or small teams with significant architectural ownership and ambiguity.
- Experience scaling systems, teams, or platforms through periods of rapid growth.

Salary

$200,000 - $350,000 • Offers equity

Why join us?

- Competitive salary and equity
- Full healthcare coverage: we pay 100% of premiums for you and your dependents
- Support for growing families, including a yearly new parent stipend and fertility coverage through Carrot
- 401(k) company matching
- $300 health and wellness benefit
- Lunch is on us every day you're in the office, and dinner is on us when you're working late
- Regular team offsites and company events
- A fast‑moving, mission‑driven culture where smart people do their best work and actually enjoy doing it

