Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal Infrastructure Engineer

$200k - $350k

Edison Scientific

About Edison Scientific builds and commercializes AI agents for science. Scientific discovery moves too slowly, and autonomous AI agents are how we intend to fix that. We’re assembling a team of top researchers and engineers across AI and biology to build an AI scientist. Role As a Principal Infrastructure Engineer, you’ll play a key role in designing, scaling, and operating the core platform infrastructure that powers autonomous scientific discovery. Your primary focus will be the orchestration for our agents at scale — building and managing clusters that orchestrate thousands of persistent, stateful workloads, developing custom resource definitions (CRDs) and operators, and ensuring the reliability and efficiency of our compute layer at scale. Our mission is to build an AI scientist, and you’ll own the infrastructure foundation it runs on. AI agents performing long‑running scientific research demand resilient scheduling, lifecycle management, and resource orchestration far beyond typical cloud‑native workloads. This role will influence platform architecture, establish infrastructure best practices, and partner closely with backend engineers, ML engineers, and researchers to deliver a production‑grade environment that lets science move faster. At Edison Scientific, engineering at the senior level is about technical ownership and leverage—understanding how complex systems interact, making sound architectural tradeoffs, and building foundations that allow teams and science to move faster. This role is on‑site at our San Francisco office in the Dogpatch neighborhood. Our office is a converted warehouse with high ceilings, open space, and a team that genuinely believes in what they’re building. This position is part of the Platform team. Responsibilities Architect, implement, and operate Kubernetes clusters that support thousands of concurrent, persistent resources (agents, jobs, services) with high availability and efficient resource utilization. Design and develop custom resource definitions (CRDs) and Kubernetes operators to model and manage domain‑specific workloads such as AI agent lifecycles, research pipelines, and long‑running compute tasks. Drive the strategy for cluster scaling, node pool management, autoscaling policies, and resource quota frameworks to handle rapid workload growth. Build and maintain infrastructure‑as‑code (Terraform, Pulumi, or similar) for reproducible, version‑controlled environment management. Design and implement robust scheduling, placement, and affinity strategies to optimize cost, performance, and fault tolerance for heterogeneous workloads (CPU, GPU, memory‑intensive). Establish and uphold best practices around observability, monitoring, alerting, and incident response for infrastructure systems (Prometheus, Grafana, Datadog, or similar). Own storage and networking strategy within Kubernetes — including persistent volume management, CSI drivers, service mesh, network policies, and ingress architecture. Troubleshoot complex, cross‑system infrastructure issues and guide others through effective debugging and remediation in distributed environments. Collaborate closely with backend, ML, and research teams to understand workload requirements and translate them into reliable infrastructure patterns. Qualifications 10+ years of professional infrastructure or platform engineering experience, with deep hands‑on Kubernetes expertise in production environments. Experience designing and implementing custom resource definitions (CRDs) and Kubernetes operators (using frameworks such as Kubebuilder, Operator SDK, or controller‑runtime). Track record of operating and scaling Kubernetes clusters supporting thousands of persistent or long‑lived resources (stateful workloads, persistent pods, long‑running jobs). Deep understanding of Kubernetes internals—API server, etcd, scheduler, controller manager, kubelet—and how they behave at scale. Expertise with cloud infrastructure (AWS EKS, GCP GKE, or Azure AKS) and associated networking, storage, and IAM primitives. Proficiency in at least one systems or backend language for operator development and infrastructure tooling. Hands‑on experience with infrastructure‑as‑code tools (Terraform, Pulumi, or Crossplane) and GitOps workflows. Strong working knowledge of container networking (CNI plugins, service mesh, network policies), storage (CSI, persistent volumes, StatefulSets), and security (RBAC, Pod Security Standards, secrets management). Ability to operate autonomously, make sound technical judgments, and drive projects from concept through production. Bonus points for Experience with data‑intensive platforms, scientific computing, or ML/AI infrastructure. Prior experience in startups or small teams with significant architectural ownership and ambiguity. Experience scaling systems, teams, or platforms through periods of rapid growth. Salary $200,000 - $350,000 • Offers equity Why join us? Competitive salary and equity Full healthcare coverage — we pay 100% of premiums for you and your dependents Support for growing families, including a yearly new parent stipend and fertility coverage through Carrot 401(k) company matching $300 health and wellness benefit Lunch is on us every day you’re in the office, and dinner is on us when you’re working late Regular team offsites and company events A fast‑moving, mission‑driven culture where smart people do their best work and actually enjoy doing it #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Principal Infrastructure Engineer in San Francisco, CA vacancy
  • $2,000 per month

     ...best and pathologically unfair at worst. Our mission is to reimagine the world of data with you. The Role As a Principal Infrastructure Engineer , you will help lead and build out the automation for provisioning and managing the Nextdata OS in multiple clouds. Given... 
    Suggested

    NextData

    San Francisco, CA
    1 day ago
  • $190k - $240k

     ...transformation with innovative virtual payment products. Our infrastructure will be across multiple continents over the next few years...  ...looking for: ~12+ years of experience in infrastructure engineering or software engineering ~ Strong experience working with... 
    Suggested
    Work at office
    Local area
    Home office
    Flexible hours

    HighNote

    San Francisco, CA
    2 days ago
  • $172k - $215k

     ...F5 Principal Network Engineer At Early Warning, we've powered and protected the U.S. financial system for over thirty years with cutting-...  ...delivery and cloud networking initiatives, partnering across infrastructure, security, and application teams to deliver scalable... 
    Suggested
    Hourly pay
    Work at office
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    Early Warning Services

    San Francisco, CA
    5 days ago
  • $250k - $325k

     ...Location San Francisco, CA Employment Type Full time Department Engineering Compensation $250K – $325K We're building the company which will de-risk the largest infrastructure build‑out in history. When people finance GPU clusters, the datacenters housing them, and the... 
    Suggested
    Long term contract
    Full time
    Contract work
    Fixed term contract
    Work at office
    Local area
    Visa sponsorship
    Shift work
    3 days per week

    Electric Capital

    San Francisco, CA
    10 hours ago
  •  ...instrumental in designing, building, and maintaining the shared infrastructure services and platforms that our product and application...  ...years of experience in an Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a strong focus... 
    Suggested

    Rival Inc

    San Francisco, CA
    10 hours ago
  • $285k - $315k

     ...Ironclad Inc. is seeking a Principal Engineer in San Francisco to drive the development of AI-powered contract solutions. The role requires over 10 years of experience in software engineering, especially in designing and evolving distributed systems. You'll collaborate... 
    Contract work

    Ironclad Inc

    San Francisco, CA
    9 hours ago
  • $184k - $230k

     ...ineligible for employment Visa sponsorship. Purpose The Principal Splunk Engineer operates as a logging architecture subject matter expert...  .... The Principal Splunk Engineer maintains the logging infrastructure for Splunk Enterprise and Cloud and other dependent... 
    Hourly pay
    Work at office
    Immediate start
    Visa sponsorship
    Work visa
    Flexible hours

    Early Warning Services

    San Francisco, CA
    1 day ago
  •  ...Senior Cloud, AI & Data Security Engineer We are seeking an enthusiastic and passionate professional for a Senior Cloud, AI & Data...  ...of collaboration, and are keen to work in a team of CI/CD, infrastructure, AI, and data specialists You are energized by the rapidly... 

    Bmo

    San Francisco, CA
    1 day ago
  • $170k - $277k

     ...Palo Alto Networks, Inc. is seeking a Principal Software Engineer in San Francisco, CA to drive technical leadership for next-generation cloud security solutions. The role involves designing, implementing, and troubleshooting high-scale distributed systems while collaborating... 

    Palo Alto Networks

    San Francisco, CA
    1 day ago
  •  ...Palo Alto Networks, Inc. is seeking a Sr. Principal Software Engineer to innovate in secure cloud environments. You will lead automation in cloud security and design cutting-edge infrastructure solutions. The ideal candidate will have extensive experience in GCP, Kubernetes... 

    Palo Alto Networks

    San Francisco, CA
    10 hours ago
  •  ...A leading AI infrastructure company in San Francisco is seeking an IC Agentic Engineering Manager to lead the development of agent-based systems for infrastructure delivery. This role involves both hands-on contributions to system design and managing a small team. Ideal... 

    AI Chopping Block, Inc.

    San Francisco, CA
    9 hours ago
  • $240k

     ...Convex is seeking experienced engineers to design and maintain its global cloud infrastructure in San Francisco. This role involves architectural decisions and collaboration with teams to improve system performance and reliability while prioritizing simplicity. The ideal... 

    Convex

    San Francisco, CA
    9 hours ago
  •  ...Jack & Jill is looking for a Principal Software Engineer to join their team in San Francisco. In this role, you will architect and build secure embedded finance products using Java. You’ll work closely with a seasoned team to shape a high-scale platform and innovate on... 

    Jack & Jill

    San Francisco, CA
    9 hours ago
  • Health Universe, Inc. is seeking a Principal Engineer to enhance their platform that revolutionizes science and medicine. This role focuses on developing a web application that supports health data scientists in deploying cutting-edge ML apps while ensuring compliance... 

    Health Universe, Inc.

    San Francisco, CA
    2 days ago
  • Principal Engineer, AI Platform & Infrastructure About the Role SPREEAI is building the future of AI-powered commerce through photorealistic virtual try‑on and multimodal intelligence. We bring together cutting‑edge AI and real‑world retail to deliver production systems... 

    SpreeAI

    San Francisco, CA
    1 day ago
  •  ...Abby Care is seeking a Principal Engineer to lead the technical direction of its platform in San Francisco. This full-time role demands over 10 years of experience in architecture for large-scale systems, with a focus on scalable AI-driven workflows. The successful candidate... 
    Full time

    Abby Care

    San Francisco, CA
    9 hours ago
  • $261k - $326k

     ...A technology company specializing in AI infrastructure is seeking a Principal Engineer to enhance reliability and scalability of cloud systems. This role demands over 15 years of experience in production engineering or related fields and involves setting technical directions... 

    Crusoe

    San Francisco, CA
    9 hours ago
  • $275k - $300k

     ...Snorkel AI is seeking a Principal Software Engineer to shape product and technical systems to meet today's AI challenges. The role demands 12+ years of experience in software engineering, focusing on building scalable AI data solutions for enterprise clients. This position... 

    jobs.frontdoordefense.com - Jobboard

    San Francisco, CA
    10 hours ago
  •  ...Dormont Manufacturing Co is seeking a Principal Backend Engineer in San Francisco, California, to lead backend development for the Cortex platform. The ideal candidate will have over 8 years of experience in software engineering, strong programming skills in Go and Python... 

    Dormont Manufacturing Company

    San Francisco, CA
    10 hours ago
  • $10 per hour

     ...Principal Full-Stack Engineer, Operations & Fleet Platform Las Vegas, Nevada, United States Ever imagined saying, "I helped launch the future of transportation"? We're rewriting the rules of urban mobility. At Vay, customers tap a button and a car arrives - with... 
    Live in
    Work at office
    Remote work
    Relocation
    Home office
    Relocation package

    Vay

    San Francisco, CA
    18 days ago
  •  ..., Skills, and Tools that every product engineer at MaintainX assembles to ship AI-powered...  ...the Scheduling Agent all run on it. As Principal Engineer, you will own the technical...  ...distributed architecture, or platform / infrastructure engineering. ~3+ years building ML or... 
    Summer work
    Remote work
    Flexible hours

    MaintainX

    San Francisco, CA
    3 days ago
  • $240k - $250k

     ...Saviynt Inc. is seeking a Principal Software Engineer in San Francisco, CA, to join their AI Security team. In this role, you will design and implement workflows for AI security products and develop secure, scalable software across major cloud platforms. The ideal candidate... 

    Saviynt

    San Francisco, CA
    10 hours ago
  •  ...Salesforce, Inc. is seeking a Principal Software Engineer for their Platform Security team in San Francisco, California. This role involves...  ...software development experience, strong knowledge in security infrastructure, and proven leadership capabilities. The... 

    Salesforce

    San Francisco, CA
    1 day ago
  • $175k - $250k

     ...Principal Security & Infrastructure Engineer Emeryville, California, United States; Hybrid (2-3 days on-site) Profluent is an AI-first protein design company. Founded in 2022, we develop deep generative models to design and validate novel, functional proteins to... 
    Remote work

    Profluent

    Emeryville, CA
    3 days ago
  •  ...10 + years of experience in software engineering, with a significant focus on designing...  ...in cloud environments , 5 + years in a Principal or Lead Engineer role, with a proven track...  ..., (Desirable) Experience in AI/ML Infrastructure: Direct experience building or operating... 
    Flexible hours

    SambaNova Systems

    San Francisco, CA
    4 days ago
  • $240k - $250k

     ...Saviynt in San Francisco is hiring a Principal Software Engineer to lead the development of AI security products. With over 10 years of software engineering experience required, you will design, implement, and release end-to-end workflows across cloud platforms like AWS... 

    Saviynt

    San Francisco, CA
    1 day ago
  •  ...B Capital is looking for a Principal Software Engineer for the Data Platform. In this high-impact role, you will be responsible for architecting...  ...You will lead the design and implementation of scalable infrastructures while mentoring engineers. The ideal candidate has over... 

    B Capital

    San Francisco, CA
    10 hours ago
  • $280k - $350k

     ...Staff / Principal Platform Engineer Join our team and take end-to-end ownership of building, securing, and scaling our AI products. You'll be the driving force behind our cloud infrastructure, partnering with engineers across the organization to deploy and evolve services... 
    Full time
    Work at office
    Remote work
    Relocation

    Inworld AI

    San Francisco, CA
    7 days ago
  • $144k - $240k

     ...Lila Sciences is seeking a Sr Principal / Principal Software Engineer to join their innovative team in San Francisco, CA. You will design and build AI-driven applications, focusing on performance, reliability, and cross-functional collaboration with scientists. Ideal candidates... 
    Flexible hours

    Jobr

    San Francisco, CA
    9 hours ago
  • $347k

     ...transformative technologies, and engaging a robust security culture. About the Role OpenAI is seeking a Principal Security Engineer to join our Infrastructure Security (InfraSec) team. InfraSec protects the foundations of OpenAI's research and production... 

    OpenAI

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Infrastructure Engineer. Be the first to apply!