Average salary: $133,824 per year

  •  ...to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety... 
    Suggested
    Full time
    Work at office
    Relocation package

    OpenAI

    San Francisco, CA
    11 hours ago
  •  ...they integrate with infrastructure for model training, fine-tuning, and inference. Hands-on experience working with distributed systems such as Ray, Spark, or Kubernetes. Familiarity with cloud services (AWS, GCP, Azure) including compute and storage (e.g., EC2, GKE... 
    Suggested
    Full time
    Worldwide

    LanceDB

    San Francisco, CA
    11 hours ago
  •  ...or VAPI technologies. ~ Strong engineering fundamentals with hands-on experience designing, building, or scaling production systems. ~ Excellent communication and interpersonal skills — able to explain complex technical concepts clearly to non-technical audiences... 
    Suggested
    Full time

    Searchability NS&D

    San Francisco, CA
    17 days ago
  • Director of Site Acquisition – Hyperscale Infrastructure | Dallas, TX or San Francisco, CA Confidential Infrastructure Developer is pioneering the future of AI and high-performance computing by delivering ultra-efficient data centers across North America. As part of...
    Suggested
    Remote work

    Blue Signal Search

    San Francisco, CA
    3 days ago
  •  ...building the core of our product: the Agent. We're hiring a Technical Lead who is deeply technical and can architect cutting-edge AI systems while remaining hands-on with implementation. You'll guide the technical direction of our platform while mentoring senior engineers... 
    Suggested
    Full time
    Remote work
    Flexible hours

    Sema4.ai

    San Francisco, CA
    11 hours ago
  •  ...power embedded hardware. Adapt and compress larger ML models to fit power, memory, and latency constraints of real-time wearable systems. Own the full ML development cycle: system design, data collection & curation, synthetic data generation, model training &... 
    Suggested
    Full time
    Contract work
    Flexible hours

    Sesame

    San Francisco, CA
    11 hours ago
  •  ...future where computers truly come alive. About the Role We are seeking an engineer living at the intersection of embedded systems and ML to enable rich, reliable interactions on wearable devices. The ideal candidate will be comfortable working across the software... 
    Suggested
    Full time
    Contract work
    Flexible hours

    Sesame

    San Francisco, CA
    11 hours ago
  • $183k - $210k

     ...SmartNICs, BlueField devices, and TPUs What You’ll Bring to the Team: ~ 5+ years of professional experience in Compute SRE, Linux system engineering, or compute infrastructure roles. ~ Strong proficiency in Linux kernel internals, with exposure to scheduler, memory... 
    Suggested
    Full time
    Temporary work

    Crusoe

    San Francisco, CA
    11 hours ago
  • $175k - $250k

     ...The Site Reliability Engineering (SRE) team ensures the WorkOS platform remains fast, reliable, and resilient at scale. We build the systems and practices that keep everything running smoothly—handling hundreds of millions of requests, minimizing downtime, and... 
    Suggested
    Remote job
    Full time

    WorkOS

    San Francisco, CA
    11 hours ago
  •  ...rapidly and expanding adoption across the entire healthcare industry. What You’ll Do You’ll be the go-to expert for keeping our systems fast, stable, and resilient. While your primary mission is reliability, you’ll also help shape the infrastructure, CI/CD, and... 
    Suggested
    Work at office

    Assort Health

    San Francisco, CA
    11 hours ago
  • $162k - $191k

     ...perspectives and lived experiences. Checkr believes in hiring people of all backgrounds, including those whose histories are impacted by the justice system in accordance with local, state, and/or federal laws, including San Francisco’s Fair Chance Ordinance. ... 
    Suggested
    Full time
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours
    2 days per week
    3 days per week

    Checkr

    San Francisco, CA
    11 hours ago
  •  ...that any developer or data scientist can scale an ML application from their laptop to the cluster without needing to be a distributed systems expert. Proud to be backed by Andreessen Horowitz, NEA, and Addition with $250+ million raised to date. About the role:... 
    Suggested
    Work experience placement
    Work at office
    Flexible hours

    Anyscale

    San Francisco, CA
    11 hours ago
  • $160k - $250k

     ...our San Francisco, Seattle, and Delhi offices. Please reach out if you are interested in joining the future of AI! DevOps and Systems Team Our unique machine learning needs led us to open our own data centers, with an emphasis on distributed high performance computing... 
    Suggested
    Full time

    Hive

    San Francisco, CA
    11 hours ago
  •  ...building something new from the ground up, come join us! THE ROLE As a Site Reliability Engineer, you'll envision and build robust systems and processes that ensure our infrastructure is scalable, reliable, and efficient. This can range from automating deployments and... 
    Suggested
    Full time
    Work experience placement

    Baseten

    San Francisco, CA
    11 hours ago
  • $154k - $191k

     ...maintainability of our technology platform. This role bridges the gap between application development and infrastructure, ensuring systems are robust, observable, and easy to maintain. A key focus will be leveraging Generative AI to standardize engineering processes, improve... 
    Suggested
    Full time
    Local area
    Immediate start
    Flexible hours

    Prosper

    San Francisco, CA
    11 hours ago
  • $170k - $230k

     ...’ll be at the forefront of building the infrastructure that powers the future of AI. Your role is critical—not just in scaling our systems, but in ensuring they are reliable and secure at every level. You will help Mithril build and operate solutions that harvest compute... 
    Full time
    Work at office
    Local area
    Flexible hours
    1 day per week

    Mithril

    San Francisco, CA
    11 hours ago
  •  ...Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance. What You’ll Do Operate and maintain bare-... 
    Remote job
    Full time
    Work at office
    Local area
    Work from home
    Flexible hours

    Lambda

    San Francisco, CA
    11 hours ago
  • $97k - $125k

     ...platform. This entry-level position is designed to bridge the gap between application development and infrastructure, ensuring our systems are robust, observable, and easy to maintain. You will also contribute to optimizing deployment workflows, observability practices,... 
    Full time
    Local area
    Immediate start
    Flexible hours

    Prosper

    San Francisco, CA
    11 hours ago
  • $155k - $224k

     ...home day is currently Tuesday. What You’ll Do ~ Define Fleet Health metrics and indicators to objectively measure and improve system availability ~ Collaborate with the observability team on comprehensive monitoring and alerting systems to proactively predict,... 
    Full time
    Work at office
    Local area
    Work from home
    Flexible hours

    Lambda

    San Francisco, CA
    11 hours ago
  • $150k - $250k

     ...Develop and design a better dev experience ~ Improve our observability stack and its usability ~ Automate and optimise our delivery system, infrastructure provisioning, etc. ~ Help implement and teach best practices for software development and monitoring... 
    Full time

    Loft Orbital Solutions

    San Francisco, CA
    11 hours ago
  • $255k

     ...running new, cutting-edge models across tens of thousands of GPUs ~ Help build a high-throughput, low-latency API and routing system running at geographically distributed scale ~ Shape a highly reliable distributed system with a focus on reducing operational... 
    Full time
    Work at office
    Local area
    Work from home
    Flexible hours
    Shift work

    Lambda

    San Francisco, CA
    11 hours ago
  • $175k - $250k

     ...of our legal AI platform. You’ll join a high-leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission-critical operations, your... 
    Full time
    Relocation package

    Harvey

    San Francisco, CA
    11 hours ago
  • $1,500 per month

     ...coordination. ~ Excellent communication and time management skills. ~ Ability to design and implement highly available, reliable systems. Nice to have Experience in game development and game server hosting, ensuring high-performance and scalable... 
    Remote job
    Full time
    Flexible hours

    Argus Labs

    San Francisco, CA
    11 hours ago
  • $165k - $250k

     ...enables our rapid product development and guarantees 99.9%+ stability and performance of our clinical AI platform for major health systems. Your focus on operational excellence is directly tied to a patient's access to life-saving treatment. What We Look for in a... 
    Work at office

    Latent

    San Francisco, CA
    11 hours ago
  • $50 per hour

     ...operational excellence of our critical infrastructure. We are dedicated to building and maintaining highly available and resilient systems that power Crusoe's innovative solutions. SREs at Crusoe play a crucial role in detecting, analyzing, and preventing issues that may... 
    Full time
    Temporary work
    Work experience placement

    Crusoe

    San Francisco, CA
    11 hours ago
  • $130k - $175k

     ...are seeking a highly skilled and motivated Site Reliability Engineer to collect requirements, design & implement highly available systems & solutions, coordinate work across multiple teams, drive improvements to existing systems, introduce automation, integrations, and... 
    Full time
    Casual work
    Work at office
    Local area
    Night shift

    Redwood Materials

    San Francisco, CA
    11 hours ago
  •  ...automate, and maintain the infrastructure that powers our core platform—including data pipelines, ML workloads, and real-time analytics systems. This is a hands-on, high-impact role with visibility across the stack and the opportunity to shape the future of our... 

    Alembic

    San Francisco, CA
    11 hours ago
  •  ...scales, automates, and recovers without skipping a beat. As a Site Reliability Engineer, you’ll help us design, run, and improve the systems that power ConductorOne. Your work makes sure our customers never have to think about whether we’re up or down — we just work.... 
    Full time
    Remote work
    Flexible hours

    ConductorOne

    San Francisco, CA
    11 hours ago
  •  ...very foundation on which our users build their futures. You'll work closely with our engineering team to develop and maintain the systems that power our code sandboxes, ensuring a seamless and stable experience for our customers. This is a critical role that blends a deep... 
    Full time
    Work at office
    Work from home
    1 day per week

    Runloop

    San Francisco, CA
    11 hours ago
  • $150k - $250k

     ...Interactive Systems Developer Location: San Francisco Bay Area (Hybrid or Onsite) Employment Type: Full-Time Compensation: $150,000 – $250,000 base + equity Tech Stack: C++, Python, React, TypeScript About the Work We’re building autonomous surgical... 
    Full time

    Connect Staffing and Consulting

    San Francisco, CA
    a month ago