Average salary: $133,824 /yearly
More statsGet new jobs by email
- ...to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety...SuggestedFull timeWork at officeRelocation package
- ...they integrate with infrastructure for model training, fine-tuning, and inference. Hands-on experience working with distributed systems such as Ray, Spark, or Kubernetes. Familiarity with cloud services (AWS, GCP, Azure) including compute and storage (e.g., EC2, GKE...SuggestedFull timeWorldwide
- ...or VAPI technologies. ~ Strong engineering fundamentals with hands-on experience designing, building, or scaling production systems. ~ Excellent communication and interpersonal skills — able to explain complex technical concepts clearly to non-technical audiences...SuggestedFull time
- Director of Site Acquisition – Hyperscale Infrastructure | Dallas, TX or San Francisco, CA Confidential Infrastructure Developer is pioneering the future of AI and high-performance computing by delivering ultra-efficient data centers across North America. As part of...SuggestedRemote work
- ...building the core of our product: the Agent. We're hiring a Technical Lead who is deeply technical and can architect cutting-edge AI systems while remaining hands-on with implementation. You'll guide the technical direction of our platform while mentoring senior engineers...SuggestedFull timeRemote workFlexible hours
- ...power embedded hardware. Adapt and compress larger ML models to fit power, memory, and latency constraints of real-time wearable systems. Own the full ML development cycle: system design, data collection & curation, synthetic data generation, model training &...SuggestedFull timeContract workFlexible hours
- ...future where computers truly come alive. About the Role We are seeking an engineer living at the intersection of embedded systems and ML to enable rich, reliable interactions on wearable devices. The ideal candidate will be comfortable working across the software...SuggestedFull timeContract workFlexible hours
$183k - $210k
...SmartNICs, BlueField devices, and TPUs What You’ll Bring to the Team: ~5+ years of professional experience in Compute SRE, Linux system engineering, or compute infrastructure roles. ~ Strong proficiency in Linux kernel internals, with exposure to scheduler, memory...SuggestedFull timeTemporary work$175k - $250k
...The Site Reliability Engineering (SRE) team ensures the WorkOS platform remains fast, reliable, and resilient at scale. We build the systems and practices that keep everything running smoothly—handling hundreds of millions of requests, minimizing downtime, and...SuggestedRemote jobFull time- ...rapidly and expanding adoption across the entire healthcare industry. What You’ll Do You’ll be the go-to expert for keeping our systems fast, stable, and resilient. While your primary mission is reliability, you’ll also help shape the infrastructure, CI/CD, and...SuggestedWork at office
$162k - $191k
...perspectives and lived experiences. Checkr believes in hiring people of all backgrounds, including those whose histories are impacted by the justice system in accordance with local, state, and/or federal laws, including the San Francisco’s Fair Chance Ordinance . #LI-TD1...SuggestedFull timeWork at officeLocal areaRemote workHome officeFlexible hours2 days per week3 days per week- ...that any developer or data scientist can scale an ML application from their laptop to the cluster without needing to be a distributed systems expert. Proud to be backed by Andreessen Horowitz, NEA, and Addition with $250+ million raised to date. About the role:...SuggestedWork experience placementWork at officeFlexible hours
$160k - $250k
...our San Francisco, Seattle, and Delhi offices. Please reach out if you are interested in joining the future of AI! DevOps and Systems Team Our unique machine learning needs led us to open our own data centers, with an emphasis on distributed high performance computing...SuggestedFull time- ...building something new from the ground up, come join us! THE ROLE As a Site Reliability Engineer, you'll envision and build robust systems and processes that ensure our infrastructure is scalable, reliable, and efficient. This can range from automating deployments and...SuggestedFull timeWork experience placement
$154k - $191k
...maintainability of our technology platform. This role bridges the gap between application development and infrastructure, ensuring systems are robust, observable, and easy to maintain. A key focus will be leveraging Generative AI to standardize engineering processes, improve...SuggestedFull timeLocal areaImmediate startFlexible hours$170k - $230k
...’ll be at the forefront of building the infrastructure that powers the future of AI. Your role is critical—not just in scaling our systems, but in ensuring they are reliable and secure at every level. You will help Mithril build and operate solutions that harvest compute...Full timeWork at officeLocal areaFlexible hours1 day per week- ...Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance. What You’ll Do Operate and maintain bare-...Remote jobFull timeWork at officeLocal areaWork from homeFlexible hours
$97k - $125k
...platform. This entry-level position is designed to bridge the gap between application development and infrastructure, ensuring our systems are robust, observable, and easy to maintain. You will also contribute to optimizing deployment workflows, observability practices,...Full timeLocal areaImmediate startFlexible hours$155k - $224k
...home day is currently Tuesday. What You’ll Do Define Fleet Health metrics and indicators to objectively measure and improve system availability Collaborate with the observability team on comprehensive monitoring and alerting systems to proactively predict,...Full timeWork at officeLocal areaWork from homeFlexible hours$150k - $250k
...Develop and design a better dev experience Improve our observability stack and its usability Automate and optimise our delivery system, infrastructure provisioning etc Help implementing and educating over best practices for software development and monitoring...Full time$255k
...running new, cutting-edge models across tens of thousands of GPUs Help build a high-throughput, low-latency API and routing system running at geographically-distributed scale Shape a highly reliable distributed system with a focus on reducing operational...Full timeWork at officeLocal areaWork from homeFlexible hoursShift work$175k - $250k
...of our legal AI platform. You’ll join a high-leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission-critical operations, your...Full timeRelocation package$1,500 per month
...coordination. ~ Excellent communication and time management skills. ~ Ability to design and implement highly available, reliable systems. Nice to have Experience in game development and game server hosting, ensuring high-performance and scalable...Remote jobFull timeFlexible hours$165k - $250k
...enables our rapid product development and guarantees 99.9%+ stability and performance of our clinical AI platform for major health systems. Your focus on operational excellence is directly tied to a patient's access to life-saving treatment. What We Look for in a...Work at office$50 per hour
...operational excellence of our critical infrastructure. We are dedicated to building and maintaining highly available and resilient systems that power Crusoe's innovative solutions. SREs at Crusoe play a crucial role in detecting, analyzing, and preventing issues that may...Full timeTemporary workWork experience placement$130k - $175k
...are seeking a highly skilled and motivated Site Reliability Engineer to collect requirements, design & implement highly available systems & solutions, coordinate work across multiple teams, drive improvements to existing systems, introduce automation, integrations, and...Full timeCasual workWork at officeLocal areaNight shift- ...automate, and maintain the infrastructure that powers our core platform—including data pipelines, ML workloads, and real-time analytics systems. This is a hands-on, high-impact role with visibility across the stack and the opportunity to shape the future of our...
- ...scales, automates, and recovers without skipping a beat. As a Site Reliability Engineer, you’ll help us design, run, and improve the systems that power ConductorOne. Your work makes sure our customers never have to think about whether we’re up or down — we just work....Full timeRemote workFlexible hours
- ...very foundation on which our users build their futures. You'll work closely with our engineering team to develop and maintain the systems that power our code sandboxes, ensuring a seamless and stable experience for our customers. This is a critical role that blends a deep...Full timeWork at officeWork from home1 day per week
$150k - $250k
...Interactive Systems Developer Location: San Francisco Bay Area (Hybrid or Onsite) Employment Type: Full-Time Compensation: $150,000 – $250,000 base + equity Tech Stack: C++, Python, React, TypeScript About the Work We’re building autonomous surgical...Full time


