Software Engineer, Reliability Platforms
DoorDash USA
About the Team

Reliability Platform is a key pillar of DoorDash’s Production Lifecycle team, alongside Observability and Deploy Platform. This group’s mandate is to enable users and agents to reason about the health of our services, facilitate change control safety, and provide the means to rapidly address any unexpected state. Ownership is fundamental in DoorDash culture, and all teams own what they build. We are not here to operate services on others’ behalf, but to provide tools that enable their success and ensure a consistently high level of quality for everything we do. We approach challenges with the pragmatic perspective of an SRE, and deliver solutions with the mindset of a SWE who detests toil and repetitive tasks. We use software and agents to “keep the lights on” and focus our energy on innovation that will level up the entire organization.

This mission falls into three main categories:

- Service Health: Provide SLO frameworks, analytics tools, and AI Agent enablement to extract high-quality insights from our telemetry, pinpoint faults, and highlight deficiencies
- Change Orchestration: Provide self-service provisioning orchestration, evolving from UI-driven to Agent-driven, so our developers can safely affect production from their IDE
- Incident Management: Define and deliver the tools, processes, and policies our peers use to quickly understand and recover from any unexpected issues in the environment

This mandate implies a broad contribution across many aspects of the infrastructure, and demands equal parts software development and systems integration. Our priorities are always informed by an obsession to level up over 4,000 internal customers/peers, and to hide infrastructure complexity so they can focus on making the DoorDash product itself amazing!
About the Role

As a Software Engineer on the Reliability Platform team, you’ll help design, build, and operate services and infrastructure that deliver on the team’s broad mandate described above. This team has a unique opportunity for breadth, often in collaboration with expert peers across the Infrastructure and Product teams. Depending on need and interest, you may be working on mission-critical back-end services or pipelines, complex orchestration workflows, self-service UI, or AI Agent continuous improvements. We have fully embraced the use of AI tools in everything we do, and believe in the incredible potential this provides while remaining pragmatic enough to ensure the critical infrastructure we maintain cannot be compromised. Our goal is to deliver innovative next-generation capabilities, as well as make data in our custody available to others pursuing the same.

A few examples of efforts the team has owned in recent years:

- Delivered a framework to capture, alert, and report on SLO quality across tens of thousands of endpoints, ensuring all teams are accountable for the quality of their delivered services
- Replaced our escalation management tools, including alignment with our internal Asset/Team Catalog to allow automated alert routing and cross-brand alignment
- Delivered an MCP back-end for Reliability Platform data/tools, and enabled the same for peer teams across the Core Infrastructure organization
- Designed and delivered orchestration tools to enable self-service provisioning of critical infrastructure (Kafka topics, Databases, CPU/GPU Pools, Service Scaffolding, etc.)
- Built a PoC for internal SRE AI Agentic tooling leveraging internal MCPs and domain-specific profiles to facilitate troubleshooting and Q&A capabilities, replacing FAQs/Runbooks
- Delivered per-pod realtime configuration key-value tooling enabling runtime feature flag management from a central source of truth across the fleet (100K+ pods)

We are proud of our engineering culture, and many of our greatest successes are born from an individual with an idea spending some time hacking out a rudimentary demonstrable prototype. The mandate of this team is ripe for individuals with this creative pioneering mindset, and the ability to execute.

You’re excited about this opportunity because you will…

- Deliver Innovative Capabilities: You don’t want to just ‘turn the crank’ somewhere; you want to contribute to frontier thinking and help us push the industry forward.
- Build Great Infrastructure: You know great infrastructure often goes unnoticed by design. You are content knowing your efforts allow you to claim a portion of everyone’s success.
- Balance Practical and Possible: Sometimes our pragmatic perspective is needed to maintain a high-quality service; your experience will support finding the right risk balance.
- Be Customer Obsessed: We want to learn from our customers to ensure we are solving the right challenges, and also share our perspective to influence decisions in our areas of expertise.
- Automate Everything: Well… not everything… but if your first instinct is to ask how this toil could be automated, or better yet avoided, then you’re on the right team.
- Shape the Future of Operations: Experiment with agentic, AI-assisted workflows that can propose, validate, and safely execute production changes, moving DoorDash toward proactive, self-healing systems in step with industry first movers.

We’re excited about you because you have…

- Platform Engineering Mindset: You think in terms of APIs, abstractions, and workflows, and you enjoy building systems that other engineers depend on every day.
- Proven Experience: You have 5+ years of experience in an infrastructure, platform, or backend engineering role, showing you can deliver and maintain complex systems.
- Backend Development Skills: You’re fluent in Go (or a similar language) and can design and deliver mission-critical services that are scalable, resilient, performant, and efficient.
- Cloud/Infra Expertise: You’re comfortable with AWS primitives, security best practices, containerization, and Infrastructure as Code tools like Terraform or Pulumi.
- SRE Experience: You understand concepts like SLOs, error budgets, and incident response, though this is a platform development team, not an SRE/oncall team.
- Flexibility: You will work on cool/fun stuff the majority of your time, but also accept that some tasks just need to get done and reflect an opportunity for automation/improvement.
- AI Alignment: You embrace the use of AI tools to be a more productive and capable engineer. This applies to coding, planning, supporting peers, and everything you do.
- Curiosity About the Future: You’re excited about automation and agentic, AI-assisted operations and want to help shape how engineers interact with production systems.

Compensation

The successful candidate’s starting pay will fall within the pay range listed below and is determined based on job-related factors including, but not limited to, skills, experience, qualifications, work location, and market conditions. Base salary is localized according to an employee’s work location. Ranges are market-dependent and may be modified in the future.

In addition to base salary, the compensation for this role includes opportunities for equity grants. Talk to your recruiter for more information.

DoorDash cares about you and your overall well-being. That’s why we offer a comprehensive benefits package to all regular employees, which includes a 401(k) plan with employer matching, 16 weeks of paid parental leave, wellness benefits, commuter benefits match, paid time off and paid sick leave in compliance with applicable laws (e.g. Colorado Healthy Families and Workplaces Act). DoorDash also offers medical, dental, and vision benefits, 11 paid holidays, disability and basic life insurance, family-forming assistance, and a mental health program, among others. To learn more about our benefits, visit our careers page here.
See below for paid time off details:

- For salaried roles: flexible paid time off/vacation, plus 80 hours of paid sick time per year.
- For hourly roles: vacation accrued at about 1 hour for every 25.97 hours worked (e.g. about 6.7 hours/month if working 40 hours/week; about 3.4 hours/month if working 20 hours/week), and paid sick time accrued at 1 hour for every 30 hours worked (e.g. about 5.8 hours/month if working 40 hours/week; about 2.9 hours/month if working 20 hours/week).

The national base pay range for this position within the United States, including Illinois and Colorado:
$159,800—$235,000 USD
About DoorDash

At DoorDash, our mission to empower local economies shapes how our team members move quickly, learn, and reiterate in order to make impactful decisions that display empathy for our range of users, from Dashers to merchant partners to consumers. We are a technology and logistics company that started by enabling door-to-door delivery, and we are looking for team members who can help us go from a company that is known as the place you order food to a company that people turn to for any and all goods. DoorDash is growing rapidly and changing constantly, which gives our team members the opportunity to share their unique perspectives, solve new challenges, and own their careers. We're committed to supporting employees’ happiness, healthiness, and overall well-being by providing comprehensive benefits and perks including premium healthcare, wellness expense reimbursement, paid parental leave and more.

Our Commitment to Diversity and Inclusion

We’re committed to growing and empowering a more inclusive community within our company, industry, and cities. That’s why we hire and cultivate diverse teams of people from all backgrounds, experiences, and perspectives. We believe that true innovation happens when everyone has room at the table and the tools, resources, and opportunity to excel.

Statement of Non-Discrimination: In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment based on: race, color, ancestry, national origin, religion, age, gender, marital/domestic partner status, sexual orientation, gender identity or expression, disability status, or veteran status. Above and beyond discrimination and harassment based on “protected categories,” we also strive to prevent other subtler forms of inappropriate behavior (e.g., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place at DoorDash.
We value a diverse workforce. People who identify as women, non-binary or gender non-conforming, LGBTQIA+, American Indian or Native Alaskan, Black or African American, Hispanic or Latinx, Native Hawaiian or Other Pacific Islander, differently-abled, caretakers and parents, and veterans are strongly encouraged to apply. Thank you to the Level Playing Field Institute for this statement of non-discrimination.

Pursuant to the San Francisco Fair Chance Ordinance, Los Angeles Fair Chance Initiative for Hiring Ordinance, and any other state or local hiring regulations, we will consider for employment any qualified applicant, including those with arrest and conviction records, in a manner consistent with the applicable regulation.

If you need any accommodations, please inform your recruiting contact upon initial connection.

Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only

We used Covey as part of our hiring and/or promotional process for jobs in NYC and certain features may qualify it as an AEDT in NYC. As part of the hiring and/or promotion process, we provided Covey with job requirements and candidate-submitted applications. We used Covey Scout for Inbound from August 21, 2023, through December 21, 2023. We resumed using Covey Scout for Inbound on June 29, 2024, and ceased using Covey Scout for Inbound on April 30, 2026. The Covey tool has been reviewed by an independent auditor. Results of the audit may be viewed here:


