Software Engineer, Reliability Platforms
$159.8k - $235kFairygodboss
About the Team The Reliability Platform role is a key pillar of DoorDash's Production Lifecycle team, alongside Observability and Deploy Platform. This group's mandate is to enable users and agents to reason about the health of our services, facilitate change control safety, and provide the means to rapidly address any unexpected state. Ownership is fundamental in DoorDash culture, and all teams own what they build. We are not here to operate services on others' behalf, but to provide tools that enable their success and ensure a consistently high level of quality for everything we do. We approach challenges with the pragmatic perspective of an SRE, and deliver solutions with the mindset of a SWE who detests toil and repetitive tasks. We use software and agents to \"keep the lights on\" and focus our energy on innovation that will level up the entire organization. This mission falls into three main categories. Service Health - Providing SLO frameworks, analytics tools, and AI Agent enablement to extract high quality insights from our telemetry to pinpoint faults, or highlight deficiencies Change Orchestration - Provide self-service provisioning orchestration, evolving from UI to Agent-driven to allow our developers to safely affect production from their IDE Incident Management - Define and deliver tools/processes/policies leveraged by our peers to quickly understand and recover from any unexpected issues in the environment This mandate implies a broad contribution across many aspects of the infrastructure, and demands equal parts software development and systems integration. Our priorities are always informed by an obsession to level up over 4,000 internal customers/peers, and obfuscate infrastructure complexity so they can focus on making the DoorDash product itself amazing! About the Role As a Software Engineer on the Reliability Platform team, you'll help design, build, and operate services and infrastructure that deliver on the team's broad mandate described above. This team has a unique opportunity for breadth, often in collaboration with expert peers across the Infrastructure and Product teams. Depending on need and interest, you may be working on mission-critical back-end services or pipelines, complex orchestration workflows, self-service UI, or AI Agent continuous improvements. We have fully embraced the use of AI tools in everything we do, and believe in the incredible potential this provides while remaining pragmatic enough to ensure the critical infrastructure we maintain cannot be compromised. Our goal is to deliver innovative next generation capabilities, as well as make data in our custody available to others pursuing the same. A few examples of efforts the team has owned in recent years: Delivering framework to capture/alert/report on SLO quality across tens of thousands of endpoints ensuring all teams are accountable for the quality of their delivered services Replacement of our escalation management tools including alignment with our internal Asset/Team Catalog to allow automated alert routing and cross-brand alignment Delivery of MCP back-end for Reliability Platform data/tools, as well as enabling the same for peer teams across the Core Infrastructure organization Design and delivered orchestration tools to enable self-service provisioning of critical infrastructure (Kafka topics, Databases, CPU/GPU Pools, Service Scaffolding, etc) PoC for internal SRE AI Agentic tooling leveraging internal MCPs and domain specific profiles to facilitate troubleshooting and Q&A capabilities replacing FAQs/Runbooks Delivered per-pod realtime configuration key-value tooling enabling runtime feature flag management from a central source of truth across the fleet (100K+ pods) We are proud of our engineering culture, and many of our greatest successes are born from an individual with an idea spending some time hacking out a rudimentary demonstrable prototype. The mandate of this team is ripe for individuals with this creative pioneering mindset, and the ability to execute. You're excited about this opportunity because you will... Delivery Innovative Capabilities: You don't want to 'turn the crank' somewhere, but you want to contribute to some frontier thinking and help us push the industry forward Build Great Infrastructure: You know great infrastructure often goes unnoticed by design. You are content knowing your efforts allow you to claim a portion of everyone's success. Balance Practical and Possible: Sometimes our pragmatic perspective is needed to maintain a high quality service; your experience will support finding the right risk balance Be Custom Obsessed: We want to learn from our customers to ensure we are solving the right challenges, and also share our perspective to influence in areas of expertise Automate Everything: Well... not everything... but if your first instinct is to ask how this toil could be automated or better yet avoided then you're on the right team Shape the Future of Operations: Experiment with agentic, AI-assisted workflows that can propose, validate, and safely execute production changes - moving DoorDash toward proactive, self-healing systems in step with industry first movers. We're excited about you because you have... Platform Engineering Mindset: You think in terms of APIs, abstractions, and workflows - you enjoy building systems that other engineers depend on every day. Proven Experience: You have 5+ years of experience in an infrastructure, platform, or backend engineering role, showing you can deliver and maintain complex systems. Backend Development Skills: You're fluent in Go (or a similar language) and can design and deliver mission-critical services that are scalable, resilient, performant, and efficient. Cloud/Infra Expertise: You're comfortable with AWS primitives, security best practices, containerization, and Infrastructure as Code tools like Terraform or Pulumi. SRE Experience: You understand concepts like SLOs, error budgets, and incident response though this is a platform development team, not an SRE/oncall team. Flexibility: You will work on cool/fun stuff the majority of your time, but also accept that some tasks just need to get done and reflect an opportunity for automation/improvement AI Alignement: You embrace the use of AI tools to be a more productive and capable engineer. This applies to coding, planning, supporting peers, and everything you do. Curiosity About the Future: You're excited about automation and agentic, AI-assisted operations and want to help shape how engineers interact with production systems. Compensation The successful candidate's starting pay will fall within the pay range listed below and is determined based on job-related factors including, but not limited to, skills, experience, qualifications, work location, and market conditions. Base salary is localized according to an employee's work location. Ranges are market-dependent and may be modified in the future. In addition to base salary, the compensation for this role includes opportunities for equity grants. Talk to your recruiter for more information. DoorDash cares about you and your overall well-being. That's why we offer a comprehensive benefits package to all regular employees, which includes a 401(k) plan with employer matching, 16 weeks of paid parental leave, wellness benefits, commuter benefits match, paid time off and paid sick leave in compliance with applicable laws (e.g. Colorado Healthy Families and Workplaces Act). DoorDash also offers medical, dental, and vision benefits, 11 paid holidays, disability and basic life insurance, family-forming assistance, and a mental health program, among others. See below for paid time off details: For salaried roles: flexible paid time off/vacation, plus 80 hours of paid sick time per year. For hourly roles: vacation accrued at about 1 hour for every 25.97 hours worked (e.g. about 6.7 hours/month if working 40 hours/week; about 3.4 hours/month if working 20 hours/week), and paid sick time accrued at 1 hour for every 30 hours worked (e.g. about 5.8 hours/month if working 40 hours/week; about 2.9 hours/month if working 20 hours/week). The national base pay range for this position within the United States, including Illinois and Colorado.
$159,800-$235,000 USD
Statement of Non-Discrimination In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment based on: race, color, ancestry, national origin, religion, age, gender, marital/domestic partner status, sexual orientation, gender identity or expression, disability status, or veteran status. Above and beyond discrimination and harassment based on \"protected categories\", we also strive to prevent other subtler forms of inappropriate behavior (i.e., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place at DoorDash. We value a diverse workforce - people who identify as women, non-binary or gender non‑conforming, LGBTQIA+, American Indian or Native Alaskan, Black or African American, Hispanic or Latinx, Native Hawaiian or Other Pacific Islander, differently‑abled, caretakers and parents, and veterans are strongly encouraged to apply. Pursuant to the San Francisco Fair Chance Ordinance, Los Angeles Fair Chance Initiative for Hiring Ordinance, and any other state or local hiring regulations, we will consider for employment any qualified applicant, including those with arrest and conviction records, in a manner consistent with the applicable regulation. Statement of Non-Discrimination: In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment based on: race, color, ancestry, national origin, religion, age, gender, marital/domestic partner status, sexual orientation, gender identity or expression, disability status, or veteran status. Above and beyond discrimination and harassment based on \"protected categories\", we also strive to prevent other subtler forms of inappropriate behavior (i.e., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place at DoorDash. We value a diverse workforce - people who identify as women, non-binary or gender non‑conforming, LGBTQIA+, American Indian or Native Alaskan, Black or African American, Hispanic or Latinx, Native Hawaiian or Other Pacific Islander, differently‑abled, caretakers and parents, and veterans are strongly encouraged to apply. Thank you to the Level Playing Field Institute for this statement of non‑discrimination. #J-18808-Ljbffr Fairygodboss- ...technology firm based in San Francisco is seeking a DevOps Engineer to enhance the reliability of their production systems. You will collaborate with... ...with strong knowledge in observability stacks and cloud platforms. Join us in our mission to revolutionize hardware design...Suggested
- ...dynamic healthcare technology company is seeking a Senior DevOps Engineer to enhance their infrastructure and deployment systems.... ...cloud infrastructure, maintaining CI/CD pipelines, and ensuring platform reliability. Candidates should have over 5 years of experience in B2B...SuggestedRemote jobFlexible hours
- 53 Stations is seeking a DevOps Engineer to enhance the systems powering Flux's platform. You’ll tackle operations from billing to onboarding while ensuring high system reliability and performance. With a focus on collaboration and ownership, you will develop internal...Suggested
- A healthcare technology company is looking for an Infrastructure Engineer to ensure reliability, security, and performance of their platform. You will oversee infrastructure health and make crucial architectural decisions while supporting product teams. The ideal candidate...SuggestedRemote job
- ...edge AI startup in San Francisco is seeking a Senior Infrastructure Engineer to build platforms for AI agents. Your role will involve creating systems that other engineers rely on, ensuring reliability and fast deployment. You'll work with technologies like Python, AWS,...Suggested
$193.8k - $285k
About the Team The Reliability Platform role is a key pillar of DoorDash’s Production Lifecycle team... ...toil and repetitive tasks. We use software and agents to “keep the lights on” and... ...amazing! About the Role As a Software Engineer on the Reliability Platform team, you’...Hourly payFlexible hours- ...distributed computing and make it accessible to software developers of all skill levels. We’re... ...Anyscale is looking for a Senior Site Reliability Engineer to join the Infrastructure team.... ...that powers Anyscale’s cloud platform. You will have the opportunity to work...
$232k - $319k
...too, let's talk. The Infrastructure Platform and Shared Services Team Okta authenticates... ...scale the service with great people and reliable, cost-effective, and efficient... ...Accelerate the velocity of SRE and product engineering by developing robust platforms, powerful...Permanent employmentLocal areaWorldwideFlexible hours- LendingClub is seeking a Sr Manager of Database Engineering in San Francisco to define and own the database engineering strategy. This... ...involves leading a team of Database Engineers and ensuring the reliability and performance of the database systems. The ideal candidate...Work at office3 days per week
$157.7k - $277.8k
...Full time Location Type Hybrid Department Engineering, product & design Compensation SF & NYC... .... With WRITER's end-to-end platform, hundreds of companies like Mars, Marriott... ...platform must be available, performant, and reliable, 24/7. As an Infrastructure engineer, you...Full timeWork at officeLocal areaFlexible hours$180k - $280k
...the Role As an ML infrastructure and reliability engineer, you will join the team responsible for... ...building and maintaining TypeSafe’s API platform for inference. These APIs will be user... .... Have 5+ years of professional software engineering experience (3+ years of infra...Visa sponsorship- ...At Sierra, we’re creating a platform to help businesses build better... ...is an evolution of software development that needs new tools... ...requires a high bar on security, reliability, and performance. What you'... .... Experience with data engineering, MLOps, and LLMOps. Comfortable...Full timeFlexible hours
$130.9k - $198k
...Connected Operations™ Cloud, which is a platform that enables organizations that depend... ...content that wins deals. As a Senior Software Engineer, AI Platform, you’ll lead the design... ...experiences. You’ll focus on building scalable, reliable systems that enable multi-step AI...Full timeContract workInternshipRemote workFlexible hours$194k - $239k
...Senior Software Engineer, Infrastructure Hover helps people design, improve, and protect the properties... ...problems while improving the platform that engineering teams build on. You will... ...to deploy, easier to operate, and more reliable in production. You’ll collaborate with...Full timeFor contractorsWork at officeLocal areaFlexible hours- ...emissions and decarbonize their business. We’re looking for software engineers to help build the AI platform that powers our agents product. You’ll be a technical... ..., and tooling that let our product teams ship reliable, observable AI features on top of a wealth of operational...Work at officeRemote work
- ...some frequently asked questions about our Engineering team. You can also learn more about us on our engineering careers page. Our Platform Engineers Design, build, and maintain... ...Postgres databases to ensure performance and reliability. Manage and optimize our cloud...Work at officeRemote workFlexible hoursShift work
$136k - $170k
...information via a revolutionary cloud-based platform to authoritative figures in commercial... ..., manufacturing, data processing, and software engineering, our office is a truly inspiring mix... ...enables Planet's engineering teams to reliably deploy and scale their services with...Full timeTemporary workWork at officeLocal areaRemote workHome office3 days per week- ...five days a week in our new San Francisco headquarters. About the Role As a Backend Engineer at Mercor, you will build core platform systems that make our operations secure, reliable, and scalable. This role focuses on two critical areas: Contracts and Identity & Access...Contract workWork at officeRelocation packageFlexible hours
$200k - $265k
...The Role As a Senior Software Engineer on the Platform team, you will design, build, and maintain the core infrastructure that empowers healthcare... ...long-term success, investing in platform observability, reliability, and scalability. You'll own the entire lifecycle of major...Remote workFlexible hours- ...safely. CodeIntegrity is the platform security teams use to control... ...we are a small team, mostly engineers, shipping fast. We are building... ...that make agent activity reliable and understandable. Own production... ...and turn it into working software. You write code that is clear...
- ...and ambitious. We value craft, intellectual rigor, and direct communication. About the role Platform engineers build the systems that let Lassie’s agents do real work reliably: integrations, orchestration, data infrastructure, observability, and the foundations behind...Work at office
$230k - $265k
...financial tools they need through the platforms they already sell on. We partner... ...The Position We're looking for a software engineer to join Parafin's Infrastructure team... .... This role is critical to building reliable, scalable, and developer-friendly systems...Work from homeFlexible hours- ...that powers the next generation of AI agents across our platform. As a Software Engineer on our Agentic Infrastructure team, you'll be at forefront... ...'s platform Design evaluation, observability, & reliability frameworks that ensure agent behavior is safe, auditable...Full timeFreelanceInternshipWork at officeRemote workFlexible hours
$220k - $300k
...on placing the best product managers, software, and hardware talent at innovative companies... ...to help them hire. Senior Software Engineer, Platform Location: San Francisco, CA (... ...include: Building highly reliable distributed systems Scaling document...Work at officeRemote workVisa sponsorship$196k - $245k
...community of millions of daily active users who use the platform for many different reasons, but there’s one... ...to production while ensuring Discord remains reliable, efficient, and scalable. As a Senior Software Engineer on these teams, you will continuously improve our...Full timeRelocationRelocation package$130k - $400k
...drive long-term success for both clients and candidates. Software Engineer, Platform Location: San Francisco or New York City Company... ...Translate complex operational requirements into scalable, reliable backend services Improve system reliability, security,...Work at officeRemote workRelocation packageFlexible hours$190k - $235k
...Engineering Manager Hover is looking for an Engineering Manager... ...intersection of DevOps, internal platform development, security... ...ship code safely, quickly, and reliably, even as our technical complexity... ...Includes ~ A strong software engineering foundation with...Full timeWork at officeLocal areaFlexible hours$196k - $245k
...community of millions of daily active users who use the platform for many different reasons, but there's one... ...to production while ensuring Discord remains reliable, efficient, and scalable. As a Senior Software Engineer on these teams, you will continuously improve our...Full timeRelocationRelocation package$150k - $270k
...Who We Need Experienced engineers who've made infrastructure a competitive advantage — not just reliable plumbing. You've owned the systems that let product teams... ...industry-defining accounting and finance software. Our platform needs to scale with us, and we want an...Full timeWork at officeRemote workRelocationFlexible hours$196k - $220.5k
...nearly everyone does on our platform: play video games. Over 90% of... ...systems support hundreds of engineers daily, process thousands of builds... ...and platforms. As a Senior Software Engineer on this team, you... ...development at Discord fast, safe, and reliable. Becoming an expert in...Full timeRelocationRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Software Engineer, Reliability Platforms. Be the first to apply!
- software sales engineer San Francisco, CA
- software engineer amazon San Francisco, CA
- software engineer student San Francisco, CA
- agile software developer San Francisco, CA
- rust software engineer San Francisco, CA
- software developer positions San Francisco, CA
- senior software design engineer San Francisco, CA
- software developer San Francisco, CA
- ngo software engineer San Francisco, CA
- startup software engineer San Francisco, CA

