Engineering Manager, Reliability Platform

DoorDash USA

About the Team

The Reliability Platform role is a key pillar of DoorDash’s Production Lifecycle team, alongside Observability and Deploy Platform. This group’s mandate is to enable users and agents to reason about the health of our services, facilitate change control safety, and provide the means to rapidly address any unexpected state.

Ownership is fundamental in DoorDash culture, and all teams own what they build. We are not here to operate services on others’ behalf, but to provide tools that enable their success and ensure a consistently high level of quality for everything we do. We approach challenges with the pragmatic perspective of an SRE, and deliver solutions with the mindset of a SWE who detests toil and repetitive tasks. We use software and agents to “keep the lights on” and focus our energy on innovation that will level up the entire organization.

This mission falls into three main categories:
  • Service Health: Provide SLO frameworks, analytics tools, and AI Agent enablement to extract high-quality insights from our telemetry that pinpoint faults or highlight deficiencies
  • Change Orchestration: Provide self-service provisioning orchestration, evolving from UI-driven to Agent-driven, so our developers can safely affect production from their IDE
  • Incident Management: Define and deliver the tools, processes, and policies our peers use to quickly understand and recover from any unexpected issues in the environment

This mandate implies a broad contribution across many aspects of the infrastructure, and demands equal parts software development and systems integration. Our priorities are always informed by an obsession to level up over 4,000 internal customers/peers, and to abstract away infrastructure complexity so they can focus on making the DoorDash product itself amazing!

About the Role

As a Software Engineer on the Reliability Platform team, you’ll help design, build, and operate services and infrastructure that deliver on the team’s broad mandate described above. This team has a unique opportunity for breadth, often in collaboration with expert peers across the Infrastructure and Product teams. Depending on need and interest, you may be working on mission-critical back-end services or pipelines, complex orchestration workflows, self-service UI, or AI Agent continuous improvements.

We have fully embraced the use of AI tools in everything we do, and believe in the incredible potential this provides while remaining pragmatic enough to ensure the critical infrastructure we maintain cannot be compromised. Our goal is to deliver innovative next-generation capabilities, as well as make data in our custody available to others pursuing the same.

A few examples of efforts the team has owned in recent years:
  • Delivered a framework to capture, alert, and report on SLO quality across tens of thousands of endpoints, ensuring all teams are accountable for the quality of their delivered services
  • Replaced our escalation management tools, including alignment with our internal Asset/Team Catalog to allow automated alert routing and cross-brand alignment
  • Delivered an MCP back-end for Reliability Platform data/tools, and enabled the same for peer teams across the Core Infrastructure organization
  • Designed and delivered orchestration tools to enable self-service provisioning of critical infrastructure (Kafka topics, Databases, CPU/GPU Pools, Service Scaffolding, etc.)
  • Built a PoC for internal SRE AI Agentic tooling leveraging internal MCPs and domain-specific profiles to facilitate troubleshooting and Q&A capabilities, replacing FAQs/Runbooks
  • Delivered per-pod realtime configuration key-value tooling, enabling runtime feature flag management from a central source of truth across the fleet (100K+ pods)

As the leader of this team, you will take an active role across the organization to:
  • Recruit, hire, and retain world-class engineering talent, and continuously level up the team’s capabilities and outcomes
  • Manage team performance, including ongoing and annual assessment in addition to regular 1:1 alignment, coaching, and feedback
  • Establish rituals and expectations for project execution that maximize productivity while minimizing overhead, meetings, and administrative work
  • Align with European counterparts to forge a shared global culture, and align planning with a shared mission
  • Own global processes/policies for incident response, communications, and reporting. We enable our colleagues, but do not own the response itself.
  • Manage the team’s budget for Cloud Provider Infra and 3rd party vendor spend within the team’s mandate, including forecasting

We are proud of our engineering culture, and many of our greatest successes are born from an individual with an idea spending some time hacking out a rudimentary demonstrable prototype. The mandate of this team is ripe for individuals with this creative pioneering mindset, and the ability to execute.

You’re excited about this opportunity because you will…
  • Deliver Innovative Capabilities: You don’t want to ‘turn the crank’ somewhere; you want to contribute to some frontier thinking and help us push the industry forward
  • Build Great Infrastructure: You know great infrastructure often goes unnoticed by design. You are content knowing your efforts allow you to claim a portion of everyone’s success.
  • Balance Practical and Possible: Sometimes our pragmatic perspective is needed to maintain a high-quality service; your experience will support finding the right risk balance
  • Be Customer Obsessed: We want to learn from our customers to ensure we are solving the right challenges, and also share our perspective to influence in areas of expertise
  • Automate Everything: Well… not everything… but if your first instinct is to ask how this toil could be automated, or better yet avoided, then you’re on the right team
  • Shape the Future of Operations: Experiment with agentic, AI-assisted workflows that can propose, validate, and safely execute production changes, moving DoorDash toward proactive, self-healing systems in step with industry first movers.

We’re excited about you because you have…
  • Leading Teams: You have 5+ years leading teams of high calibre Engineers, and providing structure and rituals that enable the team to thrive
  • Proven Experience: You have 5+ years of experience in an infrastructure, platform, or backend engineering role, showing you can deliver and maintain complex systems through a team or as an individual contributor.
  • Platform Mindset: You think in terms of products/platforms/customers while designing systems that other engineers depend on every day.
  • Influence: You are comfortable influencing others via conversations, presentations, demonstrations, and policies.
  • Consistency: Your influence and leadership seek an outcome that will become the new best practice, to be applied consistently across the organization.
  • Cloud/Infra Fundamentals: You’re comfortable discussing topics across the infrastructure, including AWS primitives, security best practices, containerization, and Infrastructure as Code.
  • SRE Experience: You understand concepts like SLOs, error budgets, and incident response, though this is a platform development team, not an SRE/oncall team.
  • AI Alignment: You embrace the use of AI tools to be a more productive Engineering Manager, and instill the same mindset in your team.
  • Curiosity About the Future: You’re excited about automation and agentic, AI-assisted operations and want to help shape how engineers interact with production systems.

Compensation

The successful candidate’s starting pay will fall within the pay range listed below and is determined based on job-related factors including, but not limited to, skills, experience, qualifications, work location, and market conditions. Base salary is localized according to an employee’s work location. Ranges are market-dependent and may be modified in the future.

In addition to base salary, the compensation for this role includes opportunities for equity grants. Talk to your recruiter for more information.

DoorDash cares about you and your overall well-being. That’s why we offer a comprehensive benefits package to all regular employees, which includes a 401(k) plan with employer matching, 16 weeks of paid parental leave, wellness benefits, commuter benefits match, paid time off and paid sick leave in compliance with applicable laws (e.g. Colorado Healthy Families and Workplaces Act). DoorDash also offers medical, dental, and vision benefits, 11 paid holidays, disability and basic life insurance, family-forming assistance, and a mental health program, among others. To learn more about our benefits, visit our careers page here.

See below for paid time off details:
  • For salaried roles: flexible paid time off/vacation, plus 80 hours of paid sick time per year.
  • For hourly roles: vacation accrued at about 1 hour for every 25.97 hours worked (e.g. about 6.7 hours/month if working 40 hours/week; about 3.4 hours/month if working 20 hours/week), and paid sick time accrued at 1 hour for every 30 hours worked (e.g. about 5.8 hours/month if working 40 hours/week; about 2.9 hours/month if working 20 hours/week).

The national base pay range for this position within the United States, including Illinois and Colorado:

$193,800—$285,000 USD

About DoorDash

At DoorDash, our mission to empower local economies shapes how our team members move quickly, learn, and reiterate in order to make impactful decisions that display empathy for our range of users, from Dashers to merchant partners to consumers. We are a technology and logistics company that started by enabling door-to-door delivery, and we are looking for team members who can help us go from a company that is known as the place you order food to a company that people turn to for any and all goods.

DoorDash is growing rapidly and changing constantly, which gives our team members the opportunity to share their unique perspectives, solve new challenges, and own their careers. We're committed to supporting employees’ happiness, healthiness, and overall well-being by providing comprehensive benefits and perks including premium healthcare, wellness expense reimbursement, paid parental leave and more.

Our Commitment to Diversity and Inclusion

We’re committed to growing and empowering a more inclusive community within our company, industry, and cities. That’s why we hire and cultivate diverse teams of people from all backgrounds, experiences, and perspectives. We believe that true innovation happens when everyone has room at the table and the tools, resources, and opportunity to excel.

Statement of Non-Discrimination: In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment based on: race, color, ancestry, national origin, religion, age, gender, marital/domestic partner status, sexual orientation, gender identity or expression, disability status, or veteran status. Above and beyond discrimination and harassment based on “protected categories,” we also strive to prevent other subtler forms of inappropriate behavior (i.e., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place at DoorDash.

We value a diverse workforce – people who identify as women, non-binary or gender non-conforming, LGBTQIA+, American Indian or Native Alaskan, Black or African American, Hispanic or Latinx, Native Hawaiian or Other Pacific Islander, differently-abled, caretakers and parents, and veterans are strongly encouraged to apply. Thank you to the Level Playing Field Institute for this statement of non-discrimination.

Pursuant to the San Francisco Fair Chance Ordinance, Los Angeles Fair Chance Initiative for Hiring Ordinance, and any other state or local hiring regulations, we will consider for employment any qualified applicant, including those with arrest and conviction records, in a manner consistent with the applicable regulation.

If you need any accommodations, please inform your recruiting contact upon initial connection.

Notice to Applicants for Jobs Located in NYC or Remote Jobs Associated With Office in NYC Only

We used Covey as part of our hiring and/or promotional process for jobs in NYC and certain features may qualify it as an AEDT in NYC. As part of the hiring and/or promotion process, we provided Covey with job requirements and candidate submitted applications. We began using Covey Scout for Inbound from August 21, 2023, through December 21, 2023. We resumed using Covey Scout for Inbound again on June 29, 2024, and ceased using Covey Scout for Inbound on April 30, 2026. The Covey tool has been reviewed by an independent auditor. Results of the audit may be viewed here:

Vacancy posted 15 hours ago
Similar jobs that could be interesting for you
Based on the Engineering Manager, Reliability Platform in Sunnyvale, CA vacancy
  •  ...applications, without the hassle of managing hundreds of GPUs or TPUs....  ...for a deeply technical, hands-on engineering leader for our on-field Kernel Reliability team. You will lead a high performing...  ...# Build a breakthrough AI platform beyond the constraints of the GPU... 
    Platform

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    4 days ago
  • CrowdStrike Holdings, Inc. is seeking an Engineering Manager for their Site Reliability Engineering team in Sunnyvale, California. This hybrid role involves...  ...and scalability of their AI-native cybersecurity platform.. Ideal candidates will possess over 10 years of software... 
    Platform

    CrowdStrike Holdings, Inc.

    Sunnyvale, CA
    1 day ago
  • $198.3k - $342.8k

    Site Reliability Engineering Manager, eBusiness Services Sunnyvale, California, United States Software and Services Imagine what we could do together...  ...(SRE) Manager to help support and scale our Next‑gen platforms. We are seeking a technically hands on Manager with a real... 
    Platform
    Relocation

    Apple Inc.

    Sunnyvale, CA
    2 days ago
  • $228.1k - $393.8k

    Site Reliability Engineering Manager, Storage - Apple Services Engineering Cupertino, California, United States Software and Services Are you a talented...  ...providing foundational block storage to various compute platforms, working closely with peer SRE teams and development... 
    Platform
    Relocation

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $207k - $300k

    Site Reliability Engineering Manager, Google Distributed Cloud Google Sunnyvale, CA, USA Bachelor’s degree in Computer Science, a related field,...  ...our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud... 
    Platform
    Full time

    Google Inc.

    Sunnyvale, CA
    22 hours ago
  • $250k - $300k

     ...reimagining how developers build reliable, scalable, event-driven...  ...project, OSS Conductor, our platform is now powering billions of...  .... We're hiring a strong engineering leader to own the Cloud &...  ...Tech Lead ready to step into management OR a Manager who still codes... 
    Platform
    Work at office
    Shift work

    Orkes

    Cupertino, CA
    2 days ago
  •  ...Description Job Description Job title: Engineering Manager Location: Only local candidates,...  ...complex Python-based applications and platforms Manage end-to-end software...  ...application performance, scalability, and reliability Drive innovation, adopt new technologies... 
    Platform
    Local area

    ZENITH INFOTEK LLC

    Santa Clara, CA
    6 days ago
  •  ...Engineering Manager, Inference ML Runtime Sunnyvale CA or Toronto Canada Cerebras Systems...  ...power the fastest generative AI inference platform in the world. As an Engineering...  ...memory efficiency); observability and reliability across the inference stack. Ensure... 
    Platform

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    9 days ago
  • $180k - $230k

     ...Senior Engineering Manager, Control Plane Mountain View, CA Aerospike is the real-time database...  ...a team building the Aerospike Cloud Platform — our multi-cloud, multi-tenant...  ...backend teams to deliver performant, reliable, and intuitive user experiences at scale... 
    Platform
    Work at office

    Aerospike

    Mountain View, CA
    4 days ago
  •  ...Engineering Manager As a global leader in cybersecurity, CrowdStrike protects the people, processes...  ...the world's most advanced AI-native platform. We work on large scale distributed...  ...practices and comprehensive system reliability standards. Work with internal peers... 
    Platform
    Permanent employment
    Work experience placement
    Remote work
    Flexible hours

    CrowdStrike

    Sunnyvale, CA
    2 days ago
  • $207k - $304k

     ...p e s t r y ) Software Engineering Mountain View, CA (HQ) About...  ...electricity remains reliable and affordable for everyone....  ...You will work closely with platform and infrastructure teams to...  ...technical leadership or people management role, with a focus on guiding... 
    Platform
    Full time
    Flexible hours

    X: The Moonshot Factory

    Mountain View, CA
    12 hours ago
  • $213k - $266k

     ...Sr. Manager, Engineering At Cloudera, we empower people to transform complex data into clear...  ...world's largest enterprises. The Data Platform Pillar is the bedrock of Cloudera's...  ...testing, deployment, system scalability, reliability, and security, tracking key... 
    Platform
    Remote work
    Work from home
    Flexible hours

    Cloudera

    Santa Clara, CA
    2 days ago
  • $190k - $253.75k

     ...data and AI infrastructure platform so our customers can use deep...  ...their business. Founded by engineers - and customer obsessed - we...  ...deliver extreme elasticity, reliability and cost efficiency. We are...  ...looking for an Engineering Manager to lead a team responsible for... 
    Platform
    Local area
    Immediate start
    Worldwide

    Databricks

    Mountain View, CA
    3 days ago
  • $250k

     ...is the leader in AI knowledge management solutions for enterprises. As...  ...source of truth—explainable, reliable, and maintainable—that serves...  ...Director of Site Reliability Engineering, you will ensure that eGain’s AI knowledge management platform operates with the reliability... 
    Platform
    Work at office

    eGain

    Sunnyvale, CA
    22 hours ago
  • $200k - $322k

    Senior Manager, Site Reliability Engineering. Locations: US, CA, Santa Clara. Time type: Full...  ...* Lead the development of automation and orchestration platforms that reduce manual effort across the outage lifecycle,... 
    Platform

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $190k - $261.25k

    Engineering Manager - Pipelines Engine Mountain View, California; San Francisco, California At...  ...world's best data and AI infrastructure platform so our customers can use deep data...  ...ensure the team builds high-quality and reliable infrastructure services. Experience being... 
    Platform

    Databricks Inc.

    Mountain View, CA
    22 hours ago
  • $190k - $261.25k

     ...data and AI infrastructure platform, so our customers can focus...  ...world. Identity and Access Management (IAM) is at the heart of this...  ...time. We are looking for an Engineering Manager to lead our IAM team...  ...services that must be highly reliable, secure, and fast under extreme... 
    Platform
    Local area

    Databricks Inc.

    Mountain View, CA
    2 days ago
  • $251k - $310k

     ...applied to a range of vehicle platforms and product use cases. The Waymo...  ...Research, Oracles, and Simulation. Manage a team (~10) with diverse skills including engineering, modeling, ML infrastructure,...  ...performance, efficiency, and reliability. Develop the execution... 
    Platform
    Temporary work
    Immediate start

    Neura Market

    Mountain View, CA
    2 days ago
  • $202k - $310k

     ...VP, Engineering Req ID: 81694 Location: Sunnyvale, California...  ...-powered networking, cloud platforms, and enterprise infrastructure...  ...solutions that bring reliability and performance to a world always...  ...performance with simplified management. If you want to grow your... 
    Platform
    Worldwide

    Vistance Networks

    Sunnyvale, CA
    4 days ago
  • $207k - $304k

     ...p e s t r y ) Software Engineering Mountain View, CA (HQ) About...  ...electricity remains reliable and affordable for everyone....  ...to serve as the Engineering Manager for our Infrastructure and Developer...  ...: Architect self-service platforms and CI/CD primitives that allow... 
    Platform
    Full time
    Flexible hours

    X: The Moonshot Factory

    Mountain View, CA
    1 day ago
  •  ...for the modern world. Our cloud-native platform uses computer vision and AI to help...  ...We are looking for a technically deep Engineering Manager to lead the AI team at Coram. This team...  ...Establish strong engineering standards around reliability, observability, and model evaluation... 
    Platform
    Shift work

    Coram AI

    Sunnyvale, CA
    4 days ago
  • $193.8k - $285k

     ...DoorDash Drive is our B2B delivery platform, powering same-day and next-...  ...under Drive: Pizza - Reliability, efficiency, and quality programs...  ...-to-point logistics). Fleet Management (new bet) - Self-serve...  ...About the Role We're hiring an Engineering Manager to lead this team.... 
    Platform
    Work at office
    Flexible hours

    Fairygodboss

    Sunnyvale, CA
    22 hours ago
  • $228.6k - $314.25k

    Sr. Engineering Manager, App Framework Team As a Sr. Engineering Manager on the App Framework team...  ...every service at Databricks fast, reliable, observable, and secure by default, without...  .... The team owns the core application platform: multi-language service and RPC... 
    Platform
    For contractors

    Menlo Ventures

    Mountain View, CA
    22 hours ago
  • $251k - $310k

     ...Engineering Manager, Rider Growth Waymo is an autonomous driving technology company with the...  ...also be applied to a range of vehicle platforms and product use cases. The Waymo Driver...  ...economists to design and scale highly reliable, low-latency promotional engines. Build... 
    Platform
    Full time
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  • $255.7k - $346k

     ...States and Europe. We are looking for an Engineering Manager to lead ML teams within SDS Core. This...  ...program teams to translate vehicle platform constraints into model architecture and...  ...-to-end) while maintaining production reliability. Nice to have: Familiarity with... 
    Platform
    Full time

    Applied Intuition

    Sunnyvale, CA
    4 days ago
  • $207k - $300k

    Engineering Manager, Capacity Management, Google Fleet Google Sunnyvale, CA...  ...prioritize security, efficiency, and reliability across everything we do - from developing...  ...Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise... 
    Platform
    Full time

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • $217k - $288.4k

     ...best data and AI infrastructure platform so our customers can use deep...  ...their business. Founded by engineers — and customer obsessed — we...  ...started. As a Senior Engineering Manager at Databricks, you will lead...  ...available, scalable and reliable software that is necessary to... 
    Platform
    Local area
    Worldwide

    Databricks Inc.

    Mountain View, CA
    2 days ago
  • $255.7k - $346k

     ...and trust our employees to manage their schedules responsibly....  ...Europe. We are looking for an Engineering Manager to lead ML teams...  ...teams to translate vehicle platform constraints into model architecture...  ...maintaining production reliability. Nice to have: Familiarity... 
    Platform
    Full time
    For contractors
    For subcontractor
    Casual work
    Work at office
    Remote work
    Day shift

    Decisive Point

    Sunnyvale, CA
    22 hours ago
  • $193.8k - $285k

    Fairygodboss is hiring an Engineering Manager to lead a multidisciplinary team across various Lines of Business, including Pizza and Health...  ...you'll be expected to drive technical direction and ensure reliability for critical merchant deliveries, all within a hybrid... 
    Platform
    Flexible hours

    Fairygodboss

    Sunnyvale, CA
    22 hours ago
  • $202k - $247k

    Job Category Site Reliability Engineering Posting Date 11/18/2025, 12:24 AM Locations Santa Clara,...  ...You will help support the FortiCNAPP platform and play a key role in building, operating...  ..., network/connectivity, workload management, observability, and storage services.... 
    Platform
    Full time
    Worldwide

    Fortinet, Inc.

    Santa Clara, CA
    2 days ago
