Incident Manager

$103.9k - $145.53k
Full-time

Databricks Inc.

Incident Manager US Remote

CSQ127R151

At Databricks, we are passionate about empowering data teams to tackle the world’s most challenging problems, from bringing the next mode of transportation to reality to accelerating the development of medical breakthroughs. We achieve this by building and operating the world’s best data and AI infrastructure platform, enabling our customers to leverage deep data insights and enhance their business. Founded by engineers, and customer-obsessed, we leap at every opportunity to tackle technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started.

As an Incident Manager, you will lead Databricks’ most critical production incidents while providing clear, accurate, and timely communication to customers, executives, and engineers. You’ll serve as both incident commander and reliability engineer, orchestrating multi-team responses, driving real-time status updates, and partnering with engineering to analyze and prevent failures. Your work will ensure Databricks maintains not only technical resilience but also customer and stakeholder confidence during high-impact events.

This role combines operational leadership, technical systems knowledge, and exceptional communication skills. You will be at the intersection of engineering depth and operational clarity, ensuring that every major incident is managed with precision, transparency, and continuous improvement.

The impact you will have here:

  • Lead critical incidents: coordinate multi-disciplinary response efforts across Databricks’ cloud-based services to rapidly mitigate impact and restore operations.
  • Drive technical root cause analysis and reliability improvements: collaborate with engineering teams to trace and document underlying causes across distributed systems, services, and data stores.
  • Summarize key learnings, clearly communicate action items, and ensure that technical and procedural improvements are followed through.
  • Own communications during incidents: deliver frequent, high-quality updates to internal stakeholders (executives, engineering leadership, support) and compose and publish customer-facing notifications that are accurate, timely, and empathetic.
  • Mentor and train peers in both incident communication and technical response disciplines to raise the overall quality of Databricks’ incident response.

What are we looking for:

  • 5+ years of experience in incident management, site reliability engineering, or production operations supporting large-scale, cloud-native systems.
  • Proven ability to lead and coordinate high-severity incidents, including identifying impact, isolating fault domains, and managing multi-team response efforts.
  • Strong understanding of cloud infrastructure (AWS, Azure, or GCP), including compute, networking, storage, and observability components.
  • Deep expertise in log analysis and debugging: familiarity with log aggregation and search tools (e.g., Datadog, Elasticsearch, Splunk, Cloud Logging, or OpenTelemetry).
  • Hands-on experience with observability systems: metrics, logging, and tracing frameworks (Prometheus, Grafana, OpenTelemetry, etc.).
  • Proficiency in at least one major programming or scripting language (Python, Go, or Bash) for automating diagnostics, data collection, or analysis.
  • Experience developing and maintaining incident playbooks and communication templates to ensure consistent, timely updates.
  • Excellent contextual interpretation and writing skills, as well as the ability to effectively summarize and communicate to both technical and business audiences, are required.
  • BS, Master's or other advanced degree in Computer Science or Computer Engineering, or related Engineering field.

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here.

Zone 3 Pay Range

$103,900—$145,525 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide, including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500, rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

Benefits

At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region click here.

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Vacancy posted 2 days ago
