Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

24x7 IT Infrastructure & Incident Specialist

Net2Source (N2S)

Location: Redmond/ WA, Local onsite; 24x7 rotational shifts (including weekends and on-call support) Local only Shift Requirement • 24x7 rotational shifts (including weekends and on-call support) Role Overview Responsible for 24x7 monitoring, incident management, and operational support of a large-scale hybrid infrastructure. The role ensures high availability, performance, and reliability across Production, DR, and Non-Production environments. Key Responsibilities Infrastructure Monitoring & Operations Monitor 1200+ servers (Windows/Linux), virtualization platforms (VMware, Nutanix), and web servers Oversee PB-scale storage systems (Quantum, Isilon, NAS, SAN) Monitor 1200+ network devices including switches, routers, firewalls, VPNs, WAPs, and ISP circuits Handle incidents and service requests related to infrastructure and tools Perform L1/L2 triage for alerts, incidents, and outages Ensure timely resolution and escalation as per SLAs Correlate alerts across tools to identify root causes Application & Service Monitoring Track service health and dependencies (web, middleware, backend) Capacity & Performance Management Monitor utilization trends across compute, storage, and network Identify bottlenecks and recommend optimizations Change & Release Support Support deployments, patching, and maintenance Validate system health before and after changes Disaster Recovery & Resilience Support DR readiness and failover validation Participate in DR drills Reporting & Documentation Maintain dashboards, runbooks, and reports Provide daily/weekly health and SLA updates Required Skills Technical Skills Networking: TCP/IP, DNS, VPN, Firewalls, Load Balancers (F5) Monitoring tools: New Relic, Splunk, Nagios, Zabbix, Dynatrace, SCOM ITSM tools: ServiceNow (preferred) Backup tools: Rubrik Operational Skills Strong incident management in 24x7 environments Troubleshooting and analytical skills Ability to correlate infra, network, and application issues Strong communication and coordination Ability to work under pressure Documentation and reporting skills Preferred Qualifications ITIL Foundation Certification Experience in large enterprise or MSP environments Exposure to AWS/Azure (preferred) Process flows Knowledge transfer and mentoring Contribution to project deliverables Data conversion and maintenance Industry best practices and innovative solutions Technical configuration and development support #J-18808-Ljbffr Net2Source (N2S)

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the 24x7 IT Infrastructure & Incident Specialist in Redmond, WA vacancy
  •  ...Job Title: Systems & Infrastructure Specialist Any additional information you require for this job can be found in the below text Make sure...  ..., and performance. • Document system architectures, incident responses, and recovery protocols with meticulous clarity.... 
    Suggested
    For contractors
    Remote work

    micro1

    Sammamish, WA
    1 day ago
  • $95k - $120k

     ...actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. DATA INFRASTRUCTURE OPERATIONS SPECIALIST II (STARLINK) The Infrastructure Operations team is responsible for the end-to-end management of suppliers, costs,... 
    Suggested
    Permanent employment
    Temporary work
    Work at office
    Monday to Friday
    Flexible hours

    SpaceX

    Redmond, WA
    1 day ago
  • $85k - $100k

     ...International Infrastructure Operations Specialist (Starlink) Redmond, WA SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the... 
    Suggested
    Permanent employment
    Temporary work
    Work at office
    Worldwide
    Monday to Friday
    Flexible hours

    SpaceX

    Redmond, WA
    3 hours ago
  •  ...or local law. Summary of Position: The Senior Infrastructure Engineer is Denali Advanced Integration's hands-on technical...  ...Monitor ServiceNow and respond to infrastructure Alerts, Incidents, and Service Requests in alignment with defined SLAs and SOC... 
    Suggested
    Hourly pay
    Contract work
    Temporary work
    Work experience placement
    Work at office
    Local area

    3md

    Redmond, WA
    3 days ago
  •  ...The Cloud Support Engineer will serve as a part of the incident management team in a 24x7 Microsoft Azure environment. Candidate will diagnose,...  ...~2 years operations experience providing application infrastructure support ~1 year performing system administration support... 
    Suggested
    Contract work
    Work at office
    Shift work
    Night shift

    ASM Research

    Redmond, WA
    4 days ago
  •  ...to 10:30pm. Position Description Serve as a part of the incident management team in a 24X7 Microsoft Azure environment. Candidate will diagnose,...  ...2 years of operations experience providing application infrastructure support; 1 year performing system administrator support... 
    Shift work
    Night shift
    Afternoon shift

    Manpower Group Inc.

    Redmond, WA
    3 days ago
  •  ...Incident Management Team Member Serve as a part of the incident management team in a 24X7 Microsoft Azure environment. Candidate will diagnose, mitigate and/or escalate...  ...operations experience providing application infrastructure support; 1 year performing system... 
    Contract work
    Work at office

    ASM Research

    Redmond, WA
    4 days ago
  • $160k - $225k

     ...developing and deploying interconnection infrastructure, including peering, transit, caching,...  ...engineers, software developers, and operations specialists to ensure seamless integration of the...  ...including route validation, telemetry, incident response, and lifecycle tracking.... 
    Permanent employment
    Temporary work
    Work at office
    Remote work
    Worldwide
    Monday to Friday
    Flexible hours

    SpaceX

    Redmond, WA
    3 days ago
  • $45k - $121k

     ...delivering end-to-end network operations and infrastructure support for Wipro in the Puget Sound...  ...Provide onsite and remote support for incidents, service requests, and scheduled...  ...evolution of our business and our industry. It has always been in our DNA - as the world... 
    Minimum wage
    Work at office
    Local area
    Remote work
    Relocation

    Wipro

    Redmond, WA
    5 days ago
  • $100.6k - $199k

     ...scalability and reliability. We design and develop cutting-edge infrastructure that supports high-performance AI model training at scale,...  ...definitions to manage deployments. Supports the management of incidents by applying technical knowledge to diagnose and triage issues... 
    Ongoing contract
    Local area

    Microsoft Corporation

    Redmond, WA
    2 days ago
  • $29.25 - $43.87 per hour

    HNTB is seeking a Project Coordinator in Bellevue, WA, to support project teams through administrative duties. This role involves tracking client deliverables, maintaining schedules, and coordinating project documentation. A Bachelor's degree or relevant experience is required...
    Hourly pay

    HNTB

    Bellevue, WA
    4 days ago
  • $100k - $130k

     ...support, executing procedures for outage restoration, and maintaining documentation. Candidates must possess a TS/SCI clearance, strong IT troubleshooting skills, and a Bachelor's degree in a related field. Competitive compensation ranges from $100,000 to $130,000 based... 

    Zachary Piper Solutions

    Redmond, WA
    3 days ago
  • $142.8k - $274.8k

     ...with deep Kubernetes expertise to help build the cloud-native infrastructure layer for Observability services. We design and operate...  ...owning live production systems, including on-call rotations, incident mitigation, and operational excellence. - Proficiency in one... 
    Ongoing contract
    Local area

    Microsoft Corporation

    Redmond, WA
    9 days ago
  • $119.8k - $234.7k

     ...around the world. As Microsoft continues to evolve its secure infrastructure and elevate customer trust, the Secure Production Access (SPA...  ...-as-code practices for consistency and efficiency. Incident Response: Serve as the Tier 3 escalation point for complex network... 
    Ongoing contract
    Work at office
    Local area
    3 days per week

    Microsoft Corporation

    Redmond, WA
    2 days ago
  •  ...: DOJ, DOD, CJIS Core Responsibilities Provide Service Operations Support for: Monitoring Triaging Incident management Problem management Troubleshooting Crisis management Scheduled work items Project Management of... 
    Full time
    Remote work
    Night shift

    Futran Tech Solutions Pvt. Ltd.

    Kirkland, WA
    3 days ago
  • $61.31 - $104.39 per hour

     .... Demonstrated experience in deploying and managing cloud infrastructure (IaC) using Terraform and Azure CLI tools. Proficient in scripting...  ...to support an on call rotation and respond to production incidents outside normal business hours. Excellent verbal and... 
    Minimum wage
    Full time
    Shift work

    Providence Service

    Redmond, WA
    2 days ago
  • $106.4k - $177.3k

     ...exciting new opportunity to join us as a Sr. Infrastructure Engineer I! About the role As a Sr...  ...to-day operations, lead coordination of incident and request handling, and contribute to...  ...and become a student of the business (it takes real effort!), there are easier... 
    Full time
    Immediate start
    Remote work
    Work from home
    Flexible hours

    Symetra

    Bellevue, WA
    4 days ago
  • SOC Design Verification Engineer Location: Redmond, WA Hybrid (Remote option allowed) Minimum Qualifications: ~ Track record of 'first-pass success' in ASIC development cycles. ~ Bachelor's degree in Computer Science, Computer Engineering, relevant technical...
    Remote work

    Redolent

    Redmond, WA
    5 days ago
  • $100k - $140k

     ...Senior IT Infrastructure Engineer SystImmune is a leading and well-funded clinical-stage biopharmaceutical company located in Redmond, WA and Princeton, NJ. It specializes in developing innovative cancer treatments using its established drug development platforms, focusing... 
    Full time
    Work at office
    Local area
    Remote work
    Relocation

    SystImmune Inc.

    Redmond, WA
    5 days ago
  • $100.6k - $199k

     ...one of the highest-scale experimentation platforms - critical infrastructure that enables rapid iteration in AI systems and product...  ...access, and improve TSGs, telemetry, and fixes that reduce future incidents. · Contribute to engineering and operational excellence through... 
    Ongoing contract
    Local area

    Microsoft Corporation

    Redmond, WA
    4 days ago
  • $58.17 - $62.5 per hour

     ...for both customer-facing support and platform operations. The ideal candidate will bring expertise in Microsoft 365 environments, incident resolution coordination, and the ability to drive continuous improvement across support and deployment operations. Key Responsibilities... 
    Contract work
    Work at office

    ASM Research, An Accenture Federal Services Company

    Redmond, WA
    2 days ago
  • $85.1k - $169.8k

     ...Security Architects, Enterprise Architects, IT Management, and Developers to to Secure...  ...field AND 4+ years experience in cloud/infrastructure technologies, information technology (IT...  ...information Assurance Certification, Incident Response. ~ Knowledge of cloud security... 
    Ongoing contract
    Local area
    Work from home
    Flexible hours

    Microsoft Corporation

    Redmond, WA
    3 days ago
  •  ...Azure subscription access, sensitive app/access requests) and ensure approvals/justifications are captured. Security Operations, Incident Triage, and Response Triage and manage security incidents and notifications, including escalations and coordination with external... 

    Ascendion

    Redmond, WA
    3 days ago
  •  ...maintenance of automation in support of project objectives using Infrastructure as Code tools like Terraform, Ansible / Ansible Tower, and...  ...with the consulting team and client stakeholders to prevent incidents by delivering robust solutions, as well as resolving priority... 
    Hourly pay
    Temporary work
    Work at office
    Local area

    3md

    Redmond, WA
    4 days ago
  • $157.6k - $197k

     ...Senior Security Engineer - Infrastructure Bellevue Office, Sunset Corporate Campus Armada is the hyperscaler for the edge, delivering...  ...and maintain security monitoring tools, and participate in incident response activities Architect and implement security solutions... 
    Work at office
    Flexible hours

    Armada

    Bellevue, WA
    5 days ago
  • Backend Developer Expertise on Automation Framework Development Experienced in scripting language Python and Automation tool Spirent iTest Experience in test automation development . Experienced in 5GC / LTE / 3G Packet Core Experienced in preparing, review...

    Yantran LLC

    Redmond, WA
    5 days ago
  •  ...• Previous MS experience is PLUS. • Having Knowledge in Dev-ops is a Plus • Keeping relevant parties updated on the status of incidents • Developing and implementing processes and procedures for incident management • Developing change management plans for change... 

    Futran Tech Solutions Pvt. Ltd.

    Bellevue, WA
    1 day ago
  • ITAR Network Engineer Excellent network administration and troubleshooting skills. Good understanding of TCP/IP, routing protocols, load balancing and network security concepts Knowledge and experience in VTP and other switching technologies Ability to understand...
    Work experience placement

    Tech Tammina

    Redmond, WA
    5 days ago
  • Sr Network Engineer Contract Location: Redmond WA Skill: 1g to 5g Journey foundation Experience: 10+ years Strong knowledge in LTE/UMTS/GPRS with basic IMS knowledge is desired Experience with Wireless Data Core networks, Architecture, Protocols and Interfaces...
    Contract work

    Keylent Inc

    Redmond, WA
    5 days ago
  •  ...etc.) Good experience with the production support processes (incident/service, change, problem) Good experience with patch implementations...  ...every year of education. At least 4 years of experience in IT Additional Information ** U.S. citizens and those... 
    Full time

    SonSoft

    Bellevue, WA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to 24x7 IT Infrastructure & Incident Specialist. Be the first to apply!