Reliability & Observability Analyst II
IREN
Job Type: Full-Time | Location: Dallas / Fort Worth, TX | Department: Operations | Reporting to: Data Center Manager | Work Location Type: #onsite IREN is a leading next-generation data center business powered by 100% renewable energy. We build, own and operate high‑performance computing data centers, focusing on sustainability and positive community impact. Job Description We are seeking an IOC Reliability & Observability Analyst II to support our 24/7 HPC Data Center Operations. The role performs advanced incident triage, enhances alert quality and routing, and maintains operational telemetry and reporting. It partners with engineering and operations teams to identify detection gaps, tune monitoring dashboards, and implement small automations to reduce operational toil. Job Requirements 3–5 years of experience in IOC/NOC/SRE‑adjacent operations, reliability engineering, observability, or production support within 24/7 production environments. Bachelor’s degree in Computer Science, Data Science, IT, or equivalent hands‑on professional experience. Demonstrated ability to apply reliability engineering principles (incident lifecycle, MTTD/MTTR, operational risk). Strong working knowledge of Linux systems, basic networking, and infrastructure dependencies across compute, network, and facility domains. Practical experience supporting GPU‑based compute environments or high‑density clusters, including analysis of GPU health, performance degradation, and failure patterns. Proven experience owning and improving alert quality, reducing false positives, missed detections, poor routing, and alert fatigue. Hands‑on experience maintaining service health dashboards and operational reliability metrics, including SLI/SLO reporting where defined. Ability to correlate logs, metrics, and alerts across distributed systems (GPU, network, facility telemetry) to accelerate triage. Experience working with AIOps‑enabled outputs (anomaly detection, event correlation, automated enrichment) and validating accuracy. Ability to write or modify small automation artifacts (scripts, templates, configuration‑driven workflows). Experience ensuring operational data integrity across ticketing systems, incident records, and dashboards. Strong communication skills and ability to work cross‑functionally with IOC leadership, engineering, and operations teams, including mentoring. Strong working experience with IOC/NOC tooling: ITSM/ticketing systems (ServiceNow, Jira) and monitoring platforms (Splunk, Datadog). Experience producing operational reports, incident summaries, and shift handoff documentation for IOC leadership and stakeholders. Familiarity with RCA workflows and ensuring incident records, timelines, and artifacts are complete and accurate. Other Important Requirements This role operates in a 24×7 IOC/NOC environment and works 12‑hour rotating shifts on a 4‑days‑on / 3‑days‑off schedule, alternating with a 3‑days‑on / 4‑days‑off schedule. Pre‑employment screening, including background check and substance testing, may be required according to company policies. Job Responsibilities Perform advanced Level 2 incident analysis to review data, system behavior, and operational signals across GPU clusters, networks, and facilities for recurring issues. Maintain IOC service health dashboards and operational metrics that reflect alert effectiveness, incident response performance (MTTD/MTTR), and customer impact. Identify alerting and monitoring gaps, under‑monitored systems, and noisy or ineffective alerts; perform day‑to‑day tuning of thresholds, routing, suppression, and enrichment. Own operational alert quality outcomes by ensuring sustained reductions in false positives, missed detections, poor routing, and alert fatigue. Analyze GPU health and performance signals during incidents to support faster triage and reduce customer impact. Validate and oversee automated detection and correlation outputs to ensure alerts, anomalies, and insights are accurate and actionable. Implement and maintain IOC‑level automation (alert routing rules, enrichment fields, ticket templates, runbook scripts). Ensure ITSM incident and ticket records meet IOC quality standards, supporting RCA workflows. Provide peer coaching and onboarding support to Analyst I team members on triage patterns, alert interpretation, dashboard usage, and runbooks. Support IOC shift operations through detailed incident handoffs, queue hygiene, and coordination with on‑call engineering and facilities teams during escalations. Job Benefits IREN offers a comprehensive, market‑competitive total rewards package that supports well‑being, career advancement, and financial wealth. Key components include: Competitive compensation and overtime pay for non‑exempt workers. 100% company‑paid health insurance premiums for employees and 75% for dependents; disability, life, and voluntary coverage options. 401(k) retirement plan with company match; paid professional development services. Paid Time Off (PTO) and paid holidays. Access to employee assistance program and wellness resources. Opportunities for professional certifications and continuing education. IREN values diverse perspectives and believes skills can be developed. Whether you meet all criteria or not, we encourage you to apply. IE US Operations Inc., the employing entity and proud member of the IREN group, is an equal‑opportunity employer committed to creating an inclusive workplace. We evaluate qualified applicants and do not discriminate against protected characteristics under applicable legislation. We participate in E‑Verify and will provide the federal government with your Form I‑9 information to confirm that you are authorized to work in the U.S. (E‑Verify Participation Notice). By applying for this position and submitting your resume and application materials, you consent to the processing of your personal information in accordance with our Job Applicant Privacy Statement available at #J-18808-Ljbffr IREN
- IREN is looking for an IOC Reliability & Observability Analyst II in Fort Worth, TX. The role supports 24/7 HPC Data Center Operations, focusing on incident analysis, operational telemetry, and system reliability. Ideal candidates have 3-5 years in related fields and a...Suggested
- ...data center business powered by 100% renewable energy. As part of our HPC Data Center Operations, we are seeking an IOC Reliability & Observability Analyst I. This entry‑level role focuses on analyzing operational signals, improving incident quality, and supporting AIOps‑...SuggestedFull timeTemporary workNight shiftRotating shift
- IREN is seeking an entry-level IOC Reliability & Observability Analyst I for its Fort Worth location. This role involves analyzing incident data and improving incident quality in a 24/7 environment. Candidates should have 1-3 years of relevant experience and a Bachelor'...Suggested
- ...HIM Coder Analyst II-REMOTE within State of TX page is loaded## HIM Coder Analyst II-REMOTE within State of TXlocations: Fort Worth,... ...physician documentation for ambulatory surgery, special procedure, observation, emergency department, outpatient ancillary and clinic visit...SuggestedFull timeWork at officeRemote workFlexible hoursShift workDay shift
$95.86k - $208.27k
...Advisory. KPMG is currently seeking a Senior Specialist, SOC Analyst Level II to join our Advisory Services practice. Responsibilities... ..., each year KPMG publishes a calendar of holidays to be observed during the year and provides eligible employees two breaks each...SuggestedH1bLocal areaShift workNight shiftWeekend work- ...meet you. Apply today and start the most rewarding chapter of your career with us. Job Description The Information Security Analyst II is responsible for safeguarding the bank's sensitive data, systems, and customer information from cyber threats. The Information...Contract workLocal areaImmediate start
- ...our patients and customers. We foster an inclusive culture and are looking for diverse, talented people to join Alcon. As a QA Analyst II, Product Releasesupporting our QC, Analysis and Test Service Team, you will be trusted to lead all aspects of the batch release...Visa sponsorshipRelocation packageMonday to Friday
- ...board! Why you'll love this job This job is a member of the Reliability Engineering Team within the Technical Operations Division. Responsible... ...be performed whenever it is deemed appropriate to do so, observing, of course, any legal obligations including any collective...ApprenticeshipWork at officeLocal areaFlexible hours
- A financial institution in Fort Worth, Texas is seeking an experienced Information Security Analyst II to safeguard sensitive data and systems from cyber threats. You will monitor for security incidents, conduct risk assessments, and support audits while developing security...
- ...can truly make a global impact. This position will be posted until filled. Responsibilities About the role The Data Analyst II supports the Credit Review department by utilizing technical expertise to develop new and enhance existing reporting, prepare...Work experience placementWork at officeVisa sponsorshipFlexible hours2 days per week
$15.26 - $23.28 per hour
...The Provider Management Analyst is responsible for verifying provider information and requests for documentation to audit claims including... ...ranges may be modified at any time. For leveled roles (I, II, III, Senior, Lead, etc.) new hires may be slotted into a different...Hourly payMinimum wageFull timeLocal areaRemote workFlexible hours- Alcon in Fort Worth, Texas, is seeking a QA Analyst II to lead the batch release process for quality control. This role requires ensuring compliance with corporate and regulatory standards while collaborating closely with production teams. The ideal candidate will possess...
- ...ensure company compliance with FAA 96‑hour submittal requirement. Aid in Corrosion Prevention and Control Program and Major Repair Reliability evaluation and reporting. Assist in identifying and entering events into the various Tech Ops databases. Complete analysis of...Work at officeFlexible hours
- A financial institution is seeking an experienced Cyber Security Analyst II in Fort Worth, Texas. This position focuses on safeguarding sensitive data through vulnerability management and requires strong problem-solving skills along with a technical background in cybersecurity...
- ...aligning with our brand platform, "We go above. So you can go beyond." Job Description The Credit Services Business Analyst II position oversees the operation, maintenance and system related procedures for one of the organization's systems; responsible for...Local area
- ...setting is preferred.Must have regular and punctual attendance, reliable transportation, flexibility to work overtime, evenings and / or... ...driving record as defined by City policy. Code Compliance Officer II - High school diploma or equivalent.An equivalent combination of...For contractorsWork at officeLocal areaWeekend workAfternoon shift
- A leading automotive finance company is searching for a Risk Analyst II to analyze credit risk related to loan and lease activities. Key responsibilities include data mining to assess risk exposure, developing credit policies, and monitoring performance metrics. Candidates...Work at officeFlexible hours2 days per week
- ...A healthcare provider for children in Texas is seeking an HIM Coder Analyst II to accurately code medical records, ensuring compliance with ICD-10-CM and CPT guidelines. This role requires attention to detail and collaboration with healthcare professionals to improve...Remote work
$66.94k - $101.26k
...Payment Integrity Analyst II The Payment Integrity Analyst is responsible for accurately reviewing and completing pre- and post pay claim audits based on client, policy, industry standards and/or CMS guidelines. Essential Functions & Responsibilities: Reviews...Minimum wageFull timeWork experience placementWork at officeLocal areaFlexible hours- Lockheed Martin in Fort Worth, TX is seeking an experienced professional to support the development of quality plans and procedures. This role involves ensuring conformance to company and regulatory standards, analyzing quality-related data, and maintaining accurate quality...Full time3 days per week
- ...than work — we thrive. Our Purpose: We pioneer the innovations that move and connect people to what matters About the role: Risk Analyst II - Credit Risk Analytics is responsible for analyzing credit risk exposure related to consumer and commercial loan and lease...
$87k - $165.5k
...Insurance Product Analyst II, State Management Working at General Motors Insurance is a chance to help reinvent what insurance feels like for GM drivers standing at the intersection of the larger GM enterprise and a growth business focused on safeguarding the GM consumer...Full timeWork experience placementWork at officeRemote workVisa sponsorship- ...customer. We also offer commercial lending products to help dealers finance and grow their businesses. Job Description The Risk Analyst II - Portfolio Forecasting is responsible for assisting in the design, development, and maintenance of complex portfolio forecasting...Work experience placementWork at officeVisa sponsorshipFlexible hours2 days per week
- ...than work — we thrive. Our Purpose: We pioneer the innovations that move and connect people to what matters About The Role Risk Analyst II - Credit Risk Analytics is responsible for analyzing credit risk exposure related to consumer and commercial loan and lease...Work experience placementWork at officeFlexible hours2 days per week
- ...PRIDE Industries Job Description Job: HVAC Mechanic II Job Code: J64 - WF-HVAC Mechanic II SCA Occup: 23410 Heating... ...preventative maintenance on heating and cooling systems. Observes pressure and vacuum gauges and adjusts controls to insure proper...Contract workWork experience placementWork at officeLocal area
- ...construction of transportation projects. The Construction Inspector II will receive supervision and support from one or more senior... ...weekends required. Work may require contact with the public. Duties Observes and inspects ongoing construction work. Reviews plans and...Contract workFor contractorsWork at officeFlexible hoursNight shiftWeekend work
- ...efficiently in support of quality and compliance objectives • Execute assigned tasks with a high level of attention to detail, reliability, and consistency following job-related training • Monitor product quality at different stages and detect and escalate non-...Full timeRemote work
$19.21 - $28.73 per hour
...Repricing Quality Assurance Analyst I The Repricing Quality Assurance Analyst is responsible for reviewing, analyzing, and monitoring... ...Our ranges may be modified at any time. For leveled roles (I, II, III, Senior, Lead, etc.) new hires may be slotted into a...Hourly payMinimum wageFull timeWork at officeLocal areaRemote workFlexible hours$25 - $35.75 per hour
Siemens Mobility in Fort Worth, TX, is seeking a Quality Test Technician II to support the calibration and quality of measurement tools in manufacturing. This role requires experience in manufacturing, calibration software, and precision tools. The position offers a competitive...Hourly pay- A leading global finance provider is looking for a Risk Analyst II in Fort Worth, Texas, to analyze credit risk exposure for consumer and commercial loans. This role involves conducting risk analysis, creating and monitoring credit policies, and measuring credit performance...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Reliability & Observability Analyst II. Be the first to apply!
- IT analyst Fort Worth, TX
- call center workforce analyst Fort Worth, TX
- cash analyst Fort Worth, TX
- recruiting analyst Fort Worth, TX
- language analyst Fort Worth, TX
- category analyst Fort Worth, TX
- agriculture analyst Fort Worth, TX
- internal audit analyst Fort Worth, TX
- strategic sourcing analyst Fort Worth, TX
- senior purchasing analyst Fort Worth, TX

