Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Platform Reliability Engineer, Azure

Wellfit Technologies

Job Description

Job Description

Wellfit is the dental industry’s fintech solution , breaking down financial barriers so patients, providers, employers, and payors can all access better care. As a healthcare fintech innovator, we’re transforming the patient journey and redefining what’s possible in dental care.

About Wellfit

Wellfit is the dental industry’s fintech solution, breaking down financial barriers so patients, providers, employers, and payors can access better care. As a healthcare fintech innovator, we are transforming the patient journey and redefining what is possible in dental care. Today, Wellfit supports a growing production platform serving 1,100+ offices and processing $1.5B+ in annual transactions . As we continue to scale, reliability, observability, alerting, and production readiness are critical to how we support our customers and deliver with confidence.

About the Role:
We are seeking a hands-on  Platform Reliability Engineer, Azure to help strengthen the reliability, visibility, and operational maturity of our Azure-based platforms.

This role is ideal for someone who enjoys working directly in Azure, improving production systems, troubleshooting issues across infrastructure and application layers, and building practical monitoring and alerting solutions that help teams respond faster and operate more confidently.

You do not need to be an expert in every part of the stack on day one. We are looking for someone with strong Azure experience, solid troubleshooting instincts, a DevOps/reliability mindset, and the ability to collaborate closely with engineering teams across systems, services, and applications.

What You’ll Do

• Own and improve monitoring, alerting, and observability across Azure-based production systems.
•Work directly in Azure Monitor, Application Insights, App Services, logs, metrics, traces, and related Azure tooling to troubleshoot reliability and performance issues.
• Build and refine practical alerting workflows, including Sev0/Sev1 alert routing, escalation paths, and runbook integration.
• Create and maintain clear, actionable runbooks that help on-call engineers respond confidently to production incidents.
• Partner with engineering teams to investigate issues across infrastructure, configuration, deployments, services, and application behavior.
• Support release readiness by improving visibility into critical Azure resources before, during, and after production deployments.
• Build and maintain dashboards in tools such as Grafana, Azure Monitor, Application Insights, or similar observability platforms.
• Help configure incident routing integrations, including Slack/webhook-based alert delivery to the appropriate team channels.
• Automate repeatable operational tasks using PowerShell, Logic Apps, Azure tooling, or similar workflow automation methods.
• Contribute to RCA documentation, incident follow-up, reliability improvements, and operational playbook development.

What We’re Looking For
• Hands-on experience supporting production systems in Azure.
• Strong working knowledge of Azure App Services, Azure Monitor, Application Insights, and Azure production troubleshooting.
• Experience with DevOps, cloud operations, site reliability, platform engineering, or production support in a hands-on environment.
• Strong troubleshooting instincts and the ability to work through ambiguous production issues.
• Comfort working across logs, metrics, traces, alerts, configurations, deployments, and service dependencies.
• Ability to collaborate with software engineering teams across the stack, including .NET, Angular, SQL, APIs, and cloud services.
• Experience building or improving dashboards, alerts, runbooks, incident workflows, or operational playbooks.
• Working knowledge of scripting or automation, preferably with PowerShell, Logic Apps, CLI tooling, or similar technologies.
• Clear communication skills with the ability to document findings, explain issues, and drive follow-through after incidents.
• A high-ownership mindset with the ability to create structure, improve processes, and operate effectively in a fast-moving environment.

Preferred Experience
• Azure certifications.
• Grafana, Prometheus, DataDog, Dynatrace, or similar observability/APM tools.
• Slack integrations, webhooks, Logic Apps, or incident routing workflows.
• Azure Front Door, CDN, Function Apps, WebJobs, Service Bus, Event Hub, Event Grid, SQL Pools, App Service Plans, or related Azure services.
• Experience in healthcare, fintech, payments, or other high-availability environments.
• Experience in startup, SMB, or scale-up environments where ownership is broad and hands-on.

What Success Looks Like 
• You understand how production systems are monitored, where alerting gaps exist, and how to improve them.
• You can work directly in Azure to investigate issues, improve visibility, and support reliable operations.
• You build practical runbooks, dashboards, and alerting workflows that teams actually use.
• You collaborate well with engineers, ask strong troubleshooting questions, and help drive issues to resolution.
• You bring ownership, curiosity, and a builder mindset to a growing platform environment.

Why Wellfit
• Make an Impact: Your work will directly strengthen the reliability of a fast-growing healthcare fintech platform supporting 1,100+ offices and $1.5B+ in annual transactions.
• Build and Own: This is a high-impact role where you will help shape how we monitor, operate, and scale production systems.
• Work Flexibly: Hybrid model based in Dallas with 3 days per week in office.
• Comprehensive Benefits: Full medical, dental, vision, generous PTO, bonus eligibility, and 401(k) matching.
• Fast-Growth Environment: A rare opportunity to grow with a profitable startup on a national trajectory.

 

Alongside a competitive annual bonus, we offer a 401(k) with up to a 4% match, generous paid time off, and comprehensive healthcare benefits.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 24 days ago
Similar jobs that could be interesting for youBased on the Platform Reliability Engineer, Azure in Irving, TX vacancy
  • $90 per hour

    CorGTA is seeking a Senior SRE Engineer in Dallas, Texas, to support production infrastructure. This role offers a contract to design and...  ...manage CI/CD pipelines, and ensure system observability using Azure and Terraform. Candidates should hold a Bachelor's in computer... 
    Suggested
    Hourly pay
    Contract work

    CorGTA

    Dallas, TX
    5 days ago
  • $85 - $90 per hour

    Senior SRE Engineer (AKS, Azure, Terraform, Kubernetes, and PowerShell.) JOB ID - 7933 Role: Senior SRE Engineer Location: Dallas / Fort Worth...  ...with Azure. Experience with container orchestration platforms such as Kubernetes. Experience using IAC tools such as Terraform... 
    Suggested
    Hourly pay
    Contract work
    Work experience placement

    CorGTA

    Dallas, TX
    5 days ago
  • $116.4k - $204.1k

     ...A prominent software company in Coppell, Texas is looking for a Lead Product Software Engineer - Cloud Operations. This position involves owning the infrastructure foundations that power AI solutions, designing cloud architecture, and ensuring team collaboration. Ideal... 
    Suggested

    Wolters Kluwer

    Coppell, TX
    3 days ago
  • Analytic Partners is hiring for a Platform Engineer to manage the Internal Developer Platform (IDP). This role involves owning the platform,...  ...years of experience in relevant roles, and strong skills in AWS, Azure, and CI/CD practices. The position supports a collaborative... 
    Suggested

    Analytic Partners

    Dallas, TX
    4 days ago
  • A leading financial institution in Dallas is looking for a Senior Platform Engineer to manage and enhance their Azure and Databricks platforms. The successful candidate will provide technical oversight, ensure high-quality service delivery, and lead incident management... 
    Suggested
    Flexible hours

    Scotiabank

    Dallas, TX
    1 day ago
  •  ...Specialist to enhance their critical enterprise platform. This role involves supporting complex...  ...operations while ensuring system reliability and effective change control. The ideal...  ...position emphasizes collaboration with engineering and business teams to enhance platform... 

    Mainz Brady Group

    Dallas, TX
    5 days ago
  • A leading financial institution is seeking a Platform Engineer to enhance their Data & AI platforms. This role involves collaborating with senior engineers to design and manage Azure and Databricks environments. You will be responsible for deploying, supporting, and troubleshooting... 

    Scotiabank

    Dallas, TX
    3 days ago
  •  ...About the Company Role: SRE RunOps Engineer Location: Irving, TX Onsite...  ...Implement infrastructure best practices around reliability, scalability, and cost efficiency....  ...New Relic or similar APM/observability platforms. ~ Experience using additional tools... 
    Work experience placement

    Resolve Tech Solutions

    Irving, TX
    1 day ago
  • A global consulting firm is seeking a Cloud Engineer to lead crucial infrastructure projects, focusing on building and operating cloud solutions with Kubernetes and Azure. The role involves collaborating across teams, automating infrastructure provisioning, and ensuring... 

    Ernst & Young Oman

    Dallas, TX
    1 day ago
  • $107.48k - $143.31k

     ...locations across the U.S. and Canada with approximately 9,000 employees. What You'll Be Doing Lead and apply regional reliability engineering strategies to improve equipment performance, uptime, and maintenance cost effectiveness across multiple cement plants.... 
    Temporary work
    Remote work
    Flexible hours

    51905 HROC LLC

    Irving, TX
    2 days ago
  •  ...JOB SUMMARY The ASU Reliability Engineering Manager will manage the reliability engineering program, manage engineering projects, and provide technical expertise to support a safe, efficient, and reliable ASU (Air Separation Unit) production operation. This role leads... 
    Work experience placement
    Work at office
    Work from home
    Weekend work
    Afternoon shift

    Matheson Tri-Gas

    Irving, TX
    4 days ago
  •  ...Solutions LLC is seeking a Data Application Production Support Engineer in Dallas, Texas. This role involves ensuring mission-critical...  ...candidate will have strong skills in Python, Databricks, SQL, and Azure, along with a proactive approach to monitoring and incident... 

    Glint Tech Solutions LLC

    Dallas, TX
    1 day ago
  •  ...U.S. Bank is looking for a Senior Software Engineer – DevOps based in Irving, TX. The role involves providing Tier 3 production support...  ...automation solutions, and leading cloud migration projects to Microsoft Azure. The ideal candidate will have at least 5 years of experience... 

    U.S. Bank

    Irving, TX
    3 days ago
  •  ...Senior DevOps Platform Engineer As a Senior DevOps Platform Engineer, you will play a critical role in ensuring the reliability, scalability, security, and performance of Berkley's software systems. You will collaborate closely with product engineering, infrastructure... 

    W. R. Berkley

    Irving, TX
    1 day ago
  •  ...are seeking a skilled DevOps Engineer to maintain, optimize, and evolve...  ...SaaS talent intelligence platform. You will own the full deployment...  ...environments on Microsoft Azure, ensuring high availability,...  ...critical to maintaining platform reliability and enabling the engineering... 

    Cnected

    Dallas, TX
    4 days ago
  • $89.6k - $167.6k

     ...cloud programs operating within Azure Government environments. It...  ...for leading the delivery of platform and infrastructure...  ...as a bridge between product, engineering, and platform teams to advance...  ..., and automation to improve reliability and delivery effectiveness... 
    Summer holiday
    Local area
    Flexible hours
    Shift work

    EY

    Dallas, TX
    5 days ago
  • Cnected is seeking a skilled DevOps Engineer in Dallas, Texas to maintain and optimize the cloud...  ...infrastructure for their SaaS talent intelligence platform. The role involves full deployment lifecycle management on Microsoft Azure, ensuring high availability and security.... 

    Cnected

    Dallas, TX
    5 days ago
  • A prominent technology solutions company is seeking an experienced Azure Cloud Engineer in Southlake, Texas. The ideal candidate must possess over 8 years of experience in Azure Cloud Engineering, including expertise in Bicep coding, Infrastructure as Code, and Azure DevOps... 

    Compunnel, Inc.

    Southlake, TX
    2 days ago
  • Must have 8 plus years of Azure Cloud Engineering Must have experience with Bicep coding Must have experience with Infrastructure as a Code Must have experience with Arm template was a predecessor Must have experience with CLI and transition over to Bicep Must have experience... 

    Compunnel, Inc.

    Southlake, TX
    2 days ago
  • $119.8k - $234.7k

     ...cloud storage services? Join the Azure Storage team and be part of a...  ....? As a Senior Software Engineer in the Azure Storage front...  ...product, application, service, or platform. Creates, implements,...  ...will improve the availability, reliability, efficiency, observability, and... 
    Ongoing contract
    Work at office
    Local area
    Remote work

    Microsoft Corporation

    Irving, TX
    6 days ago
  •  ...leading healthcare fintech platform. We remove financial barriers...  ...infrastructure, tooling, and engineering culture to scale both our...  ...software engineering, and site reliability. This is not a traditional...  ...codebases (primarily .NET / Azure environments) where needed... 
    Work at office
    Immediate start
    3 days per week

    Wellfit Technologies

    Irving, TX
    a month ago
  • The Intersect Group in Dallas is seeking a Senior Data Engineer to lead data integration and architect scalable data platforms. This role emphasizes mentoring junior engineers and optimizing ETL processes using Azure Data Factory. The ideal candidate will have over 7... 

    The Intersect Group

    Dallas, TX
    3 days ago
  • $100.8k - $170k

     ...Sirius XM in Irving, Texas, is looking for a Senior Software Engineer to enhance platform productivity. You'll focus on cloud-native development, improving tools, and supporting teams in software deployment. With 5+ years in software engineering, expertise in AWS,... 

    SiriusXM

    Irving, TX
    3 days ago
  • $100.8k - $170k

     ...How you'll make an impact The Platform Engineering team is seeking a Senior Software Engineer to help further our vision of making it effortless...  ...AI‑driven tooling into the platform to enhance efficiency, reliability, and scalability, leveraging AI to optimize infrastructure... 
    Temporary work
    Local area

    Sirius XM Radio Inc

    Irving, TX
    3 days ago
  • $100.8k - $170k

     ...SiriusXM Radio, Inc. is seeking a Senior Software Engineer to enhance platform development and management. The role focuses on building reusable components and streamlining cloud infrastructure management while driving developer productivity. The ideal candidate has... 

    Sirius XM Radio Inc

    Irving, TX
    3 days ago
  • $100.8k - $170k

     ...SiriusXM is seeking a Senior Software Engineer in Irving, Texas, to enhance platform tooling and streamline cloud infrastructure management. In this role, you'll write high-quality code, lead code reviews, and collaborate closely with internal teams. Ideal candidates... 

    SiriusXM

    Irving, TX
    3 days ago
  •  ...asks for money during its onboarding process. Job Title: Platform Engineer & Production Support Contract Length: 12+ months Location...  ...role focuses heavily on application production engineering, reliability, observability, and operational excellence within cloud-... 
    Contract work
    Remote work
    Visa sponsorship

    Leading Utilities Organization

    Irving, TX
    9 days ago
  •  ...Job Description Summary: The Windows Platform Engineer is responsible for designing, engineering, and operating enterprise Windows-...  ...Required Licenses or Certifications: Win certifications Azure, AWS, GCP certifications Any technical certifications... 
    Work at office

    ADT

    Irving, TX
    3 days ago
  •  ...Overview: GenAI Adoption - Platform Engineer Design and operate GenAI platform environments. Manage Kubernetes-based AI platforms...  ...Enable multi-tenant GenAI usage. Ensure scalability and reliability. GenAI platform configurations. Deployment pipelines.... 

    Trans.eu

    Irving, TX
    17 hours ago
  •  ...Gartner in Irving, Texas seeks a Director of Software Engineering to lead platform integrations for C-level executive experiences. You will guide cross regional teams to deliver scalable solutions and ensure engineering excellence. The role requires 12-15 years in... 
    Remote work

    Gartner

    Irving, TX
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Platform Reliability Engineer, Azure. Be the first to apply!