Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Future Openings - SRE Support Engineer - Observability

Virtasant

SRE Support Engineer - Observability

While this position is not currently open, we are interviewing strong candidates for upcoming opportunities on this team.

Location: Remote | Time Zone: (US, Canada, Brazil, Chile, Colombia, Mexico) (8AM–5PM Pacific)

Freedom to grow. Power to deliver.
  Virtasant is a global technology services company delivering large-scale cloud, data, and engineering solutions across 130+ countries. We partner with some of the world’s largest organizations to help them build, operate, and scale internal platforms used by tens of thousands of engineers.

For this role, you will be supporting one of the most advanced internal developer platforms in the world, powering products used by hundreds of millions of people. The problems you will solve are deep, complex, and essential to keeping a global-scale organization moving.

Role Overview

The Observability & Tools Support Engineer provides high-impact technical support for customers of a large technology company’s internal IaaS platform, with a focus on monitoring, alerting, telemetry, and operational tooling .

This role spans a wide range of support—from white-glove onboarding and end-to-end customer enablement, to deep technical troubleshooting across Linux, networking, and observability systems (especially Prometheus and AlertManager ). You will also contribute to improving the support function itself: strengthening tooling, documentation, workflows, and feedback loops so the service scales.

Success depends on excellent troubleshooting, strong written communication, comfort working with highly technical customers, and the maturity to identify patterns and drive operational improvements beyond individual ticket resolution.

Business Outcome

Become a trusted frontline expert for the customer’s observability ecosystem and operational tooling - delivering fast, accurate support across Slack and tickets, improving monitoring reliability, and reducing incident impact through better triage, troubleshooting, onboarding, and knowledge capture.

Success Measures

  • Healthy volume of threads and tickets handled with high-quality outcomes

  • Consistent achievement of time-based SLAs

  • High customer satisfaction through surveys

  • Accurate classification of issue type, severity, and recurring patterns

  • Reduced repeat issues through better docs, tooling, and scalable onboarding

What Will Be True When You Succeed

  • Customers can onboard smoothly to monitoring/alerting with minimal friction

  • Monitoring and alerting issues are resolved quickly, with fewer escalations

  • Linux and networking-related incidents reach resolution faster due to strong troubleshooting and clean handoffs

  • Engineering and SRE teams receive clear, actionable feedback based on real customer trends

  • Knowledge base content prevents tickets and accelerates self-service

Core Work Units

1) Frontline Support for Observability & Tooling

  • Manage Slack threads and tickets (roughly 50/50)

  • Handle a broad range of customer support: simple issue resolution through end-to-end onboarding

  • Provide clear, structured guidance to highly technical customers

  • Maintain strong attention to detail while managing multiple interactions in parallel

2) Deep-Dive Troubleshooting & Incident Support

  • Troubleshoot, isolate, and resolve monitoring and alerting issues (especially Prometheus + AlertManager )

  • Troubleshoot complex Linux and networking issues (TCP/IP fundamentals required)

  • Support OpenTelemetry, tracing, and telemetry pipelines , including investigation of gaps in signals and instrumentation

  • Drive incidents to resolution in partnership with Engineering/SRE teams

3) Documentation & Knowledge Development

  • Build and maintain customer-facing and internal knowledge base articles

  • Create informational posts for the community support platform

  • Turn repeated issues into reusable guides, checklists, and onboarding playbooks

4) Trend Analysis & Feedback to Engineering

  • Analyze and categorize customer interaction trends

  • Provide accurate, meaningful feedback to Engineering and SRE orgs to improve product/tooling

  • Identify “top offenders” and propose practical fixes (tooling, docs, process, product)

5) Operational Excellence & Continuous Improvement

  • Participate in post-mortem reviews and drive follow-through on improvements

  • Contribute meaningfully to team objectives and goals (process, tooling, and service scaling)

  • Bring creativity and discretion to resolve highly complex issues “outside the box”

High-Quality Work - what top performance looks like

Frontline Support

  • Moves smoothly from triage to deeper analysis without losing the customer

  • Communicates clearly and confidently with technical users

  • Maintains clean follow-ups and thread hygiene even with high context switching

Troubleshooting

  • Rapidly isolates issues across monitoring/alerting configs, Linux runtime behavior, and network connectivity

  • Uses structured approaches to incident handling: hypothesis → test → evidence → resolution

  • Produces high-signal writeups that accelerate downstream resolution

Documentation & Enablement

  • Documentation is clear enough that customers avoid opening tickets

  • Onboarding flows reduce time-to-value and prevent common misconfigurations

  • Captures “tribal knowledge” quickly and makes it reusable

Operational Excellence

  • Obsessing over details: correct severity, accurate tagging, clean timelines, strong handoffs

  • Spots patterns early and proactively proposes improvements that scale support

Typical Day / Work Patterns

  • ~50% Slack support, ~50% ticket handling

  • Deep-dive investigations during lower ticket volume periods

  • Documentation writing and lightweight tooling/process improvements when patterns emerge

  • Weekly team review of escalations, themes, and operational improvements

  • High rate of context switching and parallel issue management

Required Skills & Experience (Non-Negotiable)

  • Several years supporting highly scalable applications and web services

  • Hands-on experience with open-source observability and cloud-native tooling, including:

    • Kubernetes (and container fundamentals)

    • Prometheus and AlertManager troubleshooting

    • OpenTelemetry and distributed tracing concepts

  • Strong understanding of the Linux operating system (command line, process/network debugging, logs)

  • Good understanding of infrastructure observability principles (signals, alerting strategy, SLO thinking, noise reduction)

  • Good understanding of the TCP/IP suite and practical networking troubleshooting

  • Strong experience troubleshooting ambiguous, multi-layer issues

  • Excellent analytical capability and strong attention to detail

  • Strong written and verbal communication (clear, structured, customer-friendly)

  • Comfortable working with a very technical customer base

  • Passion for Technical Support and a service mindset

Nice-to-Haves

  • Experience improving or supporting internal support tooling or workflows (automation, templates, runbooks)

  • Experience operating at scale in a services environment (pattern detection, KPI/SLA awareness, operational process maturity)

  • Familiarity with Grafana, log aggregation, incident tooling, and production support practices

  • Prior SRE or platform support experience

Minimum Qualifications

  • 3–7+ years in Technical Support Engineering, SRE support, DevOps, Platform Support, or similar

  • Demonstrated experience supporting distributed systems, IaaS, or cloud platforms

  • Strong Linux, troubleshooting, and customer-facing communication background

  • Evidence of documentation, knowledge-base contributions, and process improvement mindset

Disqualifiers: weak Linux fundamentals, inability to troubleshoot systematically, poor written communication, or discomfort supporting highly technical users.

What You’ll Love

  • Real technical problem solving with tangible customer impact

  • A role that blends deep troubleshooting with scaling support via docs, tooling, and process

  • High autonomy in a remote-first environment

What May Be Challenging

  • High context switching and managing multiple threads in parallel

  • Repeated patterns that require discipline to convert pain into scalable improvements

  • Supporting high-visibility systems where speed and accuracy matter

Differentiation

Industry: Remote-first, trust-based culture; global team; autonomy; modern systems; meaningful technical challenges

Internal: High-impact, customer-facing observability support; direct influence on tooling and process maturity; opportunity to shape scalable support practices

Vacancy posted a month ago
Similar jobs that could be interesting for youBased on the Future Openings - SRE Support Engineer - Observability in Austin, TX vacancy
  • $184k - $287.5k

    Senior System Software Engineer - Data Platform Observability page is loaded## Senior System Software Engineer...  ...platforms such as Apache Spark, Elastic/Open Search, Grafana, Prometheus, and...  ...value diversity in our current and future employees, we do not discriminate (including... 
    Suggested

    NVIDIA Corporation

    Austin, TX
    16 hours ago
  • $47.85 - $57.85 per hour

     ...posting will be posted on 02/05/2026 and open for at least 3 days. Accenture Flex offers...  ...process are not a guarantee of future or continued accommodations once hired....  ...needs such as for a disability or religious observance, please call us toll free at 1 (877) 889-... 
    Suggested
    Hourly pay
    Work experience placement
    Live in
    Work at office
    Local area
    Flexible hours

    Accenture

    Austin, TX
    4 days ago
  •  ...Build & Release Support Engineer – CI/CD  While this position is not currently open, we are interviewing strong candidates for upcoming opportunities on this team....  ...Monitoring tools (Prometheus/Grafana) Prior SRE experience Minimum Qualifications ~2–5 years... 
    Suggested
    Immediate start
    Remote work

    Virtasant

    Austin, TX
    a month ago
  • $152k - $241.5k

     ...Site Reliability Engineer - HPC page is loaded##...  .... Our work opens up new universes to...  ...looking for a Senior SRE to join our Compute...  ...building and supporting critical services....  ...auto-healing, E2E observability or data-driven operations...  ...our current and future employees, we do... 
    Suggested

    NVIDIA Corporation

    Austin, TX
    16 hours ago
  • $106.61k - $284.28k

     .... Manager, Frontline Support Engineering to lead our organization...  ...). Experience with Observability & Monitoring Tools...  ...Qualifications Experience in IT, SRE, DevOps, or Software...  ...Our people fuel our future. Our teams reflect...  ...window for this opening will close on: 07/20/... 
    Suggested
    Hourly pay
    Full time
    Temporary work
    Work experience placement
    Local area

    Hispanic Alliance for Career Enhancement

    Austin, TX
    16 hours ago
  • $78k - $112k

     ...We are searching for a Senior Customer Support Engineer in the AMR region who will be responsible...  ...Your Contribution: Be Yourself. Be Open. Stay Hungry and Humble, Collaborate and...  ...yourself and your loved ones, now and in the future. We believe that good health means more... 
    Full time
    Immediate start
    Remote work
    Work from home
    Flexible hours

    Logitech

    Austin, TX
    3 days ago
  • $35 - $50 per hour

     ...Technical Support Engineer (Premium Team) Austin | New York City Gong harnesses the power...  ...workflows into a single, trusted system that observes, guides, and acts alongside the world's...  ...passionate people. We are shaping the future of revenue intelligence and we want... 
    Hourly pay
    Full time
    Remote work
    Work from home
    Flexible hours

    Gong.io

    Austin, TX
    3 days ago
  •  ...Cornelis we're building the future of AI and HPC networking with...  ...software development. We're seeking engineers who are energized by working...  ...Contribution:Engage with the open-source community and...  ...Experience with monitoring and observability stacks like Prometheus, Grafana... 
    Full time
    Remote work
    Flexible hours

    Cornelis Networks

    Austin, TX
    4 days ago
  • $174.9k - $222k

     ...As a Senior Software Engineer on GM's Notification...  ...Improve system resiliency, observability, and operational...  ...sponsorship now or in the future. This includes direct...  ...or other immigration support from the company (e.g....  ...updates about GM, open roles, career insights... 
    Temporary work
    Work experience placement
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Austin, TX
    1 day ago
  • $152k - $230k

     ...we’re on a mission to engineer a frictionless, next-generation...  ...quo and build the future of travel tech, your...  ...: Design and support reliable backend services...  ...Weave monitoring and observability tools (such as Grafana...  ...and comfort when facing open-ended or ambiguous technical... 
    H1b
    Worldwide
    Flexible hours

    GrabJobs

    Austin, TX
    2 days ago
  •  ...About the Role As an Enterprise Support Engineer, you will be the primary technical authority...  ...enterprise WANs ~ Diagnostics & Observability: Analyze system logs, telemetry, and...  ...help you prepare for your financial future with our 401(k) plan. We prioritize... 
    Full time
    Work at office
    Remote work
    2 days per week
    3 days per week

    NinjaOne

    Austin, TX
    2 days ago
  •  ...dashboard. Our expert, live support team helps deliver exceptional...  ...- with big plans for the future. The Mission & The Upside We...  .... We treat IT as a strategic engineering multiplier, not a basement helpdesk...  ...stipend when working onsite Open communication (We won’t box... 
    Permanent employment
    Work at office
    3 days per week

    ePayPolicy

    Austin, TX
    16 hours ago
  • $135k - $165k

     ...Title: Senior Software Engineer - Foundational Services...  ...logistics companies, our open architecture is built...  ...That Starts With How We Support You At Snapsheet, we...  ...company match—because your future is worth investing in....  ...PTO and 7.5 company-observed holidays to recharge on... 
    Full time
    Temporary work
    Casual work
    Local area
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    GrabJobs

    Austin, TX
    16 hours ago
  • $160k - $190k

     ...seeking a Senior Software Engineer - AI Applications with...  ...about the current and future state of AI. You will...  ...vision and leverage open source technologies...  ...can build tooling to support model training, evaluation...  ...experience with AI observability, monitoring, and signaling... 
    Full time
    Work experience placement
    Work at office
    Local area
    Remote work
    Home office
    Flexible hours

    GrabJobs

    Austin, TX
    16 hours ago
  • $135k - $165k

     ...Title: Senior Software Engineer - Implementation Job...  ...logistics companies, our open architecture is built...  ..., developing, and supporting Snapsheet’s integration...  ...practices for reliability, observability, and performance....  ...company match—because your future is worth investing in.... 
    Full time
    Temporary work
    Casual work
    Local area
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    GrabJobs

    Austin, TX
    1 day ago
  • $116k - $195k

     ..., and thrive with our open, AI-driven commerce ecosystem...  ...together to shape the future of commerce, this is...  ...quality* Mentor other engineers in the current domain*...  ...quality and security* Support an open, positive, and...  ..., extensibility, and observability to ensure agents can... 
    Remote work

    BigCommerce

    Austin, TX
    16 hours ago
  • $146k

     ...Join Us? To shape the future of travel, people must...  ..., we foster an open culture where everyone...  ...mentor to more junior engineers, applies new engineering...  ...Designs easily testable and observable software....  ...range of benefits to support employees and their families... 
    Local area
    Flexible hours

    Expedia Group

    Austin, TX
    2 days ago
  •  ...Senior Software Engineer — Observability Imagine what you could do here. At Apple, new ideas have...  ...and performance of our services and the supporting infrastructure - an Observability...  ...engineering teams ~ Familiarity with SRE practices including SLOs/SLIs, error budgets... 
    Worldwide

    Apple

    Austin, TX
    1 day ago
  •  ...looking for a Senior Software Engineer excited about shaping the future of developer tooling. At...  ...and improve upon popular open source projects that...  ...Customer Success teams to support Coder’s enterprise user base...  ...: Typescript, Kotlin Observability: Prometheus, Grafana CI/CD... 
    Local area
    Remote work

    GrabJobs

    Austin, TX
    4 days ago
  • $175k - $250k

     ...Runpod is pioneering the future of AI and machine...  ...AI. We’re hiring an Engineering Manager to lead a high...  ...testing, release safety, observability for customer-facing...  ..., Program management, Support, GTM, and Infrastructure...  ...quality high. Open-source contributions in... 
    Remote work
    Home office
    Visa sponsorship
    Work visa
    Flexible hours

    GrabJobs

    Austin, TX
    16 hours ago
  • Sr. Software Engineer - Site Reliability About ShipperHQ: ShipperHQ...  ...product-led company shaping the future of e-commerce logistics....  ...practices, and automation to support and improve our complex cloud...  ...systems in AWS Build and maintain observability, monitoring, and logging... 
    Full time
    Work at office

    Zowta, LLC

    Austin, TX
    2 days ago
  •  ..., and we’re honored to support first responders. And...  ...Senior Site Reliability Engineer who can own our data tier...  ...the broader platform, observability with Prometheus, Loki,...  ...that looks like for an SRE and excited to help shape...  ...engineer will actually open. You write code that... 
    Permanent employment
    Local area
    Flexible hours

    Zello

    Austin, TX
    1 day ago
  • $155.42k - $205.9k

     ...use cases. Our platform supports the serving of state-...  ...ML Infrastructure engineer to help build and scale...  ...opportunity to influence the future of AI infrastructure...  ...of monitoring, observability, and metrics to ensure...  ...practices. Contribute to open source projects;... 
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Austin, TX
    16 hours ago
  •  ...Technology Services enables the future of how clients manage their...  ...financial planning. This engineering role supports the business by الأرد...  ...methodologies, and platform SRE practices. Ability to work...  ...patterns, CI/CD pipelines, and observability practices. What’s in it... 

    Charles Schwab

    Austin, TX
    4 days ago
  • $90k - $120k

     ...within Molex is seeking a Technical Product Support Engineer to join our global Business Development...  ...-minded professional to help shape the future of RF connectivity solutions across a...  ..., we are entrepreneurs. This means we openly challenge the status quo, find new ways... 
    Flexible hours

    Molex

    Austin, TX
    1 day ago
  • $148k - $185k

     ...As a Senior Solutions Engineer , you’ll be the technical...  ...and Customer Success Support seamless handoffs into...  ...share their discoveries openly, and help define best practices...  .../statutory holidays observed 4 BetterUp Inner Work...  ...may be modified in the future. The base salary range... 
    Work experience placement
    Summer holiday
    Live out
    Work at office
    Local area
    Flexible hours
    2 days per week

    BetterUp

    Austin, TX
    16 hours ago
  • $110k - $216k

     ...and thrive with our open, AI-driven...  ...together to shape the future of commerce, this...  ...Lead Infrastructure Engineer** at Commerce, you...  ...software engineering and SRE principles to...  ...and innovate the observability of Commerce’s platform...  ..., operating, or supporting large-scale Linux... 
    Full time
    Remote work

    BigCommerce

    Austin, TX
    5 days ago
  • $180k - $225k

     ...time. Make the system observable : You’ll improve diagnostics...  ..., platform, and support teams. When something...  ...Observability tooling Open-source systems people...  ...runs entirely on AWS. Engineers develop by using remote...  ...the need for current or future sponsorship.... 
    Permanent employment
    Full time
    Work at office
    Local area
    Immediate start
    Remote work
    Home office
    Flexible hours

    GrabJobs

    Austin, TX
    4 days ago
  • $93k - $156k

     ...grow, and thrive with our open, AI-driven commerce...  ...together to shape the future of commerce, this is the...  ...searching for a Software Engineer II - Infrastructure...  ...improvements* Provide support for the development environment...  ...monitor statistics to observe tooling health and... 
    Work experience placement
    Remote work

    BigCommerce

    Austin, TX
    16 hours ago
  •  ...perspectives. Join us as we shape the future of AI and beyond. Together,...  ...for an influential software engineer who is passionate about...  ...a bias toward automation and observability Create and maintain...  ...or Gerrit Contributions to open source projects are a definite... 

    Advanced Micro Devices , Inc.

    Austin, TX
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Future Openings - SRE Support Engineer - Observability. Be the first to apply!