Senior Platform Engineer (Observability & Telemetry)

OneMain Financial

We're seeking a Senior Platform Engineer (Observability & Telemetry) to join a high-performing Monitoring Engineering team within a fast-paced financial technology organization. In this role, you will apply SRE principles to design, build, and evolve monitoring and observability capabilities that ensure the reliability, performance, and operability of core applications and infrastructure.

You will partner closely with application, platform, and development teams to implement data-driven alerting, SLO/SLA-based monitoring, telemetry pipelines, dashboards, correlations, and automated remediation. Your work will directly improve system reliability, reduce MTTR, and enhance enterprise-wide operational insight.

This role requires strong analytical thinking, systems engineering discipline, and a proactive approach to identifying risks, preventing incidents, and driving continuous improvement across the production ecosystem.

Key Responsibilities

Design, Build, and Maintain Monitoring & Observability Solutions

  • Architect, deploy, and operate OpenTelemetry-based telemetry pipelines, including instrumentation standards, collector configurations, sampling strategies, and routing to Elastic and other backends.

  • Develop and maintain instrumentation, telemetry, and alerting for the Enterprise Monitoring Center using industry-leading tools, such as:

    • Grafana, OpsRamp, Elastic Stack, BigPanda

    • AWS CloudWatch, Azure Monitor

  • Drive observability standards and best practices across multiple engineering teams through influence, documentation, and partnership rather than direct authority.

  • Apply SRE best practices to ensure measurable SLIs/SLOs, reliability dashboards, and health indicators for critical systems.

  • Integrate and manage OpenTelemetry for distributed tracing and telemetry data collection, enabling end-to-end visibility of business-critical transactions.

Collaboration & Project Participation

  • Collaborate with application development teams to define and document observability requirements for each project or release, ensuring accurate and actionable monitoring and tracing are in place for every step of business-critical workflows.

  • Embed reliability considerations early in the SDLC, including SLO definitions, instrumentation needs, and failure-mode awareness.

  • Partner with product and engineering teams to use SLOs and error budgets to guide release decisions, prioritization, and toil reduction.

Alerting & Escalation Process

  • Define and maintain standardized alert payloads per engineering guidelines, ensuring alerts are actionable.

  • Partner with Level 2 and Level 3 support teams to reflect process changes in monitoring dashboards.

  • Maintain and optimize thresholds, ensuring seamless escalations via BigPanda as the central alert hub.

Dashboard Creation & Maintenance

  • Create and maintain intuitive, actionable dashboards for the Enterprise Monitoring Center and other finance teams.

  • Ensure dashboards are effectively monitored by Level 1 teams, presenting clear, actionable data that reduces MTTR.

Documentation, Governance & Reliability Standards

  • Develop and maintain technical documentation, runbooks, diagnostic guides, and observability standards across the enterprise.

  • Evaluate and refine release, deployment, and monitoring processes to support consistent, reliable delivery pipelines.

  • Mentor junior engineers and promote a culture focused on reliability, automation, and operational excellence.

Reliability Engineering, Automation & Continuous Improvement

  • Build automation frameworks for monitoring, alerting, self-healing workflows, and incident response to reduce toil and improve MTTR.

  • Drive system optimization through capacity analysis, performance tuning, and proactive detection of reliability risks.

  • Contribute to the automation of routine operational tasks to improve system reliability and engineer quality of life.

  • Advocate for and implement observability best practices across engineering teams.

  • Define, implement, and operationalize SLIs, SLOs, and error budgets for critical services.

  • Participate in and improve incident response processes, including detection, triage, escalation, and recovery.

Qualifications

Education: Bachelor's degree in computer science, IT, or a related field.

Experience

  • 5+ years of experience in software, systems, or reliability engineering roles, with multiple years of hands-on experience owning production observability, monitoring, and SLOs in distributed systems.

Required Skills

  • Deep experience building scalable, reliable monitoring and observability solutions, including instrumentation, alerting, dashboarding, and configuration across large, complex environments.

  • Hands-on expertise and proficiency with modern monitoring and observability tools (e.g., OpsRamp, Grafana, Elastic, CloudWatch, Azure Monitor, BigPanda (AIOps)), and strong knowledge of metrics, logs, traces, and OpenTelemetry.

  • Strong scripting and programming capability (Bash, PowerShell, and one or more languages such as Python, C-family, or JavaScript) to automate telemetry, alerting, and platform workflows.

  • Strong expertise with cloud platforms (AWS and/or Azure) and container orchestration systems (Kubernetes, Docker).

  • Deep hands-on experience with Elastic Observability (APM, Logs, Metrics, Traces).

  • Understanding of distributed systems fundamentals, including networking, security, databases, DevSecOps principles, and performance/capacity engineering.

  • Strong communication skills, with the ability to clearly explain complex technical topics to both technical and non-technical audiences.

  • Exceptional problem-solving and troubleshooting abilities, especially in high-pressure or time-sensitive environments.

  • Effective prioritization and multitasking, able to manage competing deadlines while maintaining quality and focus.

  • Proven cross-functional collaboration, working seamlessly with diverse teams in large, complex IT environments and driving continuous improvement across systems.

Preferred Qualifications

  • Experience with CI/CD pipelines and tools like Jenkins, GitHub, GitLab CI, or CircleCI.

  • Experience querying, manipulating, and visualizing time-series data.

  • Familiarity with Infrastructure as Code tools (e.g., Ansible, Terraform).

  • Knowledge of microservices architecture and event-driven systems.

  • Working knowledge of REST APIs, JSON, and ServiceNow.

  • Experience with cloud monitoring, particularly AWS or Azure.

OneMain Holdings, Inc. is an Equal Employment Opportunity (EEO) and Affirmative Action (AA) employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status.

Vacancy posted 3 days ago