Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead Software Engineer (Site Reliability)

Vanguard

Lead Software Engineer (Site Reliability)

Apply (

locations

Malvern, PA

time type

Full time

posted on

Posted 30+ Days Ago

job requisition id

173421

Shape the Future of Observability at Vanguard

At Vanguard, we pride ourselves on delivering an exceptional client experience to all investors; at the core of this experience are systems that reside in a technically complex and constantly evolving resiliency landscape. Passionate, technically skilled engineers are at the center of our resiliency operations, and we are looking to grow our team.

We are seeking an experienced engineer with broad, end-to-end software development experience, including operating applications in a microservices environment in production at scale. This role goes beyond feature implementation - it requires someone who can design, build, and support resilient systems from the ground up.

As a Senior Reliability Engineer at Vanguard, you will play a critical role in solving impactful operational problems. You are curious and take a proactive approach to identifying problems and making improvements. You balance innovative thinking with pragmatism and understand the long-term impacts of technical decisions. You communicate complex ideas clearly and collaborate effectively to deliver scalable solutions.

Core Responsibilities

  • Improve resiliency engineering practices across platforms and applications, including resilient application design patterns, system observability and deployment strategies

  • Incident detection, troubleshooting, and resolution.

  • Develop automation for incident response and infrastructure management

  • Develop and support OpenTelemetry integrations for multiple application platforms (browser, ECS, lambda, etc) and languages (JavaScript, Java)

  • Contribute to architectural decisions and support implementation of solutions.

Skills and Qualifications

  • Deep knowledge of Java or Javascript. Practical experience developing and operating software in distributed systems environments.

  • Problem-solving and analytical thinking: ability to diagnose complex issues and propose efficient solutions. Strong debugging and optimization skills for performance and scalability.

  • Cloud platforms: Hands-on experience with AWS services and cloud infrastructure

  • System architecture and design: ability to design scalable, secure, and maintainable systems.

  • Working knowledge of Python (or similar scripting language).

  • Strong knowledge of resiliency engineering techniques for both platforms and applications.

  • Experience troubleshooting complex production issues and implementing effective mitigations.

  • Familiarity with OpenTelemetry specification and core APIs.

Special Factors

Sponsorship

Vanguard is not offering visa sponsorship for this position.

About Vanguard

At Vanguard, we don't just have a mission—we're on a mission.

To work for the long-term financial wellbeing of our clients. To lead through product and services that transform our clients' lives. To learn and develop our skills as individuals and as a team. From Malvern to Melbourne, our mission drives us forward and inspires us to be our best.

How We Work

Vanguard has implemented a hybrid working model for the majority of our crew members, designed to capture the benefits of enhanced flexibility while enabling in-person learning, collaboration, and connection. We believe our mission-driven and highly collaborative culture is a critical enabler to support long-term client outcomes and enrich the employee experience.

Similar Jobs (5)

Senior Reliability Engineer

locations

2 Locations

time type

Full time

posted on

Posted 30+ Days Ago

Senior Application Engineer

locations

Malvern, PA

time type

Full time

posted on

Posted 22 Days Ago

Application Engineering Technical Lead - II

locations

Malvern, PA

time type

Full time

posted on

Posted 7 Days Ago

View All 5 Jobs

About Us

Vanguard, one of the world's leading investment management companies, serves individual investors, institutions, employer-sponsored retirement plans, and financial professionals. We have a diverse and talented crew with a culture that promotes teamwork, along with an unwavering focus on serving our clients' best interests.

This website uses "cookies" to distinguish you from other users. A cookie is a small file of letters and numbers placed on your computer or device. This helps us to provide you with a good experience when you browse our website and also allows us to improve our site and services. The cookies are stored locally on your computer or mobile device. To accept cookies you can continue browsing as normal. Or you can go to ourPrivacy Policy ( to read more information and learn how to change your preferences.

Read More

Vacancy posted more than 2 months ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Software Engineer (Site Reliability). Be the first to apply!