Site Reliability Engineer IV
Capitolis
Candescent is the leading cloud-based digital banking solutions provider for financial institutions. We are transforming digital banking with intelligent, cloud-powered solutions that connect account opening, digital banking, and branch experiences for financial institutions. Our advanced technology and developer tools enable seamless, differentiated customer journeys that elevate trust, service, and innovation. Success here requires flexibility in a fast-paced environment, a client-first mindset, and a commitment to delivering consistent, reliable results as part of a performance-driven, values-led team. With team members around the world, Candescent is an equal opportunity employer. Position: Site Reliability Engineer IV Experience: 9-12 Years Location: Bangalore (Ecospace) Candescent Site Reliability Engineering (SRE) mission is to proactively ensure the reliability, availability and performance of our Digital First banking applications. As a member of the SRE team, you will focus on building and operating highly reliable application platforms by applying SRE principles such as automation, observability, resilience and continuous improvement. You will partner closely with application and platform teams to define reliability standards, implement monitoring, alerting and incident response practices and embed scalability and performance considerations into application design and delivery. Through tooling, automation, and best practices, you will help development teams build and operate services that meet agreed reliability objectives. As a senior engineer in the organization, you will also provide mentorship within the SRE team and across peer engineering teams, helping elevate operational maturity, drive adoption of SRE practices, and strengthen reliability culture across our core initiatives. Responsibilities Support and operate production applications running on Kubernetes and AWS Troubleshoot application-level issues using logs, metrics, traces, and runtime signals Participate in incident response, root cause analysis, and post-incident reviews Work closely with development teams to understand application architecture, dependencies, and data flows Improve application observability by defining meaningful alerts, dashboards, and SLOs Automate repetitive operational tasks to reduce toil Support application deployments, rollbacks, and runtime configuration changes Identify reliability, performance, and scalability gaps in application behavior Drive continuous improvements in operational readiness, runbooks, and on-call practices Influence application teams to adopt shift-left reliability practices Must-Have Skills & Experience Hands-on experience supporting Java applications in production Strong understanding of JVM fundamentals (heap/memory management, garbage collection, OOM issues, thread analysis) Proven experience with SRE practices, including: Incident response and on-call support Root cause analysis and postmortems SLIs, SLOs, and reliability-driven operations Strong experience troubleshooting using application logs, metrics, and monitoring tools Experience operating Java applications on Kubernetes (EKS) from an application/runtime perspective Experience with deployment strategies (rolling, blue/green, canary) Ability to write automation and scripts (Python or any) to reduce operational toil Solid understanding of application architecture and service dependencies (databases, messaging systems, external APIs) Strong collaboration and communication skills; ability to work closely with development teams Demonstrates accountability and sound judgment when responding to high-pressure incidents Good-to-Have Skills & Experience Exposure to platform or infrastructure concepts supporting application workloads Experience with AWS services such as EKS, RDS/Aurora, S3, EFS, and CloudWatch CI/CD pipeline experience (GitHub Actions, Jenkins) Familiarity with GitOps practices Experience with cloud migrations or modernization efforts #J-18808-Ljbffr
- ...Business consulting services. We are in search of a highly motivated candidate to join our talented Team. Job Title: Software Dev Engineer IV Location: Herndon, VA Job Responsibilities: Design, develop, implement, test, document and deploy full-stack,...Suggested
- ...Systems Engineer IV – Blue Coat/Proxy PlanIT Group is seeking a Systems Engineer IV – Blue Coat/Proxy to support our Federal customer... ...monitoring and optimization in maintaining a secure, efficient, and reliable network infrastructure through their expertise in proxy server...SuggestedPermanent employment
$146.2k - $228.4k
...Responsibilities Noblis ESI is seeking a Systems Engineer IV to support our government customer on-site in Chantilly, VA. As a Systems Engineer IV you... ...sensitive environments. You will ensure the reliability, security, and robustness of mission-critical infrastructure...SuggestedFull timeContract workPart timeLocal areaRemote workFlexible hours- ...and enable mission success. We are currently seeking a DevOps Engineer IV to support a federal customer, contingent upon contract award.... ...operational excellence. This role requires a TS-SCI w/ CI Poly and is on-site in Reston VA. Responsibilities Design, build, and maintain...SuggestedContract work
- ...innovative solutions to protect against evolving cyber threats. Learn more about us at Position Overview We are seeking a DevOps Engineer-IV to architect and oversee advanced DevOps strategies and frameworks. This expert‑level role requires a proven ability to design...Suggested
$86.8k - $198k
...pursue a balanced, fulfilling life. YOUR CANDIDATE JOURNEY Discover what to expect during your journey as a candidate with us. Site Reliability Engineer, Senior The Opportunity Engineering to make a system more resilient and efficient frees up time and money to build more...Full timeContract workPart timeLocal area$147.4k - $221.2k
...United States citizens (naturalized or native).****May be required to be on site at client locations in the DC, MD, and VA (DMV) area.**We are looking for a highly motivated Site Reliability Engineer to join our growing Infrastructure and Federal Platform Engineering team....Work experience placementWork at officeRemote workHome officeFlexible hours$135.8k - $183.8k
...dynamic and flexible work environment with competitive benefits and the ability to grow your career. We are looking for a Site Reliability Engineer to support our team responsible for building, managing, maintaining, deploying, and securing mission-critical services to...Work at officeFlexible hours$137k - $205.4k
Workday is looking for a skilled individual to support one or more contracts with the U.S. Federal Government, primarily in Reston, VA. This role involves managing team services and collaborating with development teams to enhance operations. Applicants must have at least...Flexible hours$147.4k - $221.2k
...you’ve found a match in Workday, and we hope to be a match for you too.About the TeamWe are looking for a highly motivated Site Reliability Engineer to join our growing Infrastructure and Platform Engineering team. You will play a critical role in operating, monitoring,...Work at officeRemote workHome officeFlexible hours$164.3k - $222.3k
...your career. This position is based in our Reston, VA office and offers a hybrid work schedule. Verisign is hiring a Senior Site Reliability Engineer to help lead a team responsible for building, managing, maintaining, and scaling the Linux infrastructure on which our...Work at officeFlexible hours$99k - $225k
...Site Reliability Engineer page is loaded## Site Reliability Engineerlocations: Herndon, VAtime type: Full timeposted on: Posted Todaytime left to apply: End Date: July 2, 2026 (30+ days left to apply)job requisition id: R0237101Site Reliability Engineer**The Opportunity...Full timeContract workPart timeWork at officeLocal areaRemote work- ...Job Description Job Description Apply now: Site Reliability Engineer (DevOps/SRE), location is Remote. The start date is Targeting June 29 for this 12 month contract position. Job Title: Site Reliability Engineer (DevOps/SRE) Location-Type: 100% Remote Start...Contract workRemote work
- ...Job Description Job Description Site Reliability Engineer LOCATION: Reston, VA SUMMARY OF POSITION The Site Reliability Engineer (i.e., “SRE”) role is responsible for the optimization and reliability of core technical platforms and platform services,...Work at office
- ...straightforward communication and clinical domain expertise, Commence cuts straight to better care. Requirements: As a Senior Site Reliability Engineer at Commence, you will own the reliability, scalability, and operational health of our mission-critical healthcare data...Remote work
$125k - $200k
Overview As a Site Reliability Engineer (SRE) , you will help design, build, and operate reliable, secure, and observable cloud‑native systems that support mission‑critical applications and services. You will blend software engineering, DevOps practices, and infrastructure...Local area2 days per week- ...Overview Site Reliability Engineer – ServiceNow at Visa – Ashburn, VA, United States Company Description Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions,...Work experience placementInternshipWork at officeLocal area
$99k - $225k
...A leading consultancy firm in Herndon is looking for an experienced Site Reliability Engineer. This role involves leading the development of robust systems for the Intelligence Community by building resilient infrastructure. Candidates must have 8+ years of relevant experience...Remote work- ...infrastructure and/or service, sharing guidance on practices and terms for reliability and functionality. Supervises team members and provides... ...standards for developing and maintaining knowledge of site reliability trends and sharing valuable insights and information...Immediate start
- ...DevOps Site Reliability Engineer (TS/SCI) Reston, VA, USA Full-time Clearance: Top Secret/SCI Primary Role: As a site reliability engineer on our team, you’ll work with the DoD on the development of more robust systems by building a resilient infrastructure. You’ll build...Full timeWork at officeRemote work
- A leading technology company is seeking a Senior Site Reliability Engineer in Virginia. The role involves maintaining a Kubernetes-based platform, ensuring high availability, and automating infrastructure processes with tools like Terraform. The ideal candidate will have...Remote jobFlexible hours
$55.2k - $126k
...what to expect during your journey as a candidate with us. Engineering to make a system more resilient and efficient frees up time... ...have a passion for making systems better, we need you! As a site reliability engineer on our team, you’ll help our Platform Engineering team...Full timeContract workPart timeLocal areaRemote work- ...Web Designer / Developer - IV/JavaScript Developer JavaScript developer with strong database skills Designs and codes from specifications, analyzes, evaluates, tests, debugs, documents, and implements moderately complex to highly complex software applications. Under...
$100k - $140k
...Network/Telecom Engineer - Level IV Location: Silver Spring, MD with frequent visits to lab... ...and require a SLEP to maintain the high reliability and continuity of operations required... ...processing for all NWS, US Navy and FAA sites converting to 1.5 communications...Contract workWork at office- ...A leading technology firm is seeking a Site Reliability Engineer to support U.S. Federal Government contracts in Reston, Virginia. You'll be responsible for maintaining the Kubernetes-based platform, ensuring high availability through automation and collaboration. The...
$146.2k - $228.4k
...software development experience with a strong emphasis on front-end engineering, along with deep proficiency in Angular frameworks, Angular... ...benefits by visiting the Benefits ( page on our Careers ( site. Compensation at Noblis is determined by various factors, including...Full timeContract workPart timeLocal areaRemote work$55.2k - $126k
A leading consulting firm in McLean, Virginia, is seeking a Site Reliability Engineer to enhance system resilience and efficiency. Key duties include developing robust infrastructure, implementing automation, and reducing manual tasks. The role requires experience with...Remote job- ...in! GA-Intelligence is looking for experienced Senior DevOps Engineers with Top Secret/SCI clearance to help create and develop a new... ..., and upgrades our existing product(s) so that they can more reliably deliver to customer environments. Maintains development, testing...Full timePart timeLocal areaRelocation package
$62k - $141k
Phase2 Technology is seeking an Operations Facilitator to enhance system resiliency and efficiency. The role involves building resilient infrastructure, implementing monitoring tools, and automating routine tasks. Candidates should have a strong background in Linux systems...- ...solutions that protect and enable mission success. We are currently seeking a Cybersecurity Engineer IV to support a federal customer. This role requires a TS-SCI CI Poly and is on-site in Reston, Virginia. This role is responsible for designing, implementing, and...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer IV. Be the first to apply!
- site leader Sterling, VA
- site safety Sterling, VA
- on-site clinical research associate (traveling/remote) Sterling, VA
- junior website developer Sterling, VA
- IT site lead Sterling, VA
- lead site reliability engineer
- site reliability engineer remote
- site reliability engineer sre
- site reliability engineer
- site reliability engineering manager


