SRE
RIT Solutions
SRE Hybrid - Malvern, PA
As a Senior Reliability Engineer, you will play a critical role in solving impactful operational problems. You are curious and take a proactive approach to identifying problems and making improvements. You balance innovative thinking with pragmatism and understand the long-term impacts of technical decisions. You communicate complex ideas clearly and collaborate effectively to deliver scalable solutions.
Core Responsibilities
The team focuses on automating incident response and infrastructure management. Java and Python receive the strongest emphasis, but candidates with solid programming fundamentals in any language and the ability to adapt will be considered. Experience with AWS and event-driven architectures is also valuable. Familiarity with observability concepts (e.g., distributed tracing) and tools such as Prometheus or Grafana is beneficial but not mandatory. An understanding of the underlying principles, such as instrumentation and monitoring strategies, matters more.
- Improve resiliency engineering practices across platforms and applications, including resilient application design patterns, system observability, and deployment strategies.
- Detect, troubleshoot, and resolve incidents.
- Develop automation for incident response and infrastructure management.
- Develop and support OpenTelemetry integrations for multiple application platforms (browser, ECS, Lambda, etc.) and languages (JavaScript, Java).
- Contribute to architectural decisions and support implementation of solutions.
Skills and Qualifications
- Deep knowledge of Java or JavaScript. Practical experience developing and operating software in distributed systems environments.
- Problem-solving and analytical thinking: ability to diagnose complex issues and propose efficient solutions. Strong debugging and optimization skills for performance and scalability.
- Cloud platforms: hands-on experience with AWS services and cloud infrastructure.
- System architecture and design: ability to design scalable, secure, and maintainable systems.
- Working knowledge of Python (or a similar scripting language).
- Strong knowledge of resiliency engineering techniques for both platforms and applications.
- Experience troubleshooting complex production issues and implementing effective mitigations.
- Familiarity with the OpenTelemetry specification and core APIs.
From a screening perspective, we recommend focusing on:
- How candidates approach software releases and validate functionality
- Their understanding of system dependencies and fault tolerance
- Experience with diagnosing and resolving production issues
- Their ability to reflect on past incidents and identify improvements
- Evidence of systems thinking and architectural awareness

