Site Reliability Engineer
Kaav Inc.
Job Description:
Additional Skills : Automation Process Engineer,Site Reliability Engineer,Full Stack DeveloperThis is a high PRIORITY requisition. This is a PROACTIVE requisition
- DevOps
- Docker/Kubernetes
- Splunk/Dynatrace
- Ansible/GitHub
- Java Full Stack Applications Deployment
- Responsible for reliability and support of Container Platform on-prem and external clouds (Azure /AWS /Google)
- Monitor and troubleshoot Container platform environment performance issues, connectivity issues, security issues, etc.
- Perform deep dives into systemic and latent reliability issues, Incident management, problem management
- Identifying, analyzing, and resolving infrastructure vulnerabilities and application deployment issues.
- Perform blameless RCA, partner with engineering and operation teams across the organization to roll out fixes.
- Responsible for application onboarding and provide troubleshooting support through the lifecycle of the applications on the container platform.
- Identify and drive opportunities to improve automation to reduce TOIL and improve operational excellence.
- Partner with risk, and compliance teams to bring visibility and implement right controls and remediation of vulnerabilities.
- Ensure resiliency during implementation and identify/fix resiliency problems by collaborating with engineering teams.
- Be a key stakeholder in the design of cloud services and work with Architecture, engineering, product teams
- Participate in 24x7 on-call coverage follow the sun model
- S /MS degree in Computer Science or related technical field involving systems or equivalent practical experience.
- Minimum 5+ years of hands-on experience supporting Kubernetes /Openshift / RKE / EKS Container platform.
- Experience with Python, Ansible, Golang, and shell scripting
- Experience with Splunk/Dynatrace
- Experience with Java Full Stack Applications Deployment
- Kubernetes /Openshift /Terraform certifications are a plus
- Strong experience in major services related to Compute, Storage, Network and Security
- Experience with monitoring tools like Prometheus and Dynatrace, as well as cloud native tools like Azure Monitor and Log Analytics
- Strong understanding and background of working with a complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and Ping Identity or other SSO solutions.
- Advanced knowledge of Linux OS, DNS, DHCP, Kerberos and Windows Authentication
- Experience with CI/CD tools git /Jenkins, GitOps model
- Excellent understanding of Linux /Windows operating systems administration
- Experience in Container security and vulnerability remediation.
- Experience with Ansible/GitHub
- Systematic problem-solving approach, sense of ownership and drive
- Ability to juggle competing priorities and adapt to changes in project scope.
- Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.
- Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities.
- Experience in Openshift, RKE, CSP Kubernetes services such as AKS and EKS
- Experience in Terraform, ArgoCD, Tekton, and K-native technologies.
- Experience in agile deployment methodologies (GitOps)
- Knowledge of various container runtimes
- Familiarity with the operator deployment pattern.
- Experience working in a highly available multi-datacenter environment
- Experience working with monitoring tools such as Prometheus, Splunk, Dynatrace, Sysdig, or similar tools.
- Understanding of cost management, inventory management, FinOps model
Additional Skills : Automation Process Engineer,Site Reliability Engineer,Full Stack DeveloperThis is a high PRIORITY requisition. This is a PROACTIVE requisition
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer in New York, NY vacancy
- ...About the job Senior Site Reliability Engineer About the Company Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient...Suggested
$123k - $165k
...Site Reliability Engineer II Our engineering fleet is a horizontal set of teams providing engineering services across the organization. Our specific team provides reliability engineering and operational support to backend service development teams. Technology is...Suggested$150k - $175k
...Site Reliability Engineer At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we're guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed...SuggestedRemote work- ...DevOps Engineer Responsible for reliability and support of container platform on-prem and external clouds (Azure /AWS /Google) Monitor and troubleshoot container platform environment performance issues, connectivity issues, security issues, etc. Perform deep dives...Suggested
$175k - $225k
...Site Reliability Engineer Chicago, IL or New York, NY Old Mission is a global proprietary trading firm that leverages state-of-the-art technology and research to identify and execute profitable trading strategies across multiple asset classes around the world. Our...SuggestedFull timeWork at officeRemote workMonday to FridayFlexible hoursRotating shift$100k - $250k
...financial markets. Role Roadmap As a member of Kalshi's engineering team, you'll help build the next-generation financial... ..., and evolve. What You'll Do Improve observability, reliability, and service availability by defining and measuring key metrics...Local area- ...DevOps Engineer DevOps teams in our Infrastructure Engineering group enable Company to continually disrupt the Insure tech space. Our teams build, maintain and deliver infrastructure that enables Company Life product teams to ship industry leading and innovative systems...
$165k - $225k
...operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog... ...taking a look here. How you'll make an impact: As a Site Reliability Engineer (SRE) with advanced expertise in networking and security...Work at officeFlexible hours$89k - $178k
...Sr. Site Reliability Engineer I NYC Global HQ Hybrid (3 days per week in office) DV is the leader in digital performance solutions, helping our advertiser and agency partners verify the quality of their digital campaigns, optimize to improve performance and prove...Work at office3 days per week$182.3k - $220k
...healthcare by putting patients first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the... ...hardening infrastructure and building tools that empower our engineers to ship safely and confidently. You will work across teams...Local areaFlexible hours$125k - $350k
...Site Reliability Engineer New York, Miami, Gurugram, London, Singapore, Sydney Job Description Opportunities may be available from time to time in any location in which the business is based for suitable candidates. If you are interested in a career with Citadel...- ...Site Reliability Engineer I, Abhishek, would like to share a job opportunity as Site Reliability Engineer in Jacksonville, FL, Cary, NC or New York, NY (Onsite) location for a Fulltime position. In case, if you are not comfortable with this location, please share your...Full timeWork visa
$111k - $160k
...Join Mizuho as a Site Reliability Engineer! In this role you will play a crucial role in maintaining the reliability, scalability, and overall performance of our production systems. This position collaborates closely with development, operations, and product teams to automate...Work at officeLocal areaRemote work- ...Participate in an oncall rotation. Work with teams across the company to ensure we achieve the right balance of developer velocity, reliability and performance, and cost efficiency. What You’ll Bring 5+ years of experience Experience with containerization and orchestration...
- ...Curated careers, resources, tips and trends from the DevOps World. The Site Reliability Engineer position at Remotive revolves around ensuring the reliability, availability, and performance of services. This role requires a combination of software engineering and system...Remote work
$160k - $230k
...We are currently looking to add Platform Engineers to our team, with at least 5 years of experience... .... You’ll ensure our platform is reliable, secure, and performant from day one. Responsibilities... ...collaborative setting. Our team works on-site five days a week, growing and building...Work at officeLocal area- ...risk—the leading cause of cybersecurity breaches—and build safer, more resilient organizations. The Role: As a Senior Site Reliability Engineer (SRE) at Dune Security, you will play a critical role in ensuring our platform's stability, scalability, and security. You...Full timeWork at office
- ...they are shifting towards Linux – (70% Windows, 30% Linux) Remote access technology protocols are a plus Job Description: Site Reliability Engineer Periodic updates and maintenance of Windows-based golden image for ESX & AWS. Patching of software, systems, appliances etc...Remote workShift work
$200k - $240k
...expertise across machine learning, UI/UX, large language models, and medicine. Job Description We’re hiring an experienced Site Reliability Engineer for our Boston or NYC office! You can expect to: Design, build, and maintain resilient, scalable, and secure...Work at office$185k - $227k
...united by this common purpose and we are hiring the world’s best engineers, scientists, designers, product managers, operations experts... ...on for more details. ROLE AND RESPONSIBILITIES A Senior Site Reliability Engineer (SRE) is expected to own the operational stability...Remote work$150k - $200k
...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑kind toolchain observability and acceleration platform that helps software teams adopt and improve DORA capabilities (including continuous delivery) in order to achieve...Full timeLocal areaRemote workWork from home- ...subscriptions at scale, combining the agility of a high-growth business with the backing of a global organization. As the Site Reliability Engineer, you will help ensure the reliability, scalability, and observability of CloudBlue’s multi-tenant SaaS platforms used by service...Remote workWorldwideFlexible hours
$157.5k - $254.35k
...signature and contract lifecycle management (CLM). What you’ll do We are looking for a self‑motivated, driven and creative Senior Site Reliability Engineer to join the Site Reliability team. Metrics and analytics drive engineering at DocuSign and ensure that we are dedicating...Contract workWork at officeLocal areaRemote work$7.5k
...and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor...Work at officeLocal area$150k - $170k
...Senior Site Reliability Engineer – Zip Co Join to apply for the Senior Site Reliability Engineer role at Zip Co At Zip, we build cloud‑native software applications that serve millions of customers and process billions of dollars in payments. We’re looking for a seasoned...Casual workWork at officeRemote workFlexible hours- ...collaborative role in which you will work closely with our Software Engineers to deploy and operate our solutions; automate and streamline... ...& systems that provide high levels of scalability, reliability, and performance for client applications, while balancing security...Permanent employmentWork at office
- ...public cloud platform from scratch? Would you like to own critical services in a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud platform....Work at officeRemote work
$127k - $249k
...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational... ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper)....Work at officeLocal areaRemote workWorldwideFlexible hours- Senior Site Reliability Engineer - Azure Cloud Join to apply for the Senior Site Reliability Engineer role at Concord Technologies Concord Technologies is growing! Currently seeking a full‑time Senior Site Reliability Engineer (Sr. SRE) , with experience engineering solutions...Full timeLocal areaImmediate startRemote workFlexible hours
- A modern insurance company in the United States is seeking a Site Reliability Engineer to manage and enhance cloud infrastructure on Azure and oversee CI/CD pipelines. The ideal candidate will have over 3 years of experience in a related field, strong skills in Kubernetes...Remote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!
Related searches
- site reliability engineering manager New York, NY
- site reliability engineer remote New York, NY
- site reliability engineer sre New York, NY
- site reliability engineer New York, NY
- on-site clinical research associate (traveling/remote) New York, NY
- junior website developer New York, NY
- site merchandiser New York, NY
- IT site lead New York, NY
- site acquisition specialist New York, NY
- site leader New York, NY

