Site Reliability Engineer
Canonical
Job Description
Job Description
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution.
The company is founder-led, profitable, and growing.
We are hiring a Site Reliability EngineerOur goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what's possible with automation by embracing a model-driven approach, whether on-premise or on public clouds.
We run hundreds of private cloud, Kubernetes clusters, and applications for customers across both physical and public cloud estates. We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to ultimately achieve high-quality standards in our open source portfolio.
To succeed in this role, you need to have a strong background in Linux, Python, networking, and knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies like OpenStack, Kubernetes, security standards, open source products like Kubeflow, Kafka, OpenSearch, databases, and many others.
Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.
Location: Globally remote role
The roleWe deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices.
To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers.
As a member of the team, you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.
What we are looking for in you- Degree in software engineering or computer science
- Python software development experience
- Operational experience in Linux environments
- Experience with Kubernetes deployment or operations
- Excellent interpersonal skills, curiosity, flexibility, and accountability
- Ability to travel internationally twice a year, for company events up to two weeks long
- Familiarity with OpenStack deployment or operations
- Familiarity with public cloud deployment or operations
- Familiarity with private cloud management
We consider geographical location, experience, and performance in shaping compensation worldwide. We adjust compensation every 6 months to ensure we recognize outstanding performance, and in addition to base pay, we offer annual bonuses. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Every 6 months compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programs
- Opportunity to travel to new locations to meet your colleagues
- Priority Pass and travel upgrades for long-haul company events
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.
Canonical is an equal opportunity employerWe are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.
#LI-Remote
- The Home Depot is hiring a Senior Software Engineer for Site Reliability in Atlanta, Georgia. This role focuses on enhancing product reliability and drives platform stability with automated solutions. Responsibilities include software development and deployment, collaborating...Suggested
- ...enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's... ...is founder-led, profitable, and growing. We are hiring a Site Reliability Engineer Our goal is to perfect enterprise infrastructure DevOps...SuggestedWork at officeLocal areaRemote workWork from homeWorldwide
$104k - $130k
...infrastructure as well as help improve the reliability, quality of services and overall... ...recovery. You’ll collaborate or embed with engineering teams, helping them to improve the... ...more about our locations by visiting our site. Compensation & Benefits The base...SuggestedFull timeWork experience placement- ...Position Purpose:The Senior Software Engineer for Site Reliability Engineering (Store Systems Enablement) builds and operates the internal platforms that keep HomeDepot's store systems observable, reliable, and automated. This is a platform engineering role: you will design...SuggestedWork experience placementLocal areaRemote workShift work
$117k - $209.33k
## Site Reliability EngineerApplylocations: Atlanta, GA, USAtime type: Full timeposted on: Posted Todayjob requisition id: 26WD98046**Job Requisition... ...exciting new opportunity has opened for a Site Reliability Engineer within the Autodesk PDMS Platform SRE team. The successful...SuggestedPermanent employment- Summary: As a Sr. Site Reliability Engineer, you are instrumental in helping make our client’s Kubernetes-centric ProArchive application resilient. This position will coordinate with multiple teams to develop a migration plan for various components and services as well...
- ...tooling that improve cloud infrastructure reliability, scalability, and operational efficiency... ...platforms and tools that enable engineering teams to provision services rapidly, consistently... ...engineering, cloud infrastructure, or site reliability engineering. Experience...
- ...- AWS, Google Cloud, and Azure is a plus - CI/CD Automation, Database Management. The Technical Support Specialist in Site Reliability Engineering (SRE) will be responsible for ensuring the reliability and stability of the systems and applications. The role involves...
- You’re more valuable than ever - And that’s just how we’ll make you feel.The Site Reliability Engineer is responsible for maintaining and enhancing the reliability, security, and performance of our customer-facing web and mobile applications. This role is critical in remediating...Work experience placementWork at officeLocal area
- Job Title :- Site Reliability Engineer (SRE) Employment Type :- W2 Duration :- Long Term Visa Type :- All Visa applicable which are ready for W2 Location :- Atlanta, GA (Onsite) Job Description We are seeking a highly skilled Site Reliability Engineer (SRE)with expertise...
- ...smart growth" approach ensures that we will continue to scale our company effectively. Summary We are seeking a Lead Site Reliability Engineer to spearhead our SRE team. You are not just an operator; you are an experienced software engineer who excels at...Remote work
$180k - $220k
...a Lead SRE, you'll be a technical and operational leader for reliability across Develocity. You'll help define our SRE vision, set standards... ...grows. This is a hands-on role with broad influence across engineering, cloud platform, and customer-facing teams. The SRE team...Full timeRemote workWork from homeShift work- GoHealth Urgent Care is hiring a Site Reliability Engineer in Atlanta, Georgia. This role focuses on maintaining and enhancing the reliability, security, and performance of web and mobile applications. You will be responsible for managing Azure DevOps pipelines and collaborating...
- Who we’re looking for? A Site Reliability DevOps engineer working as part of the high-performing Operations team (SRE) growing their knowledge and skillset. Helps maintain existing business-critical applications and infrastructure while recommending technical and process...Remote jobMonday to Friday
- ...cloud-native systems. As a Staff Platform Engineer, you will play a critical role in... ...technical leadership role. You will own reliability for major platform domains, design scalable... ...Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a...Full time
- Overview Site Reliability Engineering (SRE) Architect — Atlanta, GA Duration: 12 Months+ Extension | Hourly Rate: DOE | Work Authorization: As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems...Hourly payPermanent employmentLocal areaEarly shift
- About the Role You\'ll own the reliability posture of a large-scale healthcare platform. That means infrastructure design, deployment pipelines... ...isn\'t production-ready. You\'ll work alongside software engineers and security engineers who are building real capabilities -...Permanent employmentFlexible hours
$135.8k - $183.8k
...Postgres DBs in support of key services that make the internet work. The ideal candidate will work with other DBA SREs, application engineers, Infrastructure teams, Security and Project Managers maintaining critical internet infrastructure. Responsibilities Maintain and...Work experience placementWork at officeFlexible hours$300k - $360k
...giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest.As a Director of Site Reliability Engineering, you will own execution for reliability, availability, and operational excellence across Affirms global platform. You...Work at officeRemote workFlexible hours$130k - $150k
...handed off. You'll work alongside software engineers and security engineers who are building... ...Own and improve CI/CD pipelines - reliability, deployment safety, rollback capability... ...and we'll figure it out together. Senior Site Reliability Engineer Salary: $130,000 -...Permanent employmentFlexible hours- A leading IT solutions provider in Atlanta is looking for a Site Reliability Engineer (SRE) with expertise in Adobe Experience Manager (AEM) and DevOps practices. The successful candidate will maintain and enhance the reliability of AEM applications while implementing scalable...
- Autodesk, Inc. is seeking a Site Reliability Engineer based in Atlanta, GA. This role involves architecting solutions for SaaS applications, managing cloud infrastructure, and ensuring reliability and performance. Candidates should have a background in DevOps, strong AWS...
- An innovative firm is seeking a Technical Support Specialist to join their Site Reliability Engineering team. This role is pivotal in ensuring the reliability and stability of systems and applications. You will provide technical support, troubleshoot issues, and implement...
$83.9k - $151.2k
...role supports the Subscription Product Engineering organization, including in-house subscription... ...through automation, monitoring, and reliability-focused practices across production and... ...support practices (Required) Knowledge of Site Reliability Engineering principles,...Full timeTemporary workPart timeWork experience placementLocal areaFlexible hours- Overview: Job Purpose The SDLC Tools Engineer, Platform Engineering, will be responsible for ICE’s overall SDLC strategy. This role... ...Implement monitoring, alerting, and dashboards, to maintain reliability and operational insight into SDLC platforms and CI/CD pipelines...For contractorsWork experience placement
$94.9k - $135.6k
...development, testing, operations, and platform teams to deliver value safely and efficiently. Cardinal Health is seeking a Release Engineer to lead iteration and release management activities supporting mission critical warehouse transformation initiatives on Program...Temporary workLocal areaImmediate startFlexible hours- ...Position: Release Engineering Contractor Only locals Location: Alpharetta, GA 30005 Job type: Contract Position Overview We are seeking an exceptional Release Engineering contractor to join our team and drive continuous improvement in...Contract workFor contractorsLocal area
- ...Description Job Description Are you passionate about ensuring the reliability and performance of advanced semiconductor technologies? At... ...systems. Falcomm is seeking an RFIC Reliability Engineer to lead reliability analysis and qualification activities for...Permanent employmentFull time
- Make the Future Fly Join us in creating a world with more living. What We Believe Make It Matter Creating a new way to deliver starts with people who want to make a difference. We know your time is valuable and so are our customers. Build with empathy. Ask...
$105k - $130k
...provide the high-speed capabilities our nation and its allies need to maintain a durable, asymmetric advantage. The Mission Systems Engineering (MSE) Team develops the Mission Management System (MMS)-a software platform that integrates mission subsystems, autonomy services...Weekly payPermanent employmentWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!
- site reliability engineer Atlanta, GA
- site reliability engineer sre Atlanta, GA
- website content developer Atlanta, GA
- site services specialist Atlanta, GA
- site recruiter Atlanta, GA
- IT site lead Atlanta, GA
- on-site clinical research associate (traveling/remote) Atlanta, GA
- on site coordinator Atlanta, GA
- website coordinator Atlanta, GA
- junior website developer Atlanta, GA


