Site Reliability Engineer
Canonical
Job Description
Job Description
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution.
The company is founder-led, profitable, and growing.
We are hiring a Site Reliability EngineerOur goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what's possible with automation by embracing a model-driven approach, whether on-premise or on public clouds.
We run hundreds of private cloud, Kubernetes clusters, and applications for customers across both physical and public cloud estates. We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to ultimately achieve high-quality standards in our open source portfolio.
To succeed in this role, you need to have a strong background in Linux, Python, networking, and knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies like OpenStack, Kubernetes, security standards, open source products like Kubeflow, Kafka, OpenSearch, databases, and many others.
Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.
Location: Globally remote role
The roleWe deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices.
To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers.
As a member of the team, you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.
What we are looking for in you- Degree in software engineering or computer science
- Python software development experience
- Operational experience in Linux environments
- Experience with Kubernetes deployment or operations
- Excellent interpersonal skills, curiosity, flexibility, and accountability
- Ability to travel internationally twice a year, for company events up to two weeks long
- Familiarity with OpenStack deployment or operations
- Familiarity with public cloud deployment or operations
- Familiarity with private cloud management
We consider geographical location, experience, and performance in shaping compensation worldwide. We adjust compensation every 6 months to ensure we recognize outstanding performance, and in addition to base pay, we offer annual bonuses. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Every 6 months compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programs
- Opportunity to travel to new locations to meet your colleagues
- Priority Pass and travel upgrades for long-haul company events
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.
Canonical is an equal opportunity employerWe are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.
#LI-Remote
- Summary: As a Sr. Site Reliability Engineer, you are instrumental in helping make our client’s Kubernetes-centric ProArchive application resilient. This position will coordinate with multiple teams to develop a migration plan for various components and services as well...Suggested
- ...automation platforms that Axon's product engineering teams depend on. You will architect... ..., using operational experience to drive reliability improvements and inform platform investment... ...software engineering, cloud infrastructure, or site reliability engineering. Experience...SuggestedWork at officeRemote work
- Job Title :- Site Reliability Engineer (SRE) Employment Type :- W2 Duration :- Long Term Visa Type :- All Visa applicable which are ready for W2 Location :- Atlanta, GA (Onsite) Job Description We are seeking a highly skilled Site Reliability Engineer (SRE)with expertise...Suggested
- ...tooling that improve cloud infrastructure reliability, scalability, and operational efficiency... ...platforms and tools that enable engineering teams to provision services rapidly, consistently... ...engineering, cloud infrastructure, or site reliability engineering. Experience...Suggested
- ...- AWS, Google Cloud, and Azure is a plus - CI/CD Automation, Database Management. The Technical Support Specialist in Site Reliability Engineering (SRE) will be responsible for ensuring the reliability and stability of the systems and applications. The role involves...Suggested
$99.09k - $123.86k
Position Overview We’re seeking a seasoned Site Reliability Engineer (SRE) who thrives at the intersection of software engineering, infrastructure, and AI systems. You’ll help ensure our platforms are scalable, reliable, and secure while also contributing code, automation...Local areaFlexible hours- Who we’re looking for? A Site Reliability DevOps engineer working as part of the high-performing Operations team (SRE) growing their knowledge and skillset. Helps maintain existing business-critical applications and infrastructure while recommending technical and process...Remote jobMonday to Friday
- Overview Site Reliability Engineering (SRE) Architect — Atlanta, GA Duration: 12 Months+ Extension | Hourly Rate: DOE | Work Authorization: As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems...Hourly payPermanent employmentLocal areaEarly shift
$180k - $220k
...a Lead SRE, you'll be a technical and operational leader for reliability across Develocity. You'll help define our SRE vision, set standards... ...grows. This is a hands-on role with broad influence across engineering, cloud platform, and customer-facing teams. The SRE team...Remote workWork from homeShift work- ...improve software solutions to ensure system reliability and availability, mitigate operational... ...issues. # You will help lead chaos engineering efforts in a production-alike environment... ...professionals, with engineers focused on site reliability engineering and...Permanent employmentFlexible hours
- A leading IT solutions provider in Atlanta is looking for a Site Reliability Engineer (SRE) with expertise in Adobe Experience Manager (AEM) and DevOps practices. The successful candidate will maintain and enhance the reliability of AEM applications while implementing scalable...
$126k - $248k
...As a TPM for SRE, you will partner with SRE leaders and engineers to scale the platform that underpins all of MongoDB's cloud products. You will drive program execution, strengthen production reliability practices, and coordinate cross-functional efforts across US and...Local areaRemote workWorldwideFlexible hours- An innovative firm is seeking a Technical Support Specialist to join their Site Reliability Engineering team. This role is pivotal in ensuring the reliability and stability of systems and applications. You will provide technical support, troubleshoot issues, and implement...
- 4p-Consulting-Inc. is looking for an experienced DevOps Engineer IV / Site Reliability Engineer (SRE) in Atlanta, GA. This professional will focus on observability, telemetry, and service reliability, working with engineering and operations teams to enhance operational...
$68.91k - $161.54k
...functional teams to deliver enterprise‑grade solutions.. Ensure code quality, security, scalability, and performance through established engineering best practices.Your skills and experience. At least 4+ years of strong hands‑on experience with Java, Spring Boot, and...Full timeLocal area- SRE DevOps Engineer Location: Overland Park, KS / Atlanta, GA / Frisco, TX (Onsite) Requirements 4-9 years in SRE/DevOps/Systems Engineering... ...trends, propose improvements in monitoring, capacity, and reliability. Collaborate with engineering teams on deployment, upgrades,...
- Overview: Job Purpose The SDLC Tools Engineer, Platform Engineering, will be responsible for ICE’s overall SDLC strategy. This role... ...Implement monitoring, alerting, and dashboards, to maintain reliability and operational insight into SDLC platforms and CI/CD pipelines...For contractorsWork experience placement
$68.91k - $161.54k
Capgemini is looking for a skilled Java Full Stack Developer based in Atlanta, Georgia in a hybrid role. The ideal candidate has at least 4 years of experience in Java, Spring Boot, and Microservices architecture, along with strong frontend technologies experience including...- ...We have an immediate need for a Senior Release Train Engineer for a contract assignment located in Carmel, Indiana . The Release Train Engineer (RTE) has a primary purpose of supporting an Agile Release Train (ART) by steering it to success and navigating the complexity...Contract workWork at officeImmediate start
$96k - $135k
...Release Train Engineer The Agile Release Train Engineer will utilize industry proven frameworks and drive the appropriate Agile Methodology that focuses on predictable, quality-delivery, and drive continuous improvement. The Agile Release Train Engineer will assist...- Highbrow LLC is looking for an experienced SRE DevOps Engineer based in Overland Park, KS. The ideal candidate should have 4-9 years of experience in SRE or DevOps, with strong expertise in Kubernetes, incident troubleshooting, and automation. The role involves resolving...
- ...Job Description - Agile -Release Train Engineer (RTE) Duration: FULL TIME Location: Atlanta, GA Role and Responsibilities: Serve as the key facilitator for the Agile Release Train (ART) in Mobile/Web/Services IT projects. Collaborate...Full time
- Carter is seeking a Platform Reliability & Operations Engineer located in Atlanta, GA to enhance our eCommerce platforms including Salesforce Commerce Cloud. This role combines software engineering and operational expertise, essential for maintaining high-performance and...
- Lead Stability Engineer The Lead Stability Engineer is responsible for providing advanced technical support, troubleshooting, and problem... ...details on Truist’s generous benefit plans, please visit our Benefits site. Depending on the position and division, this job may also be...Full timePart timeWork experience placementWork at office
- Saviynt is seeking a Platform Support Engineer to join our SRE Operations team in Atlanta, GA. This key role ensures the smooth operation... ...Cloud by maintaining platform stability, performance, and reliability. Responsibilities include troubleshooting, performance analysis...
- A technology company in Atlanta is seeking a seasoned Software Engineer specializing in cloud infrastructure. This role involves architecting solutions across Azure and AWS and mentoring junior engineers. Candidates should possess over 8 years of experience in software...Flexible hours
- Carter's Inc. in Atlanta, Georgia is searching for a Platform Reliability & Operation Engineer. This role supports performance and reliability of... ...Responsibilities include troubleshooting production issues, maintaining site performance, and collaborating with teams. Opportunities...
$168.5k - $252.7k
...About the Role As a Senior Software Engineer, you will play a key role in designing and... ...team members to ensure high-velocity, reliable delivery. What You'll Do Design,... ...Workday Careers. Please be aware of sites that may ask for you to input your data...Contract workWork at officeRemote workHome officeFlexible hours$105k - $130k
...provide the high-speed capabilities our nation and its allies need to maintain a durable, asymmetric advantage. The Mission Systems Engineering (MSE) Team develops the Mission Management System (MMS)-a software platform that integrates mission subsystems, autonomy services...Weekly payPermanent employmentWork at office$146k - $193k
...edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years. Anduril's Reliability Engineering organization is seeking an experienced Reliability Engineer to join our team in support of our Roadrunner product line....Full timeWork experience placementRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!
- site reliability engineer sre Atlanta, GA
- site reliability engineer Atlanta, GA
- website content developer Atlanta, GA
- website coordinator Atlanta, GA
- on site coordinator Atlanta, GA
- IT site lead Atlanta, GA
- on-site clinical research associate (traveling/remote) Atlanta, GA
- junior website developer Atlanta, GA
- site services specialist Atlanta, GA
- site recruiter Atlanta, GA


