Site Reliability Engineer

Full-time

Canonical

Job Description

Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution.

The company is founder-led, profitable, and growing.

We are hiring a Site Reliability Engineer

Our goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what's possible with automation by embracing a model-driven approach, whether on-premises or on public clouds.

We run hundreds of private clouds, Kubernetes clusters, and applications for customers across both physical and public cloud estates. We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to achieve high quality standards across our open source portfolio.

To succeed in this role, you need a strong background in Linux, Python, and networking, and knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and the kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies, such as OpenStack, Kubernetes, and security standards, and in open source products such as Kubeflow, Kafka, OpenSearch, databases, and many others.

Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.

Location: Globally remote role

The role

We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices.

To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers.

As a member of the team, you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.

What we are looking for in you
  • Degree in software engineering or computer science
  • Python software development experience
  • Operational experience in Linux environments
  • Experience with Kubernetes deployment or operations
  • Excellent interpersonal skills, curiosity, flexibility, and accountability
  • Ability to travel internationally twice a year, for company events up to two weeks long

Bonus skills
  • Familiarity with OpenStack deployment or operations
  • Familiarity with public cloud deployment or operations
  • Familiarity with private cloud management

What we offer colleagues

We consider geographical location, experience, and performance in shaping compensation worldwide. We adjust compensation every 6 months to ensure we recognize outstanding performance, and in addition to base pay, we offer annual bonuses. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.

  • Distributed work environment with twice-yearly team sprints in person
  • Personal learning and development budget of USD 2,000 per year
  • Compensation review every 6 months
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Employee Assistance Programs
  • Opportunity to travel to new locations to meet your colleagues
  • Priority Pass and travel upgrades for long-haul company events

About Canonical

Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.

Canonical is an equal opportunity employer

We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.

Vacancy posted 19 days ago