Canonical Logo

Canonical

Site Reliability Engineer

Reposted 20 Days Ago
In-Office or Remote
8 Locations
Mid level
In-Office or Remote
8 Locations
Mid level
The Site Reliability Engineer will ensure reliable cloud operations by applying Python for infrastructure automation, managing OpenStack and Kubernetes, and practicing devsecops in a fast-paced environment.
The summary above was generated by AI

Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution.

The company is founder-led, profitable, and growing.

We are hiring a Site Reliability Engineer

Our goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what's possible with automation by embracing a model-driven approach, whether on-premise or on public clouds.

We run hundreds of private cloud, Kubernetes clusters, and applications for customers across both physical and public cloud estates. We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to ultimately achieve high-quality standards in our open source portfolio.

To succeed in this role, you need to have a strong background in Linux, Python, networking, and knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies like OpenStack, Kubernetes, security standards, open source products like Kubeflow, Kafka, OpenSearch, databases, and many others.

Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.

Location: Globally remote role

The role

We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices.

To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers.

As a member of the team, you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.

What we are looking for in you
  • Degree in software engineering or computer science
  • Python software development experience
  • Operational experience in Linux environments
  • Experience with Kubernetes deployment or operations
  • Excellent interpersonal skills, curiosity, flexibility, and accountability
  • Ability to travel internationally twice a year, for company events up to two weeks long
Bonus skills
  • Familiarity with OpenStack deployment or operations
  • Familiarity with public cloud deployment or operations
  • Familiarity with private cloud management
What we offer colleagues

We consider geographical location, experience, and performance in shaping compensation worldwide. We adjust compensation every 6 months to ensure we recognize outstanding performance, and in addition to base pay, we offer annual bonuses. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.

  • Distributed work environment with twice-yearly team sprints in person
  • Personal learning and development budget of USD 2,000 per year
  • Every 6 months compensation review
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Employee Assistance Programs
  • Opportunity to travel to new locations to meet your colleagues
  • Priority Pass and travel upgrades for long-haul company events
About Canonical

Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004.​ Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.

Canonical is an equal opportunity employer

We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.

 #LI-Remote

Top Skills

Kubernetes
Linux
Openstack
Python

Similar Jobs

4 Days Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
The Senior Software Engineer in SRE focuses on enhancing application reliability, leading engineering teams, and fostering a culture of quality. Responsibilities include defining SLOs, incident management, and collaborating with various teams to support product development.
Top Skills: AWSKotlinKubernetesMySQLPython
7 Days Ago
Remote
3 Locations
Senior level
Senior level
Artificial Intelligence • Information Technology • Software • Automation
As a Site Reliability Engineer, you'll design and maintain Kubernetes-based infrastructure, enhance monitoring tools, support critical incidents, and drive automation to improve operational efficiency.
Top Skills: BashDatadogDockerElk StackGoGrafanaKubernetesNew RelicPrometheusPython
10 Days Ago
In-Office or Remote
Burnaby, BC, CAN
Mid level
Mid level
Cloud • Payments • Software
As a Site Reliability Engineer, you will optimize and support Azure-based SaaS platforms by designing CI/CD pipelines, building cloud infrastructure, and ensuring system reliability and performance.
Top Skills: AnsibleAWSAzure App InsightsAzure DevopsBambooBashBitbucketC#CloudwatchCoralogixDockerGithub ActionsJavaKubernetesAzureOpentelemetryPowershellPythonSnykSonarcloudSpaceliftTerraform

What you need to know about the Ottawa Tech Scene

The capital city of Canada and the nation's fourth-largest urban area, Ottawa has proven a rapidly growing global tech hub. With over 1,800 tech companies, many of which are leaders in their sectors, the city's tech talent now makes up more than 13 percent of its total workforce. This growth is driven not only by the big players like UL Solutions and Dropbox, but also by a thriving startup ecosystem, as new businesses emerge to follow in the footsteps of those that came before them.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account