Pythian Logo

Pythian

Team Lead, Site Reliability Engineering

Posted 2 Days Ago
Be an Early Applicant
Hybrid
Ottawa, ON
Mid level
Hybrid
Ottawa, ON
Mid level
Lead a team of Site Reliability Engineers, overseeing operations, incident management, and mentoring. Involve in technical duties like automating workflows and optimizing systems.
The summary above was generated by AI
Team Lead, Site Reliability Engineering
Ottawa | Hybrid

Why Pythian:
At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring the reliability and performance of mission-critical databases. We quickly earned a reputation for solving tough data challenges. We were there when the industry moved from on-premises to cloud environments, and as enterprises sought more from their data, we expanded our competencies to include advanced analytics.

Today, we empower organizations to embrace transformation and leverage advanced technologies, including AI, to stay competitive. We deliver innovative solutions that meet each client’s data goals and have built strong partnerships with Google Cloud, AWS, Microsoft, Oracle, SAP, and Snowflake. The powerful combination of our extensive expertise in data and cloud and our ability to keep on top of the latest bleeding edge technologies make us the perfect partner to help mid and large-sized businesses transform to stay ahead in today’s rapidly changing digital economy.

Why You:
Pythian is building a next-generation Site Reliability Engineering team, and we’re looking for a talented, and experienced Team Lead who thrives in fast-paced, problem-solving environments.

As a Team Lead, you’ll be responsible for leading a team of site reliability engineers that are designing, deploying, and operating large-scale distributed systems across compute, storage, networking, and AI/ML environments.  You will act as the primary technical escalation point, oversee day-to-day operational delivery, mentor and coach team members, and ensure adherence to SLAs and quality standards. You may also directly contribute to delivery by leading projects from architecture to automation to intelligent monitoring, collaborating with both clients and teammates to build resilient, high-performing infrastructure.

If this is you, and you wonder what it would be like to work at Pythian, reach out to us and find out!  Intrigued to see what a life is like at Pythian? Check out #pythianlife on LinkedIn!

What you will be doing:

  • Team Leadership & Operational Management:
  • Lead and mentor a team of Site Reliability Engineers to ensure technical excellence, timely resolution of incidents, and professional growth of team members.
  • Oversee queue management, ticket prioritization, and workload distribution to meet SLA and utilization targets.
  • Act as the primary point of contact for critical escalations and severity-1 incidents, providing guidance and technical direction.
  • Conduct performance reviews, and knowledge-sharing sessions to strengthen the team’s capabilities.
  • Collaborate with management on performance metrics, process adherence, and resource planning.
  • Sets specific goals and objectives for team members as part of Pythian’s goal planning program. Provides guidance to team members in regards to training opportunities as part of Pythian’s self-directed training program. Meets regularly with team members for one-on-one sessions to disseminate information and gain feedback on opportunities for improvement. 
  • Technical Responsibilities:
  • Operate and optimize Kubernetes clusters, Istio service mesh, and Linux-based systems.
  • Automate workflows using Go, Python, and Shell scripting.
  • Build monitoring and observability solutions with Prometheus, Grafana, and Loki.
  • Troubleshoot complex networking, storage, and system performance issues.
  • Partner with AI/ML teams to ensure infrastructure readiness for model training and data pipelines.

What you bring:

  • A minimum of 3 years previous experience leading a team.
  • Experience with Google Cloud, plus IaC tools (Terraform).
  • Strong knowledge of microservices, containers (Kubernetes, Docker), and networking.
  • Hands-on experience with PKI, service mesh, and Linux systems administration.
  • SRE mindset with a focus on automation, scalability, and reliability.

What you get in return:

  • Love your career: Competitive total rewards package. Blog during work hours. Hone your skills or learn new ones with our substantial training allowance; participate in professional development days, attend training, become certified, whatever you like! 
  • Love your work/life balance: Flexibly work, there’s no daily travel requirement to an office! All you need is a stable internet connection. 
  • Love your coworkers: Collaborate with some of the best and brightest in the industry!
  • Love your workspace: We give you all the equipment you need to work from home including a laptop with your choice of OS, and an annual budget to personalize your work environment!  
  • Love yourself: Pythian cares about the health and well-being of our team. You will have an annual wellness budget to make yourself a priority (use it on gym memberships, massages, fitness and more). Additionally, you will receive a generous amount of paid vacation and sick days, as well as a day off to volunteer for your favorite charity.

Hiring Disclaimer
Pythian is looking to fill this position as soon as possible as this position is for an active and open position.
The successful applicant will need to fulfill the requirements necessary to obtain a background check.
Accommodations are available upon request for candidates taking part in any aspect of the selection process.

Top Skills

Go
GCP
Grafana
Istio
Kubernetes
Linux
Loki
Prometheus
Python
Shell Scripting
Terraform
HQ

Pythian Ottawa, Ontario, CAN Office

319 McRae Ave #700, Ottawa, Ontario, Canada

Similar Jobs

18 Days Ago
In-Office or Remote
Toronto, ON, CAN
Senior level
Senior level
Travel
The Senior Site Reliability Engineer will enhance platform tooling, automate infrastructure workflows, improve scalability, and support engineering teams in incident response and collaboration.
Top Skills: BashDatadogGoogle Cloud PlatformHelmIstioKubernetesKustomizePythonTerraform
38 Minutes Ago
Remote or Hybrid
Toronto, ON, CAN
Mid level
Mid level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The role involves configuring/developing ServiceNow solutions in Finance & Supply Chain, leading customer engagements, and providing technical expertise.
Top Skills: BootstrapCoupaCSSHTMLIvaluaJavaScriptLdapOracle Procurement CloudSap AribaSap EccSap S/4HanaServicenowSsoWeb ServicesXML
39 Minutes Ago
Remote or Hybrid
Toronto, ON, CAN
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Advisory Solution Consultant will support product sales with technical expertise, lead discovery workshops, provide demonstrations, and collaborate with sales teams to achieve sales goals.
Top Skills: Ai-Powered ToolsCloud Software SolutionsServicenow Platform

What you need to know about the Ottawa Tech Scene

The capital city of Canada and the nation's fourth-largest urban area, Ottawa has proven a rapidly growing global tech hub. With over 1,800 tech companies, many of which are leaders in their sectors, the city's tech talent now makes up more than 13 percent of its total workforce. This growth is driven not only by the big players like UL Solutions and Dropbox, but also by a thriving startup ecosystem, as new businesses emerge to follow in the footsteps of those that came before them.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account