ClickHouse

Database Reliability Engineer - Core Team

Posted 7 Days Ago
Remote
Hiring Remotely in Canada
Senior level
Responsible for enhancing reliability, performance, and scalability of ClickHouse, managing on-call processes, and collaborating across multiple teams on incident responses and improvements.
The summary above was generated by AI
About ClickHouse

Established in 2009, ClickHouse leads the industry with its open-source column-oriented database system, driven by the vision of becoming the fastest OLAP database globally. The company empowers users to generate real-time analytical reports through SQL queries, emphasizing speed in managing escalating data volumes. Enterprises globally, including Lyft, Sony, IBM, GitLab, Twilio, HubSpot, and many more, rely on ClickHouse Cloud. It is available through open-source or on AWS, GCP, Azure, and Alibaba. 

Note: This position can be based remotely in any country ClickHouse has a hiring presence.

We are committed to providing our customers with reliable and secure services at ClickHouse. To continue this, we are building out our Site Reliability Engineering team in ClickHouse Core. As one of the first members of our Reliability Engineering Team at Core, you will be responsible for building and leading processes to ensure and improve the reliability, availability, scalability, and performance of ClickHouse. You will collaborate with teams such as Control Plane, Dataplane, Security, Support, and Operations and guide them in implementing ClickHouse in the best way for our customers. You will also own engineering escalation management and response, investigations, post-mortem analysis (including running blameless postmortems), and continuous improvement of how ClickHouse is run and optimized in the cloud. This role is a unique opportunity to make a significant impact on our elastic, limitless-scale, high-performance ClickHouse in ClickHouse Cloud.

What will you do?

  • Continuously improve the reliability and performance of ClickHouse core.
  • Improve and create metrics and alerts for ClickHouse to be able to identify and prevent problems in production before they affect customers. 
  • Dig deeper into the most common problems customers encounter in ClickHouse Core to identify root causes, then submit bug fixes and issue reports and suggest improvements.
  • Enhance and refine incident response processes and post-mortem analysis for outages related to ClickHouse core, including working with support and Cloud teams to communicate with impacted customers.
  • Plan, enable, and drive Chaos initiatives across Engineering teams, based upon internal priorities.
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize customer impact.

About you:

  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering.
  • Previous experience operating ClickHouse or other SQL databases in production. 
  • Excellent understanding of distributed database internals and SQL; knowledge of ClickHouse in particular is a major plus.
  • Scripting experience with Shell or Python, and the ability to read and understand C++ code.
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
  • You are a strong problem-solver and have solid production debugging skills.
  • You thrive in a fast-paced environment as part of a global team, and you see yourself as a partner with the business with the shared goal of moving the business forward.
  • You have a high level of responsibility, ownership, and accountability.
  • Excellent communication skills.

Compensation

For roles based in the United States, you can find above our typical starting salary ranges for this role, depending on your specific location. 

The positioning of offers within a certain range depends on various factors, including: candidate experience, qualifications, skills, business requirements and geographical location.

If you have any questions or comments about compensation as a candidate, please get in touch with us at [email protected].

Perks
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries.
  • Healthcare - Employer contributions towards your healthcare.
  • Equity in the company - Every new team member who joins our company receives stock options.
  • Time off - Flexible time off in the US, generous entitlement in other countries.
  • Home office setup - $500 if you’re a remote employee.
  • Global Gatherings - We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites.

Culture - We All Shape It

As part of our first 500 employees, you will be instrumental in shaping our culture. 

Are you interested in finding out more about our culture? Learn more about our values here. Check out our blog posts or follow us on LinkedIn to find out more about what’s happening at ClickHouse.

Equal Opportunity & Privacy 

ClickHouse provides equal employment opportunities to all employees and applicants and prohibits discrimination and harassment of any type based on factors such as race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. 

Please see here for our Privacy Statement.

Top Skills

AWS
Azure
C++
Google Cloud Platform
Python
Shell
