DataSite Logo

DataSite

Sr HA-Systems Engineer

Posted 18 Days Ago
Be an Early Applicant
Remote
2 Locations
Senior level
Remote
2 Locations
Senior level
Design and optimize high-availability systems for M&A SaaS technology, focusing on resilience and scalability with mentorship responsibilities.
The summary above was generated by AI

Datasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What’s yours? Invest your talents in us, and we’ll return the compliment.

Job Description:

If you're passionate about building resilient, high-performance systems and want to make an impact in a growing, innovative company, this might be an opportunity for you.

We are seeking an experienced High Availability Systems Engineer to join our engineering team. This role will be responsible for designing, implementing, and optimizing systems to ensure high availability, reliability, and scalability across our platforms. The ideal candidate will have a deep understanding of distributed systems, cloud infrastructure, and best practices for achieving 99.99% uptime.

We are open to hiring remote in the US or Canada. **Please note, we are unable to sponsor or take over sponsorship of an employment Visa at this time.**

Key Responsibilities:

Design and Architecture:

  • Architect and build highly available, fault-tolerant systems to support mission-critical applications.

  • Collaborate with cross-functional teams to design scalable, robust, and secure cloud-based solutions.

  • Develop strategies for disaster recovery, data replication, and failover processes.

System Performance and Optimization:

  • Analyze system performance, identify bottlenecks, and implement optimizations to ensure optimal uptime and performance.

  • Conduct load testing, capacity planning, and performance tuning to meet high availability requirements.

  • Utilize monitoring tools to proactively detect issues and minimize downtime.

Automation and Infrastructure as Code:

  • Develop and maintain infrastructure as code (IaC) using tools like Terraform and Ansible.

  • Implement automation for deployments, scaling, and configuration management to reduce manual intervention and increase system reliability.

Incident Management and Troubleshooting:

  • Lead incident response and root cause analysis for system outages, ensuring quick resolution and prevention of future incidents.

  • Build and maintain robust monitoring, alerting, and diagnostic systems for proactive issue identification.

Mentorship and Leadership

  • Provide technical leadership, mentorship, and guidance to junior engineers and other team members.

  • Stay updated on the latest trends in high availability and distributed systems, and share knowledge within the team.

Required Qualifications:

  • Bachelor's in Computer Science, Engineering, or a related field (or equivalent experience).

  • 8+ years of experience in systems engineering, infrastructure architecture, or related fields.

  • Proven track record of designing and implementing highly available, fault-tolerant systems in cloud or on-prem environments.

  • Experience with distributed systems, microservices architecture, and high availability patterns (e.g., active-active, active-passive).

  • Proficient in cloud platforms (Azure, GCP, AWS) or on-prem data centers and cloud-native technologies.

  • Deep knowledge and understanding of Linux systems

  • Experience using monitoring and observability tools (Prometheus, Grafana, Loki, etc.).

  • Strong coding/scripting skills in Python, Go, or Shell for automation.

  • Excellent problem-solving skills with a focus on resilience and scalability.

  • Strong communication skills with the ability to convey complex technical concepts to diverse stakeholders.

  • Ability to work independently and take ownership of projects from inception to deployment.

Preferred Qualifications:

  • Strong experience with containers and orchestration (Docker, Kubernetes).

  • Familiarity with CI/CD pipelines and DevOps practices.

  • Advanced knowledge of networking, load balancers, and distributed data storage solutions (e.g., Cassandra, Elasticsearch, Kafka).

  • Experience with multi-region deployments and global scaling strategies.

  • Certification in cloud platforms (e.g., AWS Certified Solutions Architect, Google Professional Cloud Architect).

  • Background in security best practices, including compliance frameworks (e.g., SOC 2, ISO 27001).

  • Experience in agile methodologies and DevOps culture.

The base salary range represents the estimated low and high end for this position at the time of this posting. Consistent with applicable law, each candidate’s compensation offer may vary and will be determined based on but not limited to, your geographic region, skills, qualifications, and experience along with the requirements of the position.  Datasite reserves the right to modify this pay range at any time.

$61,100.00 - $108,700.00

As a global organization, Datasite knows that diverse perspectives are essential to our success. We’re committed to maintaining a diverse workforce to serve our customers around the world. Datasite is an equal opportunity employer (EEO) and furthers the principles of EEO through Affirmative Action.

Top Skills

Ansible
AWS
Azure
Cassandra
Docker
Elasticsearch
GCP
Go
Grafana
Kafka
Kubernetes
Linux
Loki
Prometheus
Python
Shell
Terraform

Similar Jobs

2 Hours Ago
Remote
Hybrid
New York, NY, USA
Mid level
Mid level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
As a Solutions Engineer at Dynatrace, you will support sales with technical expertise, manage POCs, and foster customer relationships while promoting Dynatrace products.
Top Skills: .NetAnsibleAWSAzureCSSDynatraceGCPGoHTMLJavaJavaScriptKubernetesNode.jsOpenshiftPHPPuppetTerraform
2 Hours Ago
Remote
Hybrid
68 Locations
Senior level
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead and manage teams in developing data solutions using Palantir Foundry, mentoring junior staff and ensuring project success and client satisfaction while adhering to PwC standards.
Top Skills: AipPalantir FoundryPythonTypescript
2 Hours Ago
Easy Apply
Remote
35 Locations
Easy Apply
Senior level
Senior level
Cloud • Security • Software • Cybersecurity • Automation
This role involves leading the design and evolution of GitLab’s multi-tenant platform, ensuring high availability and performance while mentoring team members. Responsibilities include backend API design and fostering a collaborative engineering culture.
Top Skills: Cloud ComputingGoRuby

What you need to know about the Ottawa Tech Scene

The capital city of Canada and the nation's fourth-largest urban area, Ottawa has proven a rapidly growing global tech hub. With over 1,800 tech companies, many of which are leaders in their sectors, the city's tech talent now makes up more than 13 percent of its total workforce. This growth is driven not only by the big players like UL Solutions and Dropbox, but also by a thriving startup ecosystem, as new businesses emerge to follow in the footsteps of those that came before them.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account