The Director of Site Reliability Engineering leads cloud service management, incident response, and automation, mentoring SRE teams and optimizing cloud infrastructure for reliability and cost efficiency.
HHAeXchange is the leading technology platform for home and community-based care. Founded in 2008, HHAeXchange was born out of an idea to create a fully comprehensive end-to-end homecare solution to help people who are aging or have disabilities thrive in their homes and communities. Our employees are passionate about transforming the healthcare space by building the only homecare ecosystem that fully connects patients, personal care providers, managed care organizations, and states.
The Director, Site Reliability Engineering is responsible for leading the teams that manage and support all of our hosting services, including colocated hardware and cloud-based services, as well as defining and operating the processes for change management, financial management and incident response.
To perform this job successfully, an individual must be able to perform each essential job duty satisfactorily with or without reasonable accommodation. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
Essential Job Duties
- Ensure high availability, scalability, and security of cloud services across multiple geographies.
- Implement and improve automation, incident management, and capacity planning practices.
- Lead and mentor a team of Site Reliability Engineers and leaders. Lead the transformation of the organization to an SRE model.
- Integrate the technology, practices, and policies of disparate organizations into a single cohesive team that supports a range of technologies and platforms with minimal variation in practice.
- Develop and execute strategic plans for cloud infrastructure and operations to support business growth and acquisitions.
- Oversee the management and optimization of cloud infrastructure for cost-efficiency.
- Maintain and improve monitoring, logging, and alerting systems.
- Collaborate closely with product development teams to facilitate delivery of new functionality and capabilities to our SaaS platform and hosted products.
- Champion and support the transformation to a DevOps culture.
- Develop and manage budgets for cloud infrastructure and tooling.
- Evaluate and implement new technologies and tools to enhance cloud infrastructure and operations.
- Foster a culture of continuous improvement, collaboration, and innovation.
Other Job Duties
- Other duties as assigned by supervisor or HHAeXchange leader.
Travel Requirements
- Travel up to 10%, including overnight travel
Required Education, Experience, Certifications and Skills
- Bachelor’s or master’s degree in Computer Science, Engineering, or a related field.
- 10+ years of experience in cloud engineering and operations, with at least 5 years in a leadership role.
- Proven experience managing large-scale AWS cloud platforms.
- Deep understanding of modern SRE practices and principles.
- Experience with cloud infrastructure tools (monitoring, deployment, security).
- Excellent leadership, communication, and interpersonal skills.
- Proven experience driving process and culture transformation across organizations.
- Ability to work effectively with cross-functional teams and stakeholders.
- Strong problem-solving and decision-making abilities.
The base salary range for this US-based, full-time, exempt position is $185,000-$205,000, not including variable compensation. An employee’s exact starting salary will be based on various factors including, but not limited to, experience, education, training, merit, location, and the ability to exemplify the HHAeXchange core values.
This is a benefits-eligible position. HHAeXchange offers competitive health plans, paid time off, company-paid holidays, a 401(k) retirement program with a Company-elected match, and other company-sponsored programs.
HHAeXchange is an equal-opportunity employer. The Company offers employment opportunities to all applicants and employees without regard to race, color, religion, national origin, sex, sexual orientation, gender identity or expression, age, disability, medical condition, marital status, veteran status, citizenship, genetic information, hairstyles, or any other status protected by local or federal law.
Top Skills
AWS
Cloud Infrastructure
Deployment Tools
Incident Management Tools
Monitoring Tools
Security Tools