Acquia is an open source digital experience company. We provide the world's most ambitious brands with technology that allows them to embrace innovation and create customer moments that matter. At Acquia we believe in the power of community and collaboration - giving our customers the freedom to build tomorrow on their terms.
Headquartered in Boston, we have been named one of North America’s fastest-growing software companies by Deloitte and Inc. Magazine, rated a leader by the analyst community, and named one of the Best Places to Work by the Boston Business Journal. We are Acquia. We are building for the future of the web, and we want you to be a part of it.
Our Drupal Applications Team, along with a robust community of open source contributors, is on a mission to make Drupal the world's most user-friendly CMS. We are seeking an experienced Site Reliability Engineer to help build highly resilient and scalable systems by automating, measuring, and monitoring everything. SREs have the explicit authority and responsibility to ‘stop the line’ on releases when a service falls below its SLA, and to overflow manual labor to the overall engineering team when the level of manual work exceeds a sustainable level. We thrive on innovation, collaboration, and an agile mindset backed by agile processes.
Job Responsibilities
- Work with the team to implement highly available and scalable architectures for core and third-party components of Acquia Source;
- Solve availability/performance problems and build software-based solutions to prevent recurrences;
- Guide and implement build pipelines and automated deployments and releases;
- Implement metrics, monitoring, and incident response processes;
- Initiate automated production deployments for patches and features;
- Be aware of operations-related issues affecting Acquia Source;
- Participate in compliance efforts to ensure products meet audit requirements;
- Monitor levels of manual effort and signal when they grow;
- Measure availability metrics and signal when they fall below SLA;
- Share a 24/7 on-call rotation with development engineers;
- Contribute as part of a Scrum team to maintain a deep understanding of system functionality and architecture, with a primary focus on operational aspects of the service (availability, performance, change management, emergency response, capacity planning, etc.).
Requirements
- BS in Computer Science or a comparable field of study, or equivalent practical experience.
- Experience in the observability space (Sumo Logic, Grafana, Kibana, New Relic, etc.).
- Experience working with one or more of: Ruby, PHP, Java, JavaScript, Go, Python.
- Understanding of web fundamentals, including REST APIs.
- Experience with Unix/Linux systems administration using the CLI.
- Solid oral and written communications skills.
Preferred Qualifications
- Experience building systems on AWS
- Demonstrable experience with Drupal
- Understanding of Software Development Life Cycle, Test Driven Development, Continuous Integration, and Continuous Delivery
We are an organization that embraces innovation and the potential of AI to enhance our processes and improve our work. We are always looking for individuals who are open to learning new technologies and collaborating with AI tools to achieve our goals.
Acquia is proud to provide best-in-class benefits to help our employees and their families maintain a healthy body and mind. Core benefits include competitive healthcare coverage, wellness programs, take-it-when-you-need-it time off, parental leave, recognition programs, and much more!
Individuals seeking employment at Acquia are considered without regard to race, color, religion, caste, creed, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. Whatever you answer will not be considered in the hiring process or thereafter.

