Coinbase Logo

Coinbase

Sr. Site Reliability Engineer

Posted 17 Hours Ago
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
As a Senior Site Reliability Engineer, you will manage AI tools, ensure system reliability, develop automation scripts, and collaborate across teams to enhance AI infrastructures.
The summary above was generated by AI

Ready to be pushed beyond what you think you’re capable of?

At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform — and with it, the future global financial system.

To achieve our mission, we’re seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the financial system. We want someone who is eager to leave their mark on the world, who relishes the pressure and privilege of working with high caliber colleagues, and who actively seeks feedback to keep leveling up. We want someone who will run towards, not away from, solving the company’s hardest problems.

Our work culture is intense and isn’t for everyone. But if you want to build the future alongside others who excel in their disciplines and expect the same from you, there’s no better place to be.

We are looking for a Site Reliability Engineer (SRE) to join the IT AI Infrastructure team to deploy, manage, and optimize AI-powered productivity tools and in-house AI solutions that enhance employee efficiency at scale. A successful candidate will have demonstrated success in similar roles within high-growth, security-conscious environments, bringing deep expertise in public cloud infrastructure (AWS/GCP), backend development (Python, Go, or Java), and automation tooling. The right person is passionate about building scalable and reliable AI infrastructure, driving automation, and collaborating across disciplines to integrate AI systems while maintaining strong security and compliance standards.

What You’ll Be Doing:

  • Deployment and Management of AI Tools: Deploy, configure, and manage AI-powered employee productivity tools and in-house AI built solutions 
  • Reliability and Performance: Ensure high availability, reliability, and optimal performance of AI platforms and services. Implement monitoring, alerting, and incident response procedures.
  • Scalability and Infrastructure: Design and implement scalable infrastructure to support the growing demands of AI tools and user base. Optimize resource utilization and manage capacity planning.
  • Automation and Tooling: Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance tasks. Contribute to the experimental sandbox environments for testing new AI solutions.
  • Collaboration and Support: Collaborate with cross-functional teams (Machine-Learning, HR, Security, Data Science, Developer Experience) to support the development and integration of AI solutions. Provide technical support and troubleshooting for AI-related issues.
  • Security and Compliance: Adhere to security and privacy policies while deploying and managing AI tools. Ensure compliance with regulatory requirements.
  • Monitoring and Metrics: Implement comprehensive monitoring and metrics to track the performance and health of AI systems. Analyze data to identify areas for improvement and optimization.
  • Incident Response: Participate in incident response and troubleshooting for AI-related outages or performance issues. Develop and maintain incident response plans.
  • Backend Development: Contribute to backend development tasks to support the integration and functionality of AI tools.
  • Public Cloud Management: Deploy and manage AI solutions on public cloud platforms (AWS/GCP), leveraging cloud-native services and best practices.
  • Written and Verbal Communication: Excellent communication skills and experience presenting technical information to non-technical audiences, including senior leadership.

What We Look For In You:

  • Proven experience as a Site Reliability Engineer (SRE) or similar role.
  • Strong understanding of AI technologies and platforms.
  • Experience with deploying and managing applications in a cloud environment (AWS/GCP).
  • Solid backend development experience with programming languages such as Python, Java, or Go.
  • Strong proficiency in managing and configuring public cloud services (AWS/GCP) for scalability and reliability.
  • Experience with automation tools and scripting (e.g., Ansible, Terraform, Bash, Python).
  • Excellent troubleshooting and problem-solving skills.
  • Strong communication and collaboration skills.
  • Strong security and compliance understanding.
  • Experience working in a highly regulated environment
  • Experience in a fast-paced, high-growth company

ID: P70538

Please be advised that each candidate may submit a maximum of four applications within any 30-day period. We encourage you to carefully evaluate how your skills and interests align with Coinbase's roles before applying.

Commitment to Equal Opportunity

Coinbase is committed to diversity in its workforce and is proud to be an Equal Opportunity Employer.  All qualified applicants will receive consideration for employment without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, sex, gender expression or identity, sexual orientation or any other basis protected by applicable law. Coinbase will also consider for employment qualified applicants with criminal histories in a manner consistent with applicable federal, state and local law.  For US applicants, you may view the Know Your Rights notice here.  Additionally, Coinbase participates in the E-Verify program in certain locations, as required by law. 

Coinbase is also committed to providing reasonable accommodations to individuals with disabilities. If you need a reasonable accommodation because of a disability for any part of the employment process, please contact us at accommodations[at]coinbase.com to let us know the nature of your request and your contact information.  For quick access to screen reading technology compatible with this site click here to download a free compatible screen reader (free step by step tutorial can be found here).

Global Data Privacy Notice for Job Candidates and Applicants

Depending on your location, the General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA) may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available here. By submitting your application, you are agreeing to our use and processing of your data as required. For US applicants only, by submitting your application you are agreeing to arbitration of disputes as outlined here.    


Top Skills

Aws,Gcp,Python,Go,Java,Ansible,Terrraform,Bash

Coinbase Ottawa, Ontario, CAN Office

Ottawa, ON, Canada

Similar Jobs at Coinbase

17 Days Ago
Remote
USA
Senior level
Senior level
Cloud • Fintech • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will enhance system reliability, improve observability, build automation, and optimize cloud deployments while mentoring engineers and ensuring process improvements.
Top Skills: AWSAzureDatadogDockerEc2GCPGoKibanaKubernetesRubyTerraform
Senior level
Cloud • Fintech • Cryptocurrency • NFT • Web3
As a Senior Site Reliability Engineer, you will manage IAM systems, implement cloud-native applications, and enhance automation and security in operations, ensuring peak uptime and performance.
Top Skills: AnsibleAWSAzureAzure AdC#DockerDuoGCPGoGoogle WorkspaceJavaKubernetesOktaPingPythonRubyTerraform
22 Days Ago
Remote
USA
Mid level
Mid level
Cloud • Fintech • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will build and maintain secure system architectures for various client platforms and manage DevOps tooling.
Top Skills: AnsibleAutopkgAWSAzureGCPGoMicromdmMunkiNanomdmPuppetPythonRubyTerraform

What you need to know about the Ottawa Tech Scene

The capital city of Canada and the nation's fourth-largest urban area, Ottawa has proven a rapidly growing global tech hub. With over 1,800 tech companies, many of which are leaders in their sectors, the city's tech talent now makes up more than 13 percent of its total workforce. This growth is driven not only by the big players like UL Solutions and Dropbox, but also by a thriving startup ecosystem, as new businesses emerge to follow in the footsteps of those that came before them.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account