Site Reliability Engineer III
IN-Remote
India Careers
Req #: 15727
Type: Regular
|
Overview: We're hiring a Site Reliability Engineer to help build and maintain the backbone of Avalara's SaaS platforms. As part of our global Reliability Engineering team, you'll play a key role in ensuring the performance, availability, and observability of critical systems used by millions of users. This role combines hands-on infrastructure expertise with modern SRE practices and the opportunity to contribute to the evolution of AI-powered operations. You'll work closely with engineering and operations teams across regions to drive automation, improve incident response, and proactively detect issues using data and machine learning. Responsibilities: * Own the reliability and performance of production systems across multiple environments and multiple clouds (AWS, GCP, OCI). * Use AI/ML-driven tools and automation to improve observability and incident response. * Collaborate with development teams on CI/CD pipelines, infrastructure deployments, and secure practices. * Perform root cause analysis, drive postmortems, and reduce recurring incidents. * Contribute to compliance and security initiatives (SOX, SOC2, ISO 27001, access and controls). * Participate in a global on-call rotation and knowledge-sharing culture. Qualifications: * 5+ years in SRE, DevOps, or infrastructure engineering roles. * Expertise with AWS (GCP or OCI is a plus), AWS Certified Solutions Architect Associate or equivalent * Strong scripting/programming skills (Python, Go, Bash, or similar) * Experience with infrastructure as code (Terraform, CloudFormation, Pulumi). * Proficiency in Linux environments, containers (Docker/Kubernetes), and CI/CD workflows. * Strong written and verbal communications skills to support world wide collaboration.