Technical Skills
Advanced knowledge of core AWS services: EC2, ECS/EKS, Lambda, S3, RDS/Aurora, DynamoDB, VPC, ELB/ALB/NLB, Route 53, IAM.
Designing multi-AZ and multi-region highly available architectures.
Broad understanding of networking in AWS (subnets, route tables, NAT, security groups, NACLs, VPC peering, PrivateLink).
Experience with the AWS Well-Architected Framework pillars (especially reliability, security, and cost optimization).
Designing fault-tolerant and horizontally scalable systems.
Advanced proficiency in Terraform, CloudFormation, or CDK.
Hands-on experience with CloudWatch, Prometheus, Grafana, Datadog, Dynatrace, or OpenTelemetry.
Modular IaC design patterns and state management best practices.
Own end-to-end system reliability, availability, and performance using clearly defined SLAs, SLOs, and SLIs, with continuous monitoring and proactive improvement of service health.
Establish and govern error budget policies in partnership with engineering leadership to b...