





Monitor and map AWS resource performance opportunities. Analyze deviations using tools such as CloudWatch, DataDog, and OpenTelemetry. Participate in War Room crisis response for troubleshooting. Define SLOs (Service Level Objectives), SLIs (Service Level Indicators), and SLAs (Service Level Agreements) to measure system reliability. What do you need to excel at? Experience with AWS microservices architecture (ECS, Lambda, EKS, API Gateway). Experience with AWS services. Hands-on experience as an SRE/DevOps engineer handling incidents. Proficiency in structured scripting with Terraform, YAML, and JSON. AWS certifications.


