




**Description: Apply quickly via email:**

**Requirements and qualifications:**

- Fluent English.
- Deep expertise in Site Reliability Engineering (SRE) and DevOps, with a track record in critical, high-availability environments.
- Solid experience in managing and automating critical incidents, root cause analysis, and implementing preventive measures.
- Proficiency in container orchestration (Kubernetes) and advanced use of Helm for application management.
- Hands-on experience with cloud platforms (AWS, Azure, or Google Cloud), including automation, monitoring, and cost optimization.
- Experience with CI/CD, continuous delivery pipelines, automated testing, and tools such as Jenkins, Bamboo, Travis, or Brigade.
- Knowledge of infrastructure-as-code (Terraform, Ansible) and UNIX/Linux system administration.
- Strong foundation in TCP/IP networking, PKI, and system security.
- Advanced scripting skills (bash, sh, ksh) and proficiency in at least one additional language (Go, Python, JavaScript, or Perl).

**Desirable:**

- Prior experience with global platforms or multi-time-zone, multicultural environments.
- Active contributions to SRE/DevOps-related open-source communities or projects.
- Proven case studies demonstrating large-scale incident reduction, performance improvement, or cost optimization.
- Certifications in Cloud, Kubernetes, DevOps, or SRE.

**Benefits:**

- Health insurance
- Dental plan
- Life insurance
- Flexible benefit of R$ 1,270.00
- Gym membership
- Birthday day off
- Extended maternity leave
- Extended paternity leave

**Work schedule:** Business hours

**Knowledge:**

**Education:** Completed high school


