




We are looking for a **DevOps / Site Reliability Engineer (SRE)** to work on high-complexity projects, playing a strategic role in ensuring the **availability, performance, security, and scalability** of critical platforms. This person will be responsible for improving system reliability, working closely with **software engineering, data, architecture, and product teams**, in cloud-based, distributed, and metrics-driven environments. The context involves mission-critical systems, high traffic volume and data volume, complex integrations, the need for advanced observability, and continuous automation of infrastructure and operations. **Responsibilities and Duties** * Act as a **DevOps / SRE**, serving as the technical reference for reliability and operations * Design, implement, and evolve **scalable, resilient, and secure infrastructure** * Ensure **high availability, performance, and fault tolerance** of systems * Define and implement **observability practices** (logs, metrics, traces, alerts, and SLOs) * Handle **incident management**, troubleshooting, and root cause analysis (RCA) * Automate provisioning, deployment, and operational processes (**Infrastructure as Code**) * Enhance **CI/CD pipelines**, ensuring quality, security, and traceability * Work with **cloud-native architectures, microservices, and event-driven systems** * Support architectural decisions focused on **reliability, cost, and scalability** * Promote a culture of **reliability, automation, and continuous improvement** **Requirements and Qualifications** * Solid experience as a **DevOps Engineer, SRE, or equivalent role** * Strong knowledge of **cloud computing** (AWS, Azure, or GCP) * Experience with **containers and orchestration** (Docker, Kubernetes) * Hands-on experience with **Infrastructure as Code** (Terraform, Bicep, CloudFormation, or similar) * Experience with **CI/CD** (GitHub Actions, GitLab CI, Azure DevOps, Jenkins, etc.) * Knowledge of **observability and monitoring** (Prometheus, Grafana, Datadog, New Relic, ELK, OpenTelemetry) * Experience with **networking, basic security, and automation** * Ability to operate in critical and highly complex environments * Analytical, collaborative, and problem-solving mindset * Practical experience with **SRE (SLIs, SLOs, Error Budget)** * Experience with **large-scale and mission-critical architectures** * Experience with **data platforms** and compute-intensive workloads * Knowledge of **FinOps and cloud cost optimization** * Experience in regulated or enterprise environments **Additional Information** * Involvement in strategic and technically challenging projects * Close collaboration with engineering, data, and product teams * Technical autonomy for high-impact decisions * Collaborative environment focused on operational excellence * Work model and benefits aligned with market standards **Here, we are \#SangueLaranja!** We have been in the market for 17 years, side by side with our clients, delivering transformative experiences. We are a global technology and innovation ecosystem; beyond Brazil, we operate in Europe and the UK, with offices in Portugal, London, Dubai, and the Netherlands. **F for Formation: We believe in practicing a culture of knowledge sharing, community spirit, and that knowledge** **has the power to transform!** We run initiatives and social actions that foster development, such as the Orange Juice tech community, the Training Program, our leadership academy, and multiple partnerships with NGOs and Edtechs. **At FCamara, everyone is welcome—Diversity, Respect, and Ethics are non-negotiable elements embedded in our DNA.** **So, are you ready to join an amazing team and become the protagonist of your own story?**


