···
Log in / Register
Observability and SRE Specialist
Indeed
Full-time
Onsite
No experience limit
No degree limit
R. Benedita Guerra Zendron, 21 - Vila Sao Joao, Barueri - SP, 06401-190, Brazil
Favourites
Share
Some content was automatically translatedView Original
Description

Job Summary: Focus on observability, operational support, and incident analysis in complex environments, exercising technical leadership and actively collaborating to resolve issues. Key Highlights: 1. Observability and incident analysis in complex environments 2. Technical leadership in war rooms and crisis rooms 3. Building and enhancing system instrumentation and automation We are more than a machine—we are people who transform and **create infinite possibilities.** We work to **simplify and empower businesses for everyone**, delivering intelligent financial solutions. Here, we invest in **technology**, promote **development**, and foster **innovation** to forge new paths and generate positive global impact. At Cielo, we work with **autonomy** to write our own journey, **freedom** to be ourselves, and the opportunity to **make things happen**. We are a team that **dreams collectively**, offering a comprehensive experience focused on the physical and mental well-being of our 7,000+ employees and their families. We believe in **inclusion and embracing** all individuals, honoring their uniqueness and diverse life experiences. Let’s achieve your dreams together! **Responsibilities and Assignments** ----------------------------------- **There’s a place for you in this purpose:** * Focus on observability, identifying failures, outages, and issues in complex environments. * Ensure operational support for Cielo’s products, guaranteeing stability, performance, and reliability. * Exercise technical leadership in war rooms, acting as the technical authority during critical incidents and correlating data across layers to rapidly isolate failures. * Actively participate in crisis rooms, supporting rapid diagnosis and resolution of incidents. * Implement safe recovery mechanisms to reduce MTTR and minimize business impact. * Lead post-mortems, driving systemic improvements and avoiding individual blame. * Build and enhance system instrumentation through logs, metrics, and distributed tracing, increasing end-to-end visibility across microservices and cloud environments. * Create strategic dashboards linking technical metrics to business impact (financial transactions, user experience, availability). * Manage and evolve alert intelligence to reduce operational fatigue and ensure precise team engagement. * Develop automations and automated runbooks to reduce toil and enable self-healing capabilities. * Write scripts and automations in Python and integrate routines with tools such as Jira, Dynatrace, Datadog, and Copilot. * Balance responsibilities across building, supporting, and continuously improving services and operational pipelines. * Collaborate actively with multidisciplinary teams, ensuring clear communication and value-driven delivery. **Requirements and Qualifications** ------------------------------ **What does the #TimeCielo expect from you?** * Experience in observability, operational support, and incident analysis in complex environments. * Knowledge of system instrumentation: metrics, logs, distributed tracing, and microservice monitoring. * Experience in technical leadership within critical contexts (war rooms and crisis rooms). * Analytical ability to correlate events and investigate problems in a structured manner. * Knowledge of automation and hands-on Python skills, including integrations with tools such as Jira, Dynatrace, Datadog, and Copilot. * Experience creating dashboards and business metric–oriented monitoring. * Familiarity with MTTR reduction strategies, safe recovery, and operational resilience. * Ability to lead post-mortems and propose systemic improvements. * Collaborative mindset, strong technical communication skills, and capacity to perform under pressure within multidisciplinary teams. * Continuous improvement mindset, focused on eliminating toil via automation and efficient processes. **Additional Information** -------------------------- **Why live infinite possibilities with us?** * Medical and Dental Assistance; * Annual Variable Compensation (PPR); * Meal and Food Allowance; * Commuter Bus/Transportation Allowance or Parking; * Hybrid Work Model; * Remote Work Allowance; * Life Insurance; * Home and Auto Insurance; * Family Funeral Assistance; * Private Pension; * Access to specialist support channel (nutrition, psychology, gynecology, etc.); * Vaccination Campaign; * Access to various courses on our Educa platform; * Wellhub; * Healthy Pregnancy Program; * Extended Maternity and Paternity Leave; * Daycare Allowance; * Birthday Day Off; * Flexible Dress Code; * Flexible Working Hours; * Short Fridays; * Extended Lunch Break (1h30).

Source:  indeed View original post
João Silva
Indeed · HR

Company

Indeed
João Silva
Indeed · HR
Similar jobs

Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.