DBRE - Senior Database Reliability Engineer
Negotiable Salary
Indeed
Full-time
Onsite
No experience limit
No degree limit
79Q22222+22
Description

Job Description:
* Proven experience administering, operating, and evolving SQL databases in mission-critical production environments: Aurora PostgreSQL (primary stack for this role) and MySQL.
* Hands-on experience with Redis/ElastiCache in low-latency environments.
* Strong expertise with AWS services: RDS/Aurora, ElastiCache, IAM, EC2, VPC, security, and networking.
* Experience working in large-scale environments, automation, cloud computing, and resilient architecture.
* Proficiency in performance tuning, troubleshooting, query analysis, indexing, and operation of distributed databases.
* Solid experience with observability, especially Datadog (APM, logs, metrics, and alerts).
* Experience with Infrastructure as Code (Terraform, CloudFormation, or equivalent).
* Advanced knowledge of Linux/Unix operating systems.
* Experience with automation and scripting languages: Python, Bash, or similar.
* Strong practices in backup, disaster recovery, replication, and data security.
* Ability to work autonomously, make technical decisions, and collaborate effectively with cross-functional teams.
* Analytical mindset, technical curiosity, and focus on solving complex problems.
* Commitment to documentation, best practices, and continuous improvement.

Preferred Qualifications:
* Prior experience in SRE/DBRE or Data Engineering teams within high-availability environments.
* Knowledge of FinOps applied to managed databases on AWS.
* Experience with Chaos Engineering practices applied to data environments.
* Active participation in technical communities, events, reliability initiatives, or open-source projects.

Responsibilities:
* Design, implement, and evolve highly available, resilient, and scalable data architectures, directly contributing to system reliability.
* Administer and optimize Aurora PostgreSQL, MySQL, and Redis/ElastiCache in high-criticality production environments.
* Automate database operations, provisioning, and maintenance processes to reduce manual effort and mitigate operational risks.
* Collaborate closely with engineering and architecture teams, supporting technical decisions and promoting best practices in data engineering and observability.
* Investigate and resolve performance, availability, replication, and consistency issues, proactively preventing incidents.
* Design and maintain robust strategies for backup, recovery, failover, replication, and data security.
* Build and enhance database observability, focusing on Datadog (dashboards, custom metrics, alerts, logs, and APM).
* Lead postmortems, failure analyses, and continuous improvement plans centered on data environment reliability.
* Apply Infrastructure as Code (IaC) and automation pipelines to optimize delivery processes and governance.

Source: Indeed
João Silva
Indeed · HR

Company

Indeed