···
Log in / Register

Senior Site Reliability Engineer (SRE)

R$28,000/month
Indeed
Full-time
Onsite
No experience limit
No degree limit
R. Espírito Santo, 700 - Centro, Belo Horizonte - MG, 30160-030, Brazil
Favourites
Share
Some content was automatically translatedView Original

Description

Job Summary: We are looking for a Senior SRE. Key Highlights: 1. Lead the development and operation of an AI Native platform 2. Direct influence on product and team building 3. Grow with the business from the ground up Our client is a promising startup in the Generative AI market, delivering solutions to mid-to-large B2B enterprises. Founded in 2024, it already has a robust backlog and real-world challenges around scale, integration, and delivery, with a strong presence in the US. We seek a hands-on, autonomous Senior SRE to lead the development and operation of an AI Native platform. Solid experience in resilient, scalable, observable, and secure systems is essential, along with passion for applying AI to diagnostics and automation. You will ensure availability, performance, cost-efficiency, and security, serving as the driving force behind infrastructure and reliability for AI services and microservices. **What We Expect From You (Responsibilities):** * Infrastructure & Orchestration: Design cloud/on-prem infrastructure and lead operations * Docker/Kubernetes (optimizing autoscaling, rollouts, security). * CI/CD & Observability: Build reliable pipelines (Git/gates/automations) and implement end-to-end observability (SLOs/SLIs/SLAs, logs/metrics/tracing). * Architecture: Operate microservices (service mesh, resilience patterns) and manage critical data (PostgreSQL HA/tuning). * Security: Secret management, access policies, supply chain security, and hardening. * Automation/IaC/GitOps: Implement Infrastructure-as-Code and GitOps (Terraform/Helm/ArgoCD). * Incidents & AI: Lead incident response and postmortems with continuous, data- and AI-driven improvement. * Collaboration: Align with Engineering, Product, Data, and ML teams. Requirements: * 6+ years in SRE/DevOps/High-scale Platform roles. Proficiency in Kubernetes, Docker, CI/CD, Observability (SLOs), PostgreSQL, Microservices Architecture, Security, and Autonomy/IaC/GitOps. * Passion for leveraging LLMs/AI in operations. Experience with Node.js/Python, NestJS/React, Git/Cursor, GCP (other clouds are a plus), PostgreSQL, Docker/Kubernetes, Terraform/Helm/ArgoCD. * * *Nice-to-haves:* Experience with AI SDKs/LLMs, Operational Automations (N8N/Crew.ai), Vector Databases (RAG/pgvector), Kafka/RabbitMQ, FinOps/Chaos Engineering/SAST/DAST. Benefits * An environment with genuine autonomy and intense collaboration; * Direct influence on product and team building; * Growth with the business from the ground up; * *Fixed monthly salary of R$ 28,000.00 (PJ) + Realistic Stock Options;* * **Hybrid work model in Belo Horizonte.**

Source:  indeed View original post
João Silva
Indeed · HR

Company

Indeed
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.