




**Job Summary:** We are seeking a Senior SRE.

**Key Highlights:**

1. Lead the construction and operation of an AI Native platform
2. Direct influence on product and team development
3. An environment with genuine autonomy and intensive collaboration

Our client is a promising startup in the Generative AI market, delivering solutions for mid-to-large B2B enterprises. Founded in 2024, it already has a robust backlog, real-world challenges around scale, integration, and delivery, and a strong presence in the US.

We seek a hands-on, autonomous Senior SRE to lead the construction and operation of an AI Native platform. Solid experience with resilient, scalable, observable, and secure systems, together with a passion for applying AI to diagnostics and automation, is essential. You will ensure availability, performance, cost-efficiency, and security, serving as the driving force behind infrastructure and reliability for AI services and microservices.

**What We Expect From You (Responsibilities):**

* Infrastructure & Orchestration: Design cloud/on-prem infrastructure and lead Docker/Kubernetes operations (optimizing autoscaling, rollouts, security).
* CI/CD & Observability: Develop reliable pipelines (Git/gates/automations) and implement end-to-end observability (SLOs/SLIs/SLAs, logs/metrics/tracing).
* Architecture: Operate microservices (service mesh, resilience patterns) and manage critical data (PostgreSQL HA/tuning).
* Security: Secret management, access policies, supply chain security, and hardening.
* Automation/IaC/GitOps: Implement Infrastructure as Code and GitOps (Terraform/Helm/ArgoCD).
* Incidents & AI: Conduct incident response and postmortems, with continuous improvement driven by data and AI.
* Collaboration: Align with Engineering, Product, Data, and ML teams.

**Requirements:**

* 6+ years in SRE/DevOps/High-scale Platforms. Proficiency in Kubernetes, Docker, CI/CD, Observability (SLOs), PostgreSQL, Microservices Architecture, Security, and Automation/IaC/GitOps.
* Passion for leveraging LLMs/AI in operations. Experience with Node.js/Python, NestJS/React, Git/Cursor, GCP (experience with other clouds is a plus), PostgreSQL, Docker/Kubernetes, Terraform/Helm/ArgoCD.

**Differentiators:**

* Experience with AI SDKs/LLMs, Operational Automations (N8N/Crew.ai), Vector Databases (RAG/pgvector), Kafka/RabbitMQ, FinOps/Chaos Engineering/SAST/DAST.

**Benefits:**

* An environment with genuine autonomy and intensive collaboration;
* Direct influence on product and team development;
* Growth alongside the business from the ground up;
* *Fixed salary of R$ 28.000,00 (monthly, PJ) + Real possibility of Stock Options;*
* **Hybrid work model in Belo Horizonte.**


