




**Job Summary:** We are looking for a Senior SRE.

**Key Highlights:**

1. Direct influence on product and team building
2. Growth alongside the business from the ground up
3. Passion for applying LLMs/AI in operations

Our client is a promising startup in the Generative AI market, delivering solutions to mid-to-large B2B enterprises. Founded in 2024, it already has a robust backlog and real-world challenges around scale, integration, and delivery, with a strong presence in the US.

We seek a hands-on, autonomous Senior SRE to lead the development and operation of an AI-native platform. Solid experience with resilient, scalable, observable, and secure systems is essential, as is a passion for applying AI to diagnostics and automation. You will ensure availability, performance, cost-efficiency, and security, serving as the driving force behind infrastructure and reliability for AI services and microservices.

**What We Expect From You (Responsibilities):**

* Infrastructure & Orchestration: Design cloud/on-prem infrastructure and lead Docker/Kubernetes operations (optimizing autoscaling, rollouts, security).
* CI/CD & Observability: Build reliable pipelines (Git/gates/automations) and implement end-to-end observability (SLOs/SLIs/SLAs, logs/metrics/tracing).
* Architecture: Operate microservices (service mesh, resilience patterns) and manage critical data (PostgreSQL HA/tuning).
* Security: Secret management, access policies, supply chain security, and hardening.
* Automation/IaC/GitOps: Implement Infrastructure-as-Code and GitOps (Terraform/Helm/ArgoCD).
* Incidents & AI: Lead incident response and postmortems, with continuous improvement driven by data and AI.
* Collaboration: Align closely with Engineering, Product, Data, and ML teams.

**Requirements:**

* 6+ years in SRE/DevOps/high-scale platform roles.
* Proficiency in Kubernetes, Docker, CI/CD, Observability (SLOs), PostgreSQL, Microservices Architecture, Security, and Automation/IaC/GitOps.
* Passion for applying LLMs/AI in operations.
* Experience with Node.js/Python, NestJS/React, Git/Cursor, GCP (other clouds are a plus), PostgreSQL, Docker/Kubernetes, Terraform/Helm/ArgoCD.

**Nice-to-haves:** Experience with AI SDKs/LLMs, Operational Automations (N8N/Crew.ai), Vector Databases (RAG/pgvector), Kafka/RabbitMQ, FinOps/Chaos Engineering/SAST/DAST.

**Benefits:**

* An environment with genuine autonomy and intense collaboration;
* Direct influence on product and team building;
* Growth alongside the business from the ground up;
* *Fixed salary of R$ 28.000,00 (monthly, PJ) + real possibility of stock options;*
* **Hybrid work model in Belo Horizonte.**


