




Job Summary: As a Senior Software Engineer at Mercado Livre, you will design and scale innovative and secure systems to democratize e-commerce and financial services across Latin America. Key Highlights: 1. Design and scale innovative and secure systems 2. Work with cutting-edge technology and proprietary AI models 3. Democratize e-commerce and financial services in Latin America As a **Senior Software Engineer** at Mercado Livre, you will design and scale innovative and secure systems that solve real-world, high-impact problems. You will work in a dynamic environment leveraging cutting-edge technology, engineering best practices, proprietary AI models, and continuous learning—all aimed at democratizing e-commerce and financial services in Latin America. Imagine yourself leading challenging, dynamic, and innovative projects and **being responsible for:** * Designing and evolving the end-to-end infrastructure scaling platform for critical events (campaigns, migrations, load testing), prioritizing reliability, traceability, and operational experience. * Leading technical proposals for pre-boost/de-boost orchestration (windows, policies, validations), ensuring robustness and performance. * Implementing capabilities for automatic discovery of impacted flows and applications during events, reducing operational and human dependency on application owners. * Developing a mechanism to predict traffic and impact of events (peaks, ramp-up, critical services) using historical data and business signals, and translating predictions into scaling recommendations/actions. * Defining and automating guardrails during event windows (detection/mitigation of unexpected changes such as deployments or degradations), protecting core business metrics and optimizing scaling efficiency and costs (minima, quotas, strategies by criticality Low/Medium/High). * Coordinating with IT, SRE, and Business teams to map flows and dependencies, ensure pre-event readiness for traffic surges, and lead postmortems with actionable improvements. **What we’re looking for in you?** * Experience developing platforms and microservices (Golang/Java/Python), with focus on distributed systems, high availability, and idempotency. * Practical knowledge of event-driven architectures and messaging/streaming systems—ideally Kafka—to coordinate flows across multiple applications and ensure consistency under retries/failures. * Proficiency in end-to-end observability workflows using Datadog/Grafana/OpenTelemetry: defining SLIs/SLOs, dashboards and actionable alerts, tracing/auditing operations, conducting postmortem analyses, as well as managing incidents and making decisions in war rooms. * Experience managing cloud infrastructure and IaC/GitOps (provisioning, quotas, minima, scaling policies, cost optimization) to support traffic spikes by criticality Low/Medium/High is a plus. * Experience building and integrating data/ML models for traffic and impact prediction and for automatic discovery of affected flows (leveraging business signals + tracing/dependencies), translating them into automated scaling recommendations/actions is a plus. **Are you excited to leave your mark on Latin America’s technology landscape?** Apply now and join our purpose! Hybrid work model. Location: Osasco, São Paulo.


