




Job Summary: We are seeking a Senior Data Engineer to design, develop, and optimize data solutions in an innovative, data-driven environment, ensuring reliability and scalability. Key Highlights: 1. Positively impact billions of lives 2. A team passionate about innovation and data-driven culture 3. Focus on reliability, performance, and scalability **Positively impacting billions of lives is our purpose—and it can be yours too!** Founded in 2010 in Brazil, Semantix is a reference in Big Data, Analytics, and Artificial Intelligence. We are a team of innovation enthusiasts with diverse backgrounds and varying levels of experience. What unites us is our shared motivation to transform our customers’ experiences through a data-driven culture. If this resonates with you, Semantix is the place for you. We’re eager to welcome you to our team. After all, **the future is built together**. Requirements: We are looking for a Senior Data Engineer with solid experience in distributed environments, pipeline development, and API-based integrations, with strong collaboration with business areas. This person will be responsible for designing, developing, and optimizing data solutions that deliver real value to the company, with emphasis on reliability, performance, scalability, and engineering best practices. **Responsibilities and Duties:** * Implement data engineering solutions aligned with project requirements. * Extract, transform, and load (ETL/ELT) data between various sources and destinations. * Develop and maintain distributed database architectures oriented toward Big Data. * Build efficient pipelines integrated via API calls. * Work with layered architecture (Medallion – Bronze, Silver, Gold). * Gather requirements from business areas and translate them into technical solutions. * Develop data processing and transformation programs and scripts in Python and PySpark. * Ensure reliability, performance, and quality of deliverables. * Promote technical best practices and analytical culture within the team. **Requirements and Qualifications:** * Proficiency in PySpark, Python, Databricks, and Git. * Solid experience in data modeling. * Experience in ETL/ELT and data pipeline construction. * Knowledge of Medallion architecture (Bronze, Silver, Gold). * Experience working in Azure environments (Data Lake, Storage, ADF). * Experience with Spark SQL and SQL for query optimization. * Experience with data integration via APIs. * Strong communication skills and ability to collaborate effectively with business areas. **Nice-to-Have:** * Knowledge of MongoDB. * Adoption of software development best practices (Clean Code, modularization, automated testing, CI/CD). * Experience with hybrid environments or cloud migration. Benefits Competitive salary; Caju (flexible card) with R$ 1\.060/month loaded; Bradesco Health Plan; Bradesco Dental Plan; * ️ Preventive medicine with Dr. Alper; Life Insurance; * ️ Gympass; ️ SESC; Daycare allowance for mothers and fathers; Profit Sharing (PLR); Learning – an area focused on developing hard and soft skills; Partnerships with educational institutions for technical training, MBAs, postgraduate studies, certifications, English, and Spanish; Career Development Plan; Discounts on products through a partner portal. We emphasize that all our job openings are open to individuals of all profiles and backgrounds, valuing diversity and fostering an inclusive and welcoming environment for everyone.


