




**This position is designated for people with disabilities**

We are seeking a **Senior Data Scientist** to develop and evolve **applied AI solutions**, with a focus on **AI agents** (e.g., RAG, tool-use, memory, evaluation, and observability). You will work with data from multiple sources and collaborate with engineering and product teams to turn business needs into measurable, reliable, and scalable systems, with attention to **quality, cost, and performance**.

**Responsibilities and duties**

* Participate in defining and validating business hypotheses through data and experiments;
* Design and execute advanced analyses, metrics, and evaluation strategies for agents (e.g., golden sets, regression testing, failure analysis);
* Build and maintain data pipelines and training/testing datasets, focusing on reproducibility and quality;
* Develop and fine-tune ML models when necessary (baseline to production), and integrate them with generative AI solutions;
* Support the design and improvement of RAG: chunking strategy, metadata, deduplication, ranking signals, knowledge base updates;
* Implement and monitor observability: structured logs, quality metrics, dashboards, and alerts;
* Collaborate with engineering and product teams on architecture definition, requirements, trade-offs, and roadmap;
* Review PRs, share knowledge, and mentor less experienced team members.

**Requirements and qualifications**

* Advanced proficiency in Python and SQL (CTEs, window functions, basic optimization);
* Solid foundation in applied statistics and evaluation (metrics, biases, tests, interpretation);
* Practical experience in Machine Learning (training and validation, feature selection, metrics, deployment, and integration);
* Experience with engineering best practices: versioning, testing, documentation, and observability;
* Ability to structure problems and make decisions involving trade-offs (quality vs. latency vs. cost);
* Experience working in cloud environments (AWS/GCP/Azure) and/or production pipelines.

**Preferred qualifications**

* Experience with **generative AI**, RAG, and agents (prompting, tool-use, memory, guardrails);
* Experience with embeddings and vector search mechanisms (FAISS, pgvector, Milvus, Pinecone);
* MLOps: experiment tracking (MLflow/W&B), CI/CD for models, drift monitoring;
* Construction of evaluation datasets and labeling (process, guidelines, agreement);
* Experience with A/B testing and product experimentation;
* Experience with data warehouses/lakes and orchestration (Airflow, Prefect, Dagster).

**Languages**

* Advanced English (reading documentation and research papers).

**Academic background**

* Bachelor’s degree in Computer Science, Engineering, Statistics, Mathematics, or related fields.

**Soft skills**

**Communication and collaboration**

* Strong communication skills to clearly explain technical analyses and results to non-technical audiences.
* Ability to collaborate effectively within multidisciplinary teams (data, product, engineering).

**Learning and growth**

* Intellectual curiosity and continuous learning, especially in generative AI and agents.
* Openness to feedback and ongoing technical development.

**Organization and accountability**

* Attention to detail and rigor regarding data and documentation quality.
* Ability to follow processes and best practices (versioning, code review, documentation).

**Professional attitude**

* Proactivity in identifying improvements to data, pipelines, and evaluations whenever possible.
* Resilience and patience to handle experimentation, errors, and iteration.
* Ethical stance in the use of data and AI models.
* Ability to lead small initiatives and onboard new team members.

**Additional information**

Hybrid work opportunity in **Recife-PE**.

**What we expect from all \#SiDiers:**

* Embrace the new
* Maintain open and direct dialogue
* Continuously develop
* Always do your best
* Build in partnership
* Think big, move fast

**What we offer**

* **Work-life balance:** 40-hour work week under the CLT regime, flexible hours with hybrid work (4 days in-office, 1 day remote);
* **Well-being:** Gympass (WellHub), workplace gymnastics, quick massage, and psychological support;
* **Health:** Medical and dental insurance for you and your family;
* **Children:** Daycare allowance for miniSiDiers, 120-day maternity leave, extended paternity leave;
* **Future:** SiDi co-invests with you in a private pension plan;
* **Education:** Study incentive program for continued education and specialization; language fluency support; weekly lecture cycles on global *trending topics*;
* **Meals:** Flexible meal and food allowances;
* **Transportation:** Transportation subsidy/reimbursement for commuting to SiDi, and parking for those working in the office;
* **Recognition:** Annual performance bonus and awards for SiDiers who achieve outstanding results;
* **Diversity:** Committees on Well-being, Diversity, Mental Health, Social Impact, Sustainability, and Women in Tech;
* **Modern offices:** Relaxed, collaborative environments with social areas, decompression rooms, pantries, and coffee machines;
* **Want more?** Dozens of partnerships offering benefits and discounts!

For us, inclusion isn’t just about adding one more person to our team; it’s about embracing differences and unique perspectives. It is precisely this blend of techniques, skills, and stories that inspires us to align diverse viewpoints toward a shared horizon, always moving forward. We operate in an environment of equal opportunity, regardless of gender, sexual orientation, religion, or disability.

**All our positions are open to people with disabilities (PCDs).**

JOB CODE: 5647 - N2

**We are one of Brazil’s largest science and technology institutes. With over 800 SiDiers and growing, we are already present in Campinas, Manaus, and Recife, home to Brazil’s largest technology and innovation parks.**

Because those who aspire to build projects that transform the world cannot stop transforming themselves.

In **20 years of history**, we’ve specialized in solving complex challenges and carry in our portfolio **over 1,100 projects**, impacting millions of lives, driving innovation, and making the future happen now.


