




Job Summary: We are seeking a Senior Data Scientist to develop and evolve applied AI solutions, with a focus on AI agents, working with data from multiple sources and collaborating with engineering/product teams.

Key Highlights:

1. Work on the development and evolution of applied AI solutions
2. Focus on AI agents (RAG, tool-use, memory, evaluation, and observability)
3. Work with data from multiple sources

If you are curious, proactive, enjoy solving complex problems, and believe in the power of innovation, join us to build the future and become part of the **#SiDiers** team.

We are looking for a **Senior Data Scientist** to work on developing and evolving applied AI solutions, with a focus on AI agents (e.g., RAG, tool-use, memory, evaluation, and observability). You will work with data from multiple sources and collaborate with engineering/product teams to transform business needs into measurable, reliable, and scalable systems, paying close attention to quality, cost, and performance.

**Responsibilities and Duties**

* Participate in defining and validating data- and experiment-driven business hypotheses;
* Design and execute advanced analyses, metrics, and evaluation strategies for agents (e.g., golden sets, regression testing, failure analysis);
* Build and maintain data pipelines and training/test datasets, focusing on reproducibility and quality;
* Develop and fine-tune ML models when necessary (*baseline* production), and integrate them with generative AI solutions;
* Support the design and improvement of RAG: chunking strategy, metadata, deduplication, ranking signals, knowledge base updates;
* Implement and monitor observability: structured logs, quality metrics, dashboards, and alerts;
* Collaborate with engineering and product teams to define architecture, requirements, trade-offs, and roadmap;
* Review PRs, share knowledge, and mentor less experienced colleagues.

**Requirements and Qualifications**

**What We Expect From You**

* Experience in Natural Language Processing (NLP): tokenization, stemming/lemmatization, stopwords, n-grams, and text representations such as TF-IDF, Bag-of-Words, and dense embeddings.
* Experience in Information Retrieval (IR): inverted indexes, classical ranking models (BM25, TF-IDF), and search system evaluation using metrics like Precision, Recall, F1, MRR, and NDCG.
* Experience with semantic search.
* Practical experience with Python for prototyping and experimentation (Jupyter Notebooks or similar).
* Experience with modern NLP libraries such as Hugging Face Transformers, sentence-transformers, spaCy, NLTK, or equivalents.
* Hands-on experience training, fine-tuning, or adapting models for tasks including semantic similarity, text classification, and information retrieval/search.
* Knowledge of the Machine Learning model lifecycle: dataset creation and preparation (collection, cleaning, curation); comparative evaluation across model versions; and experiment versioning and reproducibility.
* Ability to collaborate closely with engineering teams to define inference APIs and translate product requirements into model-based solutions.

**Languages**

* Advanced English (conversation, reading, and writing).

**Academic Background**

* Bachelor’s degree in Computer Science, Engineering, Statistics, Mathematics, or related fields.

**Preferred Qualifications**

* Experience or knowledge of on-device model deployment (Mobile / Edge AI), including model conversion to TensorFlow Lite, ONNX Runtime Mobile, or similar.
* Experience or knowledge of compression techniques (quantization, pruning, distillation) and latency/memory optimization for mobile device execution.
* Experience integrating models with Android applications.
* Experience with hybrid search architectures, such as integrating Lucene/BM25 with semantic embeddings and strategies including vector pre-indexing, embedding caching, and re-ranking.
* Knowledge of indexing and text search tools.
* Experience monitoring production search system quality.
* Familiarity with MLOps practices applied to embedded or constrained environments.

**Additional Information**

Hybrid work opportunity in **Manaus-AM**.

**What We Expect From All #SiDiers:**

* Embrace new ideas
* Maintain open and direct communication
* Continuously develop yourself
* Always strive for excellence
* Build in partnership
* Think big, act fast

**What We Offer?**

* **Work-Life Balance:** 40-hour weekly schedule under CLT employment, flexible hours with hybrid work (4 days in-office, 1 day remote);
* **Wellness:** Gympass (Wellhub), workplace gym, quick massage, and psychological support;
* **Health:** Medical and dental insurance for you and your family;
* **Children:** Daycare allowance for miniSiDiers, 120-day maternity leave, extended paternity leave;
* **Future:** SiDi co-invests with you in private pension plans;
* **Education:** Program supporting continued studies and specialization, language fluency incentives, and weekly talks on global trending topics;
* **Meals:** Flexible meal and food allowances;
* **Transportation:** Transportation subsidy for commuting to SiDi, and parking for those working at the office;
* **Recognition:** Annual performance bonus and awards for SiDiers who achieve outstanding results;
* **Diversity:** Committees focused on Wellbeing, Diversity, Mental Health, Social Impact, Sustainability, and Women in Tech;
* **Modern Offices:** Relaxed, collaborative environments with social spaces, decompression rooms, kitchens, and coffee machines;
* **Want More?** Dozens of partnerships offering benefits and discounts!

For us, inclusion is not just about adding another person to our team; it is about embracing differences and unique perspectives. It is precisely this mix of techniques, skills, and stories that inspires us to align diverse viewpoints toward a shared horizon, always moving forward. We foster an environment of equal opportunity, regardless of gender, sexual orientation, religion, or disability. **All our positions are open to people with disabilities (PCDs).**

JOB CODE: 5879 - N3

**We are one of Brazil’s largest science and technology institutes, and with a growing team of over 800 SiDiers, we are already present in Campinas, Manaus, and Recife: the largest technology and innovation parks in Brazil.**

Because those who aspire to create projects that will transform the world cannot stop transforming themselves. In **20 years of history**, we have specialized in solving problems and **delivered over 1,100 projects**, impacting millions of lives, driving innovation, and making the future happen now.


