Senior Data Scientist - NLP Opportunity at CI&T
Join a global leader in digital transformation! At CI&T, we partner with the world’s most esteemed brands to create innovative digital solutions that revolutionize businesses. With a 29-year legacy of driving business impact, our team of over 6,000 professionals worldwide specializes in strategy, research, data science, design, and engineering to foster growth, enhance customer experience, and optimize operational efficiency.
General Description
CI&T is seeking experienced Data Scientists with expertise in Natural Language Processing (NLP) to lead AI initiatives in the American healthcare industry. In this role, you will drive business impact using cutting-edge AI solutions.
Responsibilities
- Conduct data exploration to validate data requirements and quality for NLP contexts.
- Perform NLP preprocessing and analysis, including tokenization and lexical, syntactic, semantic, and pragmatic analysis.
- Select the NLP models that best fit the business's expected outcomes.
- Train and validate models using metrics such as accuracy, precision, recall, F1-score, and ROUGE score.
- Document model development processes, methodologies, and results for all stakeholders.
- Implement text classification and sentiment analysis using traditional machine learning classifiers and deep learning models.
- Enhance NLP model performance through rigorous experimentation and analysis.
- Employ topic modeling techniques like LDA and NMF to discover abstract topics from text data.
- Understand and apply sequence-to-sequence models for machine translation, text summarization, and question answering tasks.
Requirements
- Strong oral and written communication skills in English.
- Experience working on international projects and as a Data Scientist on NLP projects.
- Expertise in Python, focusing on packages such as NLTK, spaCy, and Gensim.
- Experience with techniques like Topic Extraction, Summarization, Categorization, and Sentiment Analysis.
- Strong problem-solving skills and creativity in applying NLP techniques to real-world challenges.
- Awareness of ethical considerations in NLP, including bias, privacy, and societal impacts.
- Proficiency across the data science pipeline, from data gathering to deployment.
- Expertise in handling, analyzing, and visualizing large datasets using tools like SQL and Python.
Preferred Skills
- Experience with Data Augmentation.
- Familiarity with Transformers, BERT, and Named Entity Recognition (NER).
- Background in data engineering.
- Experience with MLOps and Azure services.
- Proficiency with Databricks.
- Knowledge of data protection regulations such as CCPA and HIPAA, and of handling personally identifiable information (PII).
Our Benefits
- Health and dental plans.
- Meal allowances.
- Childcare assistance.
- Extended parental leave.
- Gympass.
- Annual profit-sharing.
- Life insurance.
- Access to an online mental health platform.
- CI&T University.
- Discount club.
- Support programs: legal, financial, physiotherapy, psychological guidance, nutritionist, and more.
- Pregnancy and responsible parenthood course.
- Partnerships with online course platforms.
- Language learning platform.
- And many others.
About CI&T
CI&T is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We believe that innovation thrives in diverse, inclusive work environments where people from various backgrounds collaborate and share their perspectives.
We strongly encourage candidates from diverse and underrepresented communities to apply.
Company: CI&