Skip to content

juniorcl/juniorcl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 

Repository files navigation

👋 Hi, I'm Clébio Júnior

Data Scientist with over 4 years of experience applying machine learning techniques and data analysis to solve real-world business problems with measurable impact.

Throughout my career, I worked on projects involving the extraction and processing of structured and unstructured data, predictive modeling for credit scoring and sales forecasting, anomaly detection, and customer behavior analysis — always with a focus on turning data into actionable insights.

I have hands-on experience with libraries such as scikit-learn, spaCy, pdfplumber, pytesseract, and pandas, as well as techniques like NLP, clustering, and supervised learning.

Get my resume

Currículo em Português Resume in English

Connect with me

Linkedin Badge Medium Badge Kaggle Badge Gmail Badge DEV Badge  

👨‍💻 About me

  • 🎓 Master's degree in Natural Sciences (UENF) and Bachelor's degree in Physics (IFF)
  • 📊 Experienced in predictive modeling, clustering, and NLP
  • 🛠️ Skilled with Python, scikit-learn, spaCy, pandas, pdfplumber, pytesseract
  • 🖥️ Hands-on experience with unstructured data extraction and interactive dashboards in SAS
  • ✍️ I share technical content on Medium

💼 Professional Experience

Data Scientist | Vert Analytics (Oct/2024 – Present, Remote)

  • Developed solutions for unstructured data extraction (PDFs and images) using pdfplumber, Tesseract OCR, regex.
  • Built interactive SAS dashboards for time series analysis and anomaly detection.
  • Applied NLP (spaCy) to analyze social media comments, identifying customer concerns and dissatisfaction patterns.

Data Scientist | Datarisk (Jan/2022 – Aug/2024, Remote)

  • Built credit scoring models, sales forecasting, and customer segmentation using machine learning and clustering.
  • Developed predictive models for customer behavior (default risk, plan upgrade likelihood, job instability).
  • Delivered insights supporting strategic decision-making and risk reduction.

Data Scientist | Be.X! (Mar/2021 – Jan/2022, Remote)

  • Processed structured and unstructured data using regex and data cleaning techniques.
  • Implemented outlier detection algorithms based on business rules for risk mitigation.
  • Created ML models for delivery delay prediction, improving logistics operations.

📊 GitHub Stats

GitHub Stats
Top Langs

About

Profile's readme

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published