Skip to content
#

data-quality

Here are 11 public repositories matching this topic...

Python package for exploratory data analysis providing statistical summaries, data quality checks, outlier detection and batch visualization functions. Supports Jupyter notebooks and terminal environments.

  • Updated Oct 10, 2025
  • Python

This GitHub repository hosts the notebooks and tools developed as part of this thesis to automate the extraction, processing, and analysis of data from the MICCAI 2023 conference, aiding in the systematic review and providing a structured foundation for further research in this crucial area.

  • Updated May 15, 2024
  • Jupyter Notebook
toulouse-biblio-chronicle

Snapshot of Toulouse public library customer habits (Médiathèque José Cabanis). Cleaning messy datasets of musical, cinematic, and literary checkouts; includes data-cleaning steps, analysis notebook revealing cultural tastes in the Pink City.

  • Updated Oct 23, 2025
  • Jupyter Notebook

A complete, end-to-end modernisation of a legacy greenhouse labour tracking system. This project includes reproducible data cleaning pipelines, exploratory data analysis, feature engineering, machine learning modelling, and reporting—implemented using Python, Jupyter notebooks, and a modular src/ package structure.

  • Updated Nov 23, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-quality topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-quality topic, visit your repo's landing page and select "manage topics."

Learn more