Self paced project
-
Updated
Mar 1, 2025
Self paced project
A repo of reusable functions for cleansing data
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
Wrangler Transform: A DMD system for transforming Big Data
Java DSL for (online) deduplication
Ashley Bythell - Python
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
The website is now described as an educational resource for data management, with the objective of educating, engaging, guiding, and providing resources.
A domain-specific probabilistic programming language for scalable Bayesian data cleaning
Data Cleaning, Exploration, and Insights
Two Mixed Integer Programs for cleaning a data file.
This project explores the relationships in between different vaccines and the sex, age and other basic features in the data.
Analysis of songs from the period 18 October 2024 to 1 May 2024 from Spotify data.
Google Data Analytics Professional Certificate program instructs on how to clean and organize data for analysis, and complete analysis and calculations using spreadsheets, SQL, Tableau and R programming.
Python package to make URL extraction, generalization, validation, and filtration easy.
Data Science Foundations I | Exploratory Data Analysis in Python | Inspect, Clean, and Validate a Dataset | EDA: Inspect, Clean, and Validate a Dataset
Main Repository
Used SQL, Power BI to make insightful dashboard
This repository houses a curated collection of projects designed to highlight my expertise in data analytics.
Add a description, image, and links to the data-cleansing topic page so that developers can more easily learn about it.
To associate your repository with the data-cleansing topic, visit your repo's landing page and select "manage topics."