A transformation pipeline for Delta Lake using AWS SDK for Pandas
-
Updated
Jul 12, 2023 - Python
A transformation pipeline for Delta Lake using AWS SDK for Pandas
A Bioinformatics demo in Python working with FASTQ files and using the Modin library
Global Markets Options Pricing
Delve deeper into data manipulation using Python's prominent libraries. Explore the functionalities of Pandas and get a glimpse of alternatives like Polars, Dask, and Modin.
HHA507 / Data Science / Assignment 2 / Data Manipulation
Polars, Pandas, Dask, and Modin
AI Starter Kit to generate structured synthetic data using Intel® Distribution of Modin
Recommendation system approaches
Um pipeline de ETL modular para automação de mailing com deduplicação, enriquecimento e otimização de performance.
A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)
Simple example on how Modin can peed up your Pandas workflows by changing a single line of code
Using the MovieLens dataset with Surprise to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it.
oneAPI Hackathon: The LLM Challenge
Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex
Add a description, image, and links to the modin topic page so that developers can more easily learn about it.
To associate your repository with the modin topic, visit your repo's landing page and select "manage topics."