Highlights
- Pro
Stars
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Chronos: Pretrained Models for Time Series Forecasting
Seamlessly integrate LLMs into scikit-learn.
2025! X / Twitter API scrapper with authorization support. Allows you to scrape search results, User's profiles (followers/following), Tweets (favoriters/retweeters) and more.
A TensorFlow recommendation algorithm and framework in Python.
Monte Carlo simulation of the NBA season, leveraging dbt, duckdb and evidence.dev
A tool for customers to evaluate their AWS service configurations based on AWS and community best practices and receive recommendations on potential improvements.
🐍 The Python API to consume openrouteservice(s) painlessly!
Scripts for processing the Amazon Reviews 2023 dataset; implementations and checkpoints of BLaIR: "Bridging Language and Items for Retrieval and Recommendation".
A book on DevOps for Data Scientists with CRC Press.
An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extraction (EE) and Aspect-based Sentiment Analysis (ABSA).
A small project of scrapping data from twitter
Streamlit Application for ABC Analysis & Product Segmentation
Place 13 solution (Private LB) for the inventory demand forecasting challenge on Kaggle https://www.kaggle.com/c/grupo-bimbo-inventory-demand