Stars
Apache Superset is a Data Visualization and Data Exploration Platform
βοΈ Companies that don't have a broken hiring process
Python Data Science Handbook: full text in Jupyter Notebooks
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
β‘ A Fast, Extensible Progress Bar for Python and CLI
βοΈ DEPRECATED β See https://github.com/ageron/handson-ml3 or handson-mlp instead.
Data and code behind the articles and graphics at FiveThirtyEight
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
π Parameterize, execute, and analyze notebooks
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
An API Client package to access the APIs for NBA.com
A book covering the fundamentals of data visualization
πβ½ A collection of football analytics projects, data, and analysis by Edd Webster (@eddwebster), including a curated list of publicly available resources published by the football analytics community.
A collection of research papers on decision, classification and regression trees with implementations.
π©βπ« Advanced NLP with spaCy: A free online course
Enhancing {ggplot2} plots with statistical analysis ππ£
Easily generate information-rich, publication-quality tables from R
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Official Microsoft repository for SQL Server in Docker resources
Code for Tiny Python Projects (Manning, 2020, ISBN 1617297518). Learning Python through test-driven development of games and puzzles.
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
Comprehensive list of color palettes available in R β€οΈπ§‘ππππ
Pull current and historical baseball statistics using Python (Statcast, Baseball Reference, FanGraphs)