Skip to content
View emilyriederer's full-sized avatar

Block or report emilyriederer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Building R packages

R 934 643 Updated Dec 14, 2025

Polars extension for general data science use cases

Rust 584 41 Updated Dec 13, 2025

A reactive notebook for Python โ€” run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 17,770 831 Updated Dec 13, 2025

A curated list of Polars talks, tools, examples & articles. Contributions welcome !

1,031 44 Updated Dec 12, 2025

Repo to organize tasks for R Dev Days

R 23 8 Updated Dec 11, 2025

Words of the same length with related meanings.

Python 348 23 Updated Dec 11, 2025

Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.

Python 3,290 1,075 Updated Dec 11, 2025

๐Ÿฆ† A curated list of awesome DuckDB resources

2,180 160 Updated Dec 11, 2025

R formatter and language server

Rust 373 27 Updated Dec 10, 2025

Scikit-learn compatible decision trees beyond those offered in scikit-learn

Jupyter Notebook 85 24 Updated Dec 8, 2025

Accelerated Oblique Random Survival Forests

R 59 10 Updated Dec 8, 2025

pre-commit hooks for R projects

R 273 50 Updated Dec 8, 2025

A non-validating SQL parser module for Python

Python 3,969 717 Updated Dec 8, 2025

analyze survey data for free

R 624 449 Updated Dec 7, 2025

Learning the practice of Monte Carlo simulations in data science (Research Module in Econometrics and Statistics; Master's/PhD)

TeX 18 Updated Dec 4, 2025

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here ๐Ÿ‘‡๐Ÿผ

Jupyter Notebook 33,849 7,169 Updated Dec 3, 2025

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

1,879 147 Updated Dec 3, 2025

Turn SciKitLearn pipelines into SQL

Python 106 2 Updated Dec 2, 2025

Brushing and linking for big data

Jupyter Notebook 968 54 Updated Dec 2, 2025

Free MLOps course from DataTalks.Club

Jupyter Notebook 13,834 2,783 Updated Dec 1, 2025

Website sources for Applied Machine Learning for Tabular Data

HTML 153 15 Updated Dec 1, 2025

Run your GitHub Actions locally ๐Ÿš€

Go 67,564 1,802 Updated Dec 1, 2025

Data quality assessment and metadata reporting for data frames and database tables

R 1,016 59 Updated Nov 28, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 38,974 7,477 Updated Nov 28, 2025

A lightweight version of R Markdown (without using Pandoc or knitr)

R 234 14 Updated Nov 28, 2025

Dynamic Documents for R

R 3,002 996 Updated Nov 26, 2025

An introduction to network analysis and applied graph theory using Python and NetworkX

Jupyter Notebook 1,091 403 Updated Nov 24, 2025

An R package for working with NCAA Basketball Play-by-Play Data

R 219 54 Updated Nov 18, 2025

Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!

Shell 238 185 Updated Nov 7, 2025
Next