Skip to content
View KentHsu's full-sized avatar

Block or report KentHsu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
44 stars written in Python
Clear filter

A collective list of free APIs

Python 398,202 42,601 Updated Nov 4, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 335,236 54,398 Updated Nov 3, 2025

scikit-learn: machine learning in Python

Python 65,022 26,688 Updated Feb 12, 2026

Streamlit — A faster way to build and share data apps.

Python 43,477 4,089 Updated Feb 13, 2026

The uncompromising Python code formatter

Python 41,384 2,723 Updated Feb 13, 2026

Federated Query Engine for AI - The only MCP Server you'll ever need

Python 38,450 6,102 Updated Feb 12, 2026

Distributed Task Queue (development branch)

Python 28,011 4,947 Updated Feb 11, 2026

《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译

Python 22,611 4,492 Updated Feb 11, 2026

Faker is a Python package that generates fake data for you.

Python 19,080 2,045 Updated Feb 6, 2026

SQL databases in Python, designed for simplicity, compatibility, and robustness.

Python 17,630 801 Updated Feb 11, 2026

Universal Command Line Interface for Amazon Web Services

Python 16,730 4,462 Updated Feb 12, 2026

🦉 Data Versioning and ML Experiments

Python 15,360 1,279 Updated Feb 11, 2026

The Data Engineering Cookbook

Python 14,948 2,690 Updated Jan 17, 2026

An orchestration platform for the development, production, and observation of data assets.

Python 14,943 1,975 Updated Feb 13, 2026

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 12,238 2,269 Updated Feb 13, 2026

Statsmodels: statistical modeling and econometrics in Python

Python 11,238 3,321 Updated Jan 13, 2026

Always know what to expect from your data.

Python 11,142 1,676 Updated Feb 13, 2026

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 10,761 993 Updated Feb 12, 2026

Simple job queues for Python

Python 10,578 1,457 Updated Jan 31, 2026

A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

Python 9,670 2,623 Updated Jan 21, 2026

Automated Machine Learning with scikit-learn

Python 8,053 1,318 Updated Jan 20, 2026

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…

Python 7,955 1,004 Updated Feb 13, 2026

An open source python library for automated feature engineering

Python 7,610 909 Updated Feb 3, 2026

Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.

Python 7,033 2,760 Updated Feb 13, 2026

Python Stream Processing

Python 6,837 536 Updated Jul 27, 2024

The Open Source Feature Store for AI/ML

Python 6,705 1,218 Updated Feb 13, 2026

Example projects using the AWS CDK

Python 5,555 2,445 Updated Feb 2, 2026

A generic JSON document store with sharing and synchronisation capabilities.

Python 4,419 425 Updated Feb 11, 2026

A developer toolkit to implement Serverless best practices and increase developer velocity.

Python 3,226 467 Updated Feb 11, 2026

Algorithms for explaining machine learning models

Python 2,608 265 Updated Oct 17, 2025
Next