- Taipei, Taiwan
- https://www.linkedin.com/in/kent-hsu/
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
scikit-learn: machine learning in Python
Streamlit — A faster way to build and share data apps.
Federated Query Engine for AI - The only MCP Server you'll ever need
Distributed Task Queue (development branch)
《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译
Faker is a Python package that generates fake data for you.
SQL databases in Python, designed for simplicity, compatibility, and robustness.
Universal Command Line Interface for Amazon Web Services
An orchestration platform for the development, production, and observation of data assets.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Statsmodels: statistical modeling and econometrics in Python
Always know what to expect from your data.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Automated Machine Learning with scikit-learn
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…
An open source python library for automated feature engineering
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
Example projects using the AWS CDK
A generic JSON document store with sharing and synchronisation capabilities.
A developer toolkit to implement Serverless best practices and increase developer velocity.
Algorithms for explaining machine learning models