Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
Command-line program to download videos from YouTube.com and other video sites
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Streamlit — A faster way to build and share data apps.
A collection of design patterns/idioms in Python
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
OpenMMLab Detection Toolbox and Benchmark
A book-in-progress about the Linux kernel and its insides.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Graph Neural Network Library for PyTorch
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Chinese translation of "Designing Data-Intensive Applications" (DDIA), first and second editions
Machine Learning Engineering Open Book
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
🐍 Geometric Computer Vision Library for Spatial AI