Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Open-Sora: Democratizing Efficient Video Production for All
Fully open reproduction of DeepSeek-R1
Image-to-Image Translation in PyTorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
PyTorch implementations of Generative Adversarial Networks.
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
🐍 Geometric Computer Vision Library for Spatial AI
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Fast and Accurate ML in 3 Lines of Code
An elegant PyTorch deep reinforcement learning library.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
Matplotlib styles for scientific plotting
A faster pytorch implementation of faster r-cnn
Chinese version of GPT2 training code, using BERT tokenizer.
💎1MB lightweight face detection model (1MB轻量级人脸检测模型)
Minimal PyTorch implementation of YOLOv3
跨平台 Python 异步聊天机器人框架 / Asynchronous multi-platform chatbot framework written in Python
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation