- Zhejiang U. -> Tsinghua U.
- Shenzhen
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
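Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…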
Python sample codes and textbook for robotics algorithms.
Official inference repo for FLUX.1 models
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
verl: Volcano Engine Reinforcement Learning for LLMs
State-of-the-Art Text Embeddings
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
💮 Amazing QR code generator in Python (supports animated GIFs)
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
A fluent design widgets library based on C++ Qt/PyQt/PySide. Make Qt Great Again.
A Collection of Variational Autoencoders (VAE) in PyTorch.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Practice on cifar100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | DouDizhu AI
CUDA accelerated rasterization of gaussian splatting
Python package for the evaluation of odometry and SLAM
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Vector (and Scalar) Quantization, in PyTorch
MAGI-1: Autoregressive Video Generation at Scale
Learning in infinite dimension with neural operators.
[PyTorch] Easy-to-use, modular, and extensible package of deep-learning-based CTR models.
Efficient vision foundation models for high-resolution generation and perception.
The devkit of the nuScenes dataset.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation