-
DLRG @ SUSTech | @AncoraSpring
- Shenzhen, China
- hanxudong.cc
- in/xudong-han
Highlights
- Pro
Lists (6)
Sort Name ascending (A-Z)
Starred repositories
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
real time face swap and one-click video deepfake with only a single image
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
We write your reusable computer vision tools. 💜
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
A generative world for general-purpose robotics & embodied AI learning.
Documentation that simply works
Graph Neural Network Library for PyTorch
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
The official Python SDK for Model Context Protocol servers and clients
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Janus-Series: Unified Multimodal Understanding and Generation Models
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
Hackable and optimized Transformers building blocks, supporting a composable construction.
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
An Autonomous LLM Agent for Complex Task Solving