-
Alibaba, Tongyi Lab
- Hangzhou, Zhejiang, China
- https://scholar.google.com/citations?user=x2NItzgAAAAJ
Stars
Wan: Open and Advanced Large-Scale Video Generative Models
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
Wan: Open and Advanced Large-Scale Video Generative Models
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
[CVPR 2024 Highlight] Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
Proof-of-concept implementation of a Partial-Video-Copy-Detector implemented in Python (and some C++)
Learning to align and match videos with kernelized temporal layers
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
[arXiv 2019] "Contrastive Multiview Coding", also contains implementations for MoCo and InstDis
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
A best practice for tensorflow project template architecture.
bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目
📡 Simple and ready-to-use tutorials for TensorFlow
TensorFlow code and pre-trained models for BERT
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
😎 face releated algorithm, dataset and paper
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
100-Days-Of-ML-Code中文版
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation