-
Alibaba
- Hangzhou
- https://daizuozhuo.github.io
Lists (1)
Sort Name ascending (A-Z)
Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Robust Speech Recognition via Large-Scale Weak Supervision
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
The simplest, fastest repository for training/finetuning medium-sized GPTs.
OpenMMLab Detection Toolbox and Benchmark
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Static site generator that supports Markdown and reST syntax. Powered by Python.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A PyTorch implementation of EfficientNet
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Using the jedi autocompletion library for VIM.
Torchreid: Deep learning person re-identification in PyTorch.
Aligning pretrained language models with instruction data generated by themselves.
Pre-trained Deep Learning models and demos (high quality and extremely fast)
🎥 Python and OpenCV-based scene cut/transition detection program & library.
SOTA Re-identification Methods and Toolbox
Python library for loading and using triangular meshes.
Pytorch framework for doing deep learning on point clouds.
Unofficial implemention of lanenet model for real time lane detection
VideoSys: An easy and efficient system for video generation
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"