Zhejiang University
- Hangzhou, China
Stars
Wan: Open and Advanced Large-Scale Video Generative Models
Native and Compact Structured Latents for 3D Generation
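《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video breakdowns of the difficult points, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀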
Lists of company-wise questions available on LeetCode Premium. Every CSV file in the companies directory corresponds to a list of questions on LeetCode for a specific company based on the leetcode …
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
The official implementation for [NeurIPS 2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
Official inference repo for FLUX.2 models
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
A unified framework for feed-forward neural networks
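微舆: a multi-agent public opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the full picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no dependency on any framework.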
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Enjoy the magic of Diffusion models!
Fast and Universal 3D reconstruction model for versatile tasks
[arxiv 2025] Official implementation of "Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints"
[NeurIPS 2025] MV-CoLight: Efficient Object Compositing with Consistent Lighting and Shadow Generation