The Hong Kong University of Science and Technology
Lists (14)
3⃣️ 3D/4D Vision
🚀Adversarial Attack: Adversarial attack resources.
🧗 Embodied AI: A list for Embodied AI.
🌟Federated Learning: A repository list for federated learning algorithms.
👀General deep learning: A general deep learning list covering GAN, knowledge distillation, computer vision, NLP, etc.
🧐🧐🧐General research and writing: A list of general research methods, writing skills, and information helpers!
🤩Interesting computer works: A repository of interesting computer works, such as obtaining information from websites, API usage (ChatGPT, etc.), and secret computer techniques.
job job job
💥💥💥LLMs
🔥🔥🔥Multi modal and diffusion: A repository for multi-modal and diffusion models.
🌛Privacy attack and defense: Learning resources for privacy attack and defense, such as MIA and gradient inversion.
🤔Reinforcement learning: A list of reinforcement learning resources.
🧠Thinking and working: A list of findings in computer science, math, reading, work, etc.
🛸🛸🛸 World Model
Stars
A feed-forward 3D foundation model for reconstructing scenes from streaming data
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
🔥 A continuously updated collection of papers, datasets, and benchmarks on post-training and alignment for video generation.
Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"
Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.
"AI-Trader: 100% Fully-Automated Agent-Native Trading"
Scans academic papers for hallucinated citations and converts second-hand citations to their official versions.
MegaFlow: Zero-Shot Large Displacement Optical Flow
Unified Codebase for Advanced World Models.
Our method reconstructs 3D worlds from video diffusion models using non-rigid alignment to resolve inherent 3D inconsistencies in the generated sequences.
Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
Helios: Real Real-Time Long Video Generation Model
AI agents running research on single-GPU nanochat training automatically
Real-Time Physical Action-Conditioned Video Generation
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
High-performance safetensors model loader
Official Codebase for "DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos"
TransNet V2: Shot Boundary Detection Neural Network
A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, a…
PISCO: Precise Video Instance Insertion with Sparse Control
🎥 Python and OpenCV-based scene cut/transition detection program & library.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Consistent Autoregressive Video Generation with Long Context
Official Implementation of iMF https://arxiv.org/abs/2512.02012