-
The Hong Kong University of Science and Technology
- Hong Kong SAR
-
01:04
(UTC +08:00) - https://harahan.github.io/
- https://orcid.org/0009-0002-7898-8402
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
A unified framework for 3D content generation.
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".
ICML2025-Inductive Gradient Adjustment for Spectral Bias in Implicit Neural Representations