-
HKUST, Guangzhou
-
06:34
(UTC -12:00) - https://owen718.github.io
- https://scholar.google.com/citations?user=1sGXZ-wAAAAJ&hl=en
-
-
-
Owen718.github.io Public
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
-
realtime-video Public
Forked from krea-ai/realtime-videoKrea Realtime 14B. An open-source realtime AI video model.
-
-
LogSNRVis Public
This repository provides stand-alone visualisation utilities for probability distributions in log-SNR (λ) space, as used by recent diffusion models such as SD3 / FLUX and Style-Friendly SNR Sampler…
-
VideoX-Fun Public
Forked from aigc-apps/VideoX-Fun📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
Python Apache License 2.0 UpdatedJun 11, 2025 -
-
GRAT Public
Forked from OliverRensu/GRATThis repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers"
Python UpdatedMay 21, 2025 -
FlexAttention-Examples Public
This repo provides several classic attention variant implementation based on FlexAttention API.
-
-
ollama-deep-researcher Public
Forked from langchain-ai/local-deep-researcherFully local web research and report writing assistant
Python UpdatedJan 25, 2025 -
picotron Public
Forked from huggingface/picotronMinimalistic 4D-parallelism distributed training framework for education purpose
Python Apache License 2.0 UpdatedDec 20, 2024 -
TeaCache Public
Forked from ali-vilab/TeaCacheTimestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Python Apache License 2.0 UpdatedDec 20, 2024 -
LongPrompt-LLamaGen Public
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompts. And it's also powered by additional prompt refining featu…
-
GPT4V-Image-Captioner Public
Forked from jiayev/GPT4V-Image-CaptionerPython GNU General Public License v3.0 UpdatedJul 18, 2024 -
Head-Detection-Yolov8 Public
This repo provides a YOLOv8 model, finely trained for detecting human heads in complex crowd scenes, with the CrowdHuman dataset serving as training data. To boost accessibility and compatibility, …
-
-
MediaCrawler Public
Forked from DSPerson/MediaCrawler小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
Python Apache License 2.0 UpdatedMar 22, 2024 -
NightHazeFormer Public
ACM MM'23 | The office repository of NightHazeFormer: Single Nighttime Haze Removal Using Prior Query Transformer
26 UpdatedDec 20, 2023 -
FiveAPlus-Network Public
BMVC'23 | FiveA+Network: You Only Need 9K Parameters for Underwater Image Enhancement
-
detr Public
Forked from facebookresearch/detrEnd-to-End Object Detection with Transformers
Python Apache License 2.0 UpdatedOct 26, 2023 -
superpoint_transformer Public
Forked from drprojects/superpoint_transformer[ICCV'23] Official PyTorch implementation of Superpoint Transformer introduced in "Efficient 3D Semantic Segmentation with Superpoint Transformer"
Python MIT License UpdatedSep 10, 2023 -
UDR-S2Former_deraining Public
Forked from Ephemeral182/UDR-S2Former_deraining[ICCV'23] Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks
-
-
OBELICS Public
Forked from huggingface/OBELICSCode used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.
Python Apache License 2.0 UpdatedAug 10, 2023 -
MedSegDiff Public
Forked from ImprintLab/MedSegDiffMedical Image Segmentation with Diffusion Model
Python UpdatedJun 12, 2023 -
-
SALN Public
ACM MM'23 | Sequential Affinity Learning for Video Restoration: Universal, Simple and SOTA Beyond
12 UpdatedMay 15, 2023 -
stable-diffusion-videos Public
Forked from nateraw/stable-diffusion-videosCreate 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
Python Apache License 2.0 UpdatedMay 7, 2023