This repository is the official implementation for our paper "Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions."

Python 8 Updated Oct 30, 2025

Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.

Python 25,831 1,815 Updated Oct 13, 2025

Enjoy the magic of Diffusion models!

Python 11,193 1,057 Updated Dec 20, 2025

[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Python 74 3 Updated Jul 14, 2025

[T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier

Python 53 1 Updated Dec 30, 2024

A family of highly capable yet efficient large multimodal models

Python 191 15 Updated Aug 23, 2024

VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.

Python 237 18 Updated May 30, 2025