🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,101 31,502 Updated Dec 20, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of performing real-time speech generation.

Jupyter Notebook 3,854 303 Updated Jun 12, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,029 1,095 Updated Dec 12, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,758 1,071 Updated Dec 21, 2025

Liuziyu77 / Visual-RFT

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,286 103 Updated Oct 29, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 620 23 Updated Mar 18, 2025

zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

Python 1,297 73 Updated Jun 8, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 51,384 8,964 Updated Nov 17, 2025

aburkov / theLMbook

This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov

Jupyter Notebook 2,052 341 Updated Dec 15, 2025

MatthewCYM / VoiceBench

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 310 19 Updated Dec 11, 2025

roudimit / whisper-flamingo

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 197 14 Updated Jul 29, 2025

luhengshiwo / LLMForEverybody

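LLM knowledge sharing that everyone can understand: a must-read before LLM interviews in the spring and autumn recruiting seasons, so you can talk confidently with your interviewer.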

Jupyter Notebook 4,951 488 Updated Oct 13, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,819 1,034 Updated Dec 5, 2025

Alex-Songs

Stars

meituan-longcat / LongCat-Flash-Omni

meituan-longcat / LongCat-Video

meituan-longcat / vitabench

meituan-longcat / LongCat-Flash-Thinking

meituan-longcat / LongCat-Audio-Codec

meituan-longcat / LongCat-Flash-Chat

boson-ai / higgs-audio

alibaba / ROLL

yaolinli / TimeChat-Online

THUNLP-MT / StreamingBench

kyutai-labs / delayed-streams-modeling

ictnlp / Stream-Omni

AudioLLMs / Awesome-Audio-LLM

hiyouga / EasyR1

VectorSpaceLab / OmniGen2

simplescaling / s1

soham97 / mellow

huggingface / transformers