Skip to content
View Ethylyikes's full-sized avatar

Block or report Ethylyikes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding

Python 108 7 Updated Nov 12, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,285 327 Updated Dec 15, 2025

Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 84 2 Updated Dec 16, 2025

【AAAI 2026】GenVidBench: A6-Million Benchmark for AI-Generated Video Detection

Python 54 2 Updated Nov 25, 2025

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)

Python 212 15 Updated Aug 2, 2025
Python 56 2 Updated Nov 10, 2025

Sparking "Thinking with Videos" via Reinforcement Learning

Python 116 3 Updated Oct 30, 2025

MiniMax-M2, a model built for Max coding & agentic workflows.

2,040 155 Updated Nov 13, 2025

The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 124 3 Updated Nov 26, 2025

Open-source unified multimodal model

Python 5,478 481 Updated Oct 27, 2025

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

Python 54 Updated Nov 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,627 2,853 Updated Dec 19, 2025

🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning

Python 296 19 Updated Oct 24, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,623 838 Updated Dec 18, 2025

Official Repository for "FakingRecipe: Detecting Fake News on Short Video Platforms from the Perspective of Creative Process", ACM MM 2024

Python 56 5 Updated Oct 5, 2025

Multi-agent collaboration framework

Python 1,771 255 Updated Dec 19, 2025

The unified stack for multi-agent systems.

Python 36,124 4,774 Updated Dec 19, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,618 4,090 Updated Dec 18, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,749 1,072 Updated Dec 19, 2025

[NeurIPS 2025] VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning

Python 59 2 Updated Oct 26, 2025

Witness the aha moment of VLM with less than $3.

Python 4,009 289 Updated May 19, 2025

Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).

548 58 Updated Feb 23, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,270 1,447 Updated Nov 28, 2025

[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Python 93 4 Updated Sep 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,231 7,787 Updated Dec 19, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,308 58 Updated Dec 7, 2025

Notes about courses Dive into Deep Learning by Mu Li

Jupyter Notebook 3,714 583 Updated Apr 11, 2023

算法竞赛课件分享

4,346 795 Updated Sep 23, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

16,274 1,507 Updated Feb 13, 2023

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,878 2,274 Updated Sep 3, 2025