Ethylyikes

Undergraduate student at Communication University of China, and research assistant at Hong Kong University of Science and Technology (Guangzhou).

2 followers · 0 following

Stars

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,210 8,432 Updated Mar 27, 2026

unslothai / unsloth

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,577 4,956 Updated Mar 29, 2026

agno-agi / agno

Build, run, manage agentic software at scale.

Python 39,011 5,169 Updated Mar 29, 2026

recommenders-team / recommenders

Best Practices on Recommendation Systems

Python 21,565 3,306 Updated Mar 26, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,299 3,529 Updated Mar 28, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,795 1,702 Updated Jan 30, 2026

dair-ai / ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

17,019 1,557 Updated Feb 13, 2023

Infrasys-AI / AISystem

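AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.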

Jupyter Notebook 16,531 2,346 Updated Sep 3, 2025

Leey21 / awesome-ai-research-writing

Elevate your AI research writing, no more tedious polishing ✨

14,556 1,135 Updated Mar 25, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,408 1,306 Updated Mar 29, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,264 906 Updated Mar 29, 2026

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,780 511 Updated Oct 27, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,781 364 Updated Mar 26, 2026

hzwer / shareOI

Shared courseware for competitive programming

4,412 798 Updated Sep 23, 2025

StarsfieldAI / R1-V

Witness the aha moment of VLM with less than $3.

Python 4,046 286 Updated May 19, 2025

MLNLP-World / DeepLearning-MuLi-Notes

Notes about courses Dive into Deep Learning by Mu Li

Jupyter Notebook 3,768 598 Updated Apr 11, 2023

MiniMax-AI / MiniMax-M2

MiniMax-M2, a model built for Max coding & agentic workflows.

2,525 202 Updated Nov 13, 2025

jd-opensource / OxyGent

Multi-agent collaboration framework

Python 1,913 275 Updated Mar 17, 2026

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides a valuable reference for researchers in the field of multimodality. Start exploring RL-based Reasoning MLLMs here!

1,388 60 Updated Feb 26, 2026

Cartus / Automated-Fact-Checking-Resources

Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).

560 62 Updated Feb 23, 2025

NVlabs / GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 427 27 Updated Feb 17, 2026

RUC-NLPIR / Tool-Star

🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning

Python 334 22 Updated Jan 3, 2026

UMass-Embodied-AGI / Mirage

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 259 18 Updated Aug 2, 2025

ThinkMorph / ThinkMorph

[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 172 4 Updated Jan 26, 2026

NOVAglow646 / Monet

[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 157 2 Updated Mar 19, 2026

shijian2001 / Video-Thinker

Sparking "Thinking with Videos" via Reinforcement Learning

Python 152 6 Updated Oct 30, 2025

Hui-design / TSPO

[AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding

Python 120 11 Updated Nov 12, 2025

xinyan-cxy / MINT-CoT

[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Python 103 5 Updated Sep 19, 2025

genvidbench / GenVidBench

[AAAI 2026] GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection

Python 76 2 Updated Mar 13, 2026

shiwk24 / MathCanvas

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

Python 67 3 Updated Dec 29, 2025