Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,210 8,432 Updated Mar 27, 2026

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,577 4,956 Updated Mar 29, 2026

Build, run, manage agentic software at scale.

Python 39,011 5,169 Updated Mar 29, 2026

Best Practices on Recommendation Systems

Python 21,565 3,306 Updated Mar 26, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,299 3,529 Updated Mar 28, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,795 1,702 Updated Jan 30, 2026

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

17,019 1,557 Updated Feb 13, 2023

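AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.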

Jupyter Notebook 16,531 2,346 Updated Sep 3, 2025

Elevate your AI research writing, no more tedious polishing ✨

14,556 1,135 Updated Mar 25, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,408 1,306 Updated Mar 29, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,264 906 Updated Mar 29, 2026

Open-source unified multimodal model

Python 5,780 511 Updated Oct 27, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,781 364 Updated Mar 26, 2026

Shared course materials for competitive programming

4,412 798 Updated Sep 23, 2025

Witness the aha moment of VLM with less than $3.

Python 4,046 286 Updated May 19, 2025

Notes on the course Dive into Deep Learning by Mu Li

Jupyter Notebook 3,768 598 Updated Apr 11, 2023

MiniMax-M2, a model built for Max coding & agentic workflows.

2,525 202 Updated Nov 13, 2025

Multi-agent collaboration framework

Python 1,913 275 Updated Mar 17, 2026

This repository provides a valuable reference for researchers in the field of multimodality. Start your exploration of RL-based reasoning MLLMs here!

1,388 60 Updated Feb 26, 2026

Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).

560 62 Updated Feb 23, 2025

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 427 27 Updated Feb 17, 2026

🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning

Python 334 22 Updated Jan 3, 2026

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 259 18 Updated Aug 2, 2025

[ICLR 2026] The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 172 4 Updated Jan 26, 2026

[CVPR 2026] Official code for "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 157 2 Updated Mar 19, 2026

Sparking "Thinking with Videos" via Reinforcement Learning

Python 152 6 Updated Oct 30, 2025

[AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding

Python 120 11 Updated Nov 12, 2025

[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Python 103 5 Updated Sep 19, 2025

[AAAI 2026] GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection

Python 76 2 Updated Mar 13, 2026

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

Python 67 3 Updated Dec 29, 2025