Skip to content
View Ethylyikes's full-sized avatar

Block or report Ethylyikes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
27 stars written in Python
Clear filter

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,399 8,445 Updated Apr 1, 2026

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,969 5,005 Updated Apr 2, 2026

Build, run, manage agentic software at scale.

Python 39,121 5,194 Updated Apr 2, 2026

Best Practices on Recommendation Systems

Python 21,584 3,308 Updated Apr 2, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,396 3,558 Updated Apr 2, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,497 1,318 Updated Apr 2, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,296 910 Updated Mar 30, 2026

Open-source unified multimodal model

Python 5,782 512 Updated Oct 27, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,799 365 Updated Mar 26, 2026

Witness the aha moment of VLM with less than $3.

Python 4,045 286 Updated May 19, 2025

Multi-agent collaboration framework

Python 1,901 275 Updated Apr 2, 2026

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 432 29 Updated Feb 17, 2026

🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning

Python 335 22 Updated Jan 3, 2026

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 260 18 Updated Aug 2, 2025

[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 159 2 Updated Mar 19, 2026

Sparking "Thinking with Videos" via Reinforcement Learning

Python 152 6 Updated Oct 30, 2025

[AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding

Python 122 11 Updated Nov 12, 2025

[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Python 103 5 Updated Sep 19, 2025

【AAAI 2026】GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection

Python 76 2 Updated Mar 13, 2026

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

Python 67 3 Updated Dec 29, 2025

[NeurIPS 2025] VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning

Python 66 2 Updated Jan 6, 2026
Python 66 2 Updated Feb 1, 2026

Official Repository for "FakingRecipe: Detecting Fake News on Short Video Platforms from the Perspective of Creative Process", ACM MM 2024

Python 61 6 Updated Oct 5, 2025

the official code for Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning

Python 41 Updated Nov 26, 2025

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Python 9 Updated Sep 1, 2025

[Information Fusion] Official Implementation of DAE (Bridging Cognition and Emotion: Empathy-Driven Multimodal Misinformation Detection)

Python 6 Updated Feb 14, 2026

Official Implementation of LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning

Python 2 Updated Mar 13, 2026