Skip to content
View zinuoli's full-sized avatar

Highlights

  • Pro

Block or report zinuoli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Language-Free Universal Vision Anomaly Detection

165 3 Updated Feb 2, 2026

[NeurIPS 2025] Official repository of the paper "Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation"

Python 19 Updated Sep 27, 2025

The code repository of UniRL

Python 52 3 Updated May 30, 2025

Fast and Universal 3D reconstruction model for versatile tasks

Python 1,043 100 Updated Feb 6, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 689 65 Updated Feb 18, 2026

A tool for generating synthetic function call datasets for Large Language Models (LLMs).

Jupyter Notebook 2 Updated Feb 5, 2026

A simple yet powerful agent framework that delivers with open-source models

Python 4,483 460 Updated Mar 21, 2026

[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Python 118 9 Updated Mar 12, 2026
Python 18 6 Updated Oct 6, 2024
Python 20 4 Updated Feb 25, 2024

[NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Python 22 1 Updated Feb 10, 2026

🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training

Python 274 11 Updated Mar 3, 2026

Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"

Jupyter Notebook 53 3 Updated Mar 17, 2026

📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.

423 20 Updated Mar 6, 2026

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 568 57 Updated Mar 26, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,774 362 Updated Mar 10, 2026

Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"

Python 23 Updated Nov 1, 2025

Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".

Python 39 1 Updated Jun 9, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,361 1,297 Updated Mar 26, 2026
Python 163 7 Updated Dec 26, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,557 239 Updated Jan 8, 2026
Python 1,291 107 Updated Feb 12, 2026

The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"

Python 88 1 Updated Oct 15, 2025

Pixel-Level Reasoning Model trained with RL [NeuIPS25]

Python 284 11 Updated Nov 6, 2025

CS336 作业 5 实现, 附加作业里面的 dpo/rlhf 也完成了, 消融实验分析也放在飞书文档里面了, 仅供参考

Python 27 1 Updated Sep 27, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,300 371 Updated Nov 13, 2025

[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Python 371 22 Updated Jan 12, 2026

Structured Video Comprehension of Real-World Shorts

Python 234 7 Updated Sep 21, 2025

A version of verl to support diverse tool use

Python 928 78 Updated Mar 2, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,620 465 Updated Feb 10, 2026
Next