zinuoli

Zinuo Li zinuoli

Research Intern @ Tencent Youth Lab PhD student @ UWA

32 followers · 24 following

University of Western Australia (UWA)
https://zinuoli.github.io/

Achievements

Highlights

Lists (5)

Sort

Stars

gaobb / UniADet

Language-Free Universal Vision Anomaly Detection

166 3 Updated Feb 2, 2026

wendell0218 / Janus-Pro-R1

[NeurIPS 2025] Official repository of the paper "Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation"

Python 20 Updated Sep 27, 2025

showlab / UniRL

The code repository of UniRL

Python 52 3 Updated May 30, 2025

Tencent-Hunyuan / HunyuanWorld-Mirror

Fast and Universal 3D reconstruction model for versatile tasks

Python 1,053 102 Updated Feb 6, 2026

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 707 75 Updated Feb 18, 2026

diqiuzhuanzhuan / openllm_func_call_synthesizer

A tool for generating synthetic function call datasets for Large Language Models (LLMs).

Jupyter Notebook 2 Updated Feb 5, 2026

TencentCloudADP / youtu-agent

A simple yet powerful agent framework that delivers with open-source models

Python 4,493 463 Updated Mar 21, 2026

TencentARC / TimeLens

[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Python 119 9 Updated Mar 12, 2026

zibinpan / FedLF

Python 18 6 Updated Oct 6, 2024

zibinpan / FedMDFG

Python 20 4 Updated Feb 25, 2024

zinuoli / TriSense

[NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Python 22 1 Updated Feb 10, 2026

yunlong10 / Awesome-Video-LMM-Post-Training

🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training

Python 276 11 Updated Mar 3, 2026

Intellindust-AI-Lab / SKEL-CF

Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"

Jupyter Notebook 53 3 Updated Mar 17, 2026

weijiawu / Awesome-RL-for-Multimodal-Foundation-Models

📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.

424 20 Updated Mar 6, 2026

agentscope-ai / Trinity-RFT

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 574 59 Updated Mar 26, 2026

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,785 364 Updated Mar 26, 2026

HumanMLLM / LOVE-R1

Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"

Python 23 Updated Nov 1, 2025

THUNLP-MT / MUSEG

Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".

Python 40 1 Updated Jun 9, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,427 1,310 Updated Mar 30, 2026

InternScience / SciReason

Python 152 6 Updated Dec 26, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,581 241 Updated Jan 8, 2026