- The Chinese University of Hong Kong
- Hong Kong (UTC +08:00)
- https://harryhsing.github.io/
- in/xingzhenghao
- @onehsing
Stars
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
- [EMNLP'25 Oral] GLIMPSE: Do Large Vision-Language Models Truly Think With Videos or Just Glimpse at Them?
- slime is an LLM post-training framework for RL Scaling.
- A collection of awesome think-with-videos papers.
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.
- Data Pipeline, Models, and Benchmark for Omni-Captioner.
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
- Official code for the WACV 2024 paper "Annotation-free Audio-Visual Segmentation"
- Code for "AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs"
- Official repo for the paper "EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning"
- [NeurIPS 2025] PyTorch implementation of ThinkSound, a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
- Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
- Tongyi Deep Research, the leading open-source deep research agent
- A Survey of Reinforcement Learning for Large Reasoning Models
- A community-driven registry service for Model Context Protocol (MCP) servers.
- Official code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
- Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
- Official repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
- A version of verl to support diverse tool use
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
- [Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
- 📖 A repository for organizing papers, code, and other resources related to Visual Reinforcement Learning.
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"