Skip to content
View HarryHsing's full-sized avatar
🎾
TTWSYF
🎾
TTWSYF

Highlights

  • Pro

Block or report HarryHsing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
143 results for source starred repositories
Clear filter

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,783 207 Updated Feb 8, 2026
Python 94 2 Updated Feb 4, 2026

Moonshot's most powerful model

818 77 Updated Jan 31, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,418 4,357 Updated Feb 8, 2026

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,895 155 Updated Jan 22, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 7,156 885 Updated Feb 6, 2026

[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos

Python 419 20 Updated Jan 30, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,587 1,195 Updated Feb 7, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,279 277 Updated Jan 5, 2026

slime is an LLM post-training framework for RL Scaling.

Python 3,708 501 Updated Feb 5, 2026

A collection of awesome think with videos papers.

87 2 Updated Dec 1, 2025
Python 514 49 Updated Jan 28, 2026

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 208 8 Updated Oct 12, 2025

Scaling Long-Horizon LLM Agent via Context-Folding

Python 110 8 Updated Jan 26, 2026

[ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.

Python 117 Updated Oct 17, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 13,228 1,246 Updated Feb 3, 2026

Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"

Python 37 3 Updated Oct 11, 2024

Code for "AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs"

Python 22 Updated Oct 9, 2025

Official repo for paper "EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning"

Python 128 4 Updated Oct 9, 2025

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,157 66 Updated Jan 27, 2026

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,399 210 Updated Jan 8, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,081 501 Updated Feb 7, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,186 1,404 Updated Feb 7, 2026

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,321 129 Updated Nov 9, 2025

A community driven registry service for Model Context Protocol (MCP) servers.

Go 6,380 588 Updated Feb 6, 2026

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 401 15 Updated Jan 29, 2026

[ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Python 423 31 Updated Jan 28, 2026

Code that accompanies the public release of the paper Lost in Conversation (https://arxiv.org/abs/2505.06120)

Python 206 16 Updated Jun 23, 2025

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 513 38 Updated Nov 11, 2025

A version of verl to support diverse tool use

Python 862 73 Updated Jan 6, 2026
Next