- Tsinghua University
- Beijing, China
- https://shenzhi-wang.netlify.app/
- @ShenzhiWang_THU
- https://huggingface.co/shenzhi-wang
Stars
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025
Code for "Variational Reasoning for Language Models"
Expert Kit is an efficient foundation for Expert Parallelism (EP) for MoE model inference on heterogeneous hardware
Official Repository of Absolute Zero Reasoner
Trinity-RFT is a general-purpose, flexible, and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLMs).
QwQ is the reasoning model series developed by the Qwen team at Alibaba Cloud.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Official Repo for Open-Reasoner-Zero
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
A lightweight tool for evaluating LLMs in rule-based ways.
verl: Volcano Engine Reinforcement Learning for LLMs
Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection
Official repository of Uni-AdaFocus (TPAMI 2024).
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
A flexible and efficient training framework for large-scale alignment tasks
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
fanshiqing / grouped_gemm
Forked from tgale96/grouped_gemm. PyTorch bindings for CUTLASS grouped GEMM.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal models, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
GLM-4 series: Open Multilingual Multimodal Chat LMs (open-source multilingual multimodal dialogue models)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning in text, vision, audio, and multimodal models, for both inference and training.