Duo Wang wduo

🎯

Focusing

A wandering machine learning researcher, bouncing between groups. I want to understand things clearly, and explain them well. - Colah

29 followers · 37 following

Pretending in Hangzhou Creative Culture Company (PH3C)
Beijing (wangduo.cnblogs.com)
zhihu.com/people/wangduo2014

Achievements

Stars

spring-projects / spring-ai

An Application Framework for AI Engineering

Java 8,298 2,390 Updated Mar 20, 2026

alibaba / spring-ai-alibaba

Agentic AI Framework for Java Developers

Java 8,917 1,969 Updated Mar 21, 2026

alibaba / ROCK

A construction kit for reinforcement learning environment management.

Python 395 50 Updated Mar 25, 2026

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,012 248 Updated Mar 25, 2026

THU-KEG / RM-Bench

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 79 3 Updated Jul 18, 2025

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 706 98 Updated Feb 16, 2026

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,297 371 Updated Nov 13, 2025

HqWu-HITCS / Awesome-Chinese-LLM

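A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are inexpensive to train, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.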

22,468 2,114 Updated May 19, 2025

tangqiaoyu / ToolAlpaca

the official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"

Python 883 37 Updated Oct 26, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,346 1,296 Updated Mar 25, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,759 1,692 Updated Jan 30, 2026

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,764 81 Updated May 11, 2025

deepseek-ai / DeepSeek-R1

91,980 11,750 Updated Jun 27, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,000 1,943 Updated Jan 9, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,189 3,496 Updated Mar 25, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,000 4,988 Updated Mar 25, 2026

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,625 1,933 Updated Sep 8, 2025

mshumer / OpenDeepResearcher

Jupyter Notebook 2,760 364 Updated May 2, 2025

matthewrenze / self-reflection

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Python 94 9 Updated Nov 25, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,241 903 Updated Mar 25, 2026

git-lfs / git-lfs

Git extension for versioning large files

Go 14,159 2,195 Updated Mar 17, 2026

alibaba / animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 964 78 Updated Oct 18, 2024

HUSTAI / uie_pytorch

PyTorch implementation of the PaddleNLP UIE model

Python 686 122 Updated Aug 13, 2023

qingyujean / document-level-classification

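Classification of very long texts (over 1,000 characters); document-level/discourse-level text classification; mainly addresses the long-range dependency problem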

Python 131 32 Updated Oct 9, 2021

xuyige / BERT4doc-Classification

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Python 641 101 Updated Oct 19, 2021

quqxui / Awesome-LLM4IE-Papers

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

1,057 62 Updated Nov 18, 2024

neo4j-labs / llm-graph-builder

Neo4j graph construction from unstructured data using LLMs

Jupyter Notebook 4,544 784 Updated Mar 23, 2026

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 6,419 724 Updated Nov 24, 2025

brightmart / roberta_zh

Chinese pre-trained RoBERTa models: RoBERTa for Chinese

Python 2,773 408 Updated Jul 22, 2024

ArtifexSoftware / pdf2docx

Open source Python library for converting PDF to DOCX.

Python 3,354 477 Updated Mar 9, 2026