Skip to content
View wduo's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report wduo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Application Framework for AI Engineering

Java 8,266 2,377 Updated Mar 20, 2026

Agentic AI Framework for Java Developers

Java 8,857 1,951 Updated Mar 21, 2026

A construction kit for reinforcement learning environment management.

Python 388 51 Updated Mar 22, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,998 245 Updated Mar 22, 2026

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 79 3 Updated Jul 18, 2025

RewardBench: the first evaluation tool for reward models.

Python 705 98 Updated Feb 16, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,278 368 Updated Nov 13, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,442 2,113 Updated May 19, 2025

the official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"

Python 882 37 Updated Oct 26, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,279 1,288 Updated Mar 22, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,735 1,683 Updated Jan 30, 2026

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,762 81 Updated May 11, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,979 1,939 Updated Jan 9, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,111 3,478 Updated Mar 22, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 24,879 4,944 Updated Mar 22, 2026

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,607 1,935 Updated Sep 8, 2025
Jupyter Notebook 2,756 365 Updated May 2, 2025

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Python 94 9 Updated Nov 25, 2024

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,219 902 Updated Mar 20, 2026

Git extension for versioning large files

Go 14,156 2,194 Updated Mar 17, 2026

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 963 78 Updated Oct 18, 2024

PaddleNLP UIE模型的PyTorch版实现

Python 685 120 Updated Aug 13, 2023

超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题

Python 131 32 Updated Oct 9, 2021

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Python 641 101 Updated Oct 19, 2021

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

1,057 62 Updated Nov 18, 2024

Neo4j graph construction from unstructured data using LLMs

Jupyter Notebook 4,536 783 Updated Mar 20, 2026

Modeling, training, eval, and inference code for OLMo

Python 6,412 722 Updated Nov 24, 2025

RoBERTa中文预训练模型: RoBERTa for Chinese

Python 2,773 408 Updated Jul 22, 2024

Open source Python library for converting PDF to DOCX.

Python 3,347 476 Updated Mar 9, 2026
Next