Stars
[NAACL 2024] Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
SkyRL: A Modular Full-stack RL Library for LLMs
Search-R1: An efficient, scalable RL training framework for LLMs that interleave reasoning with search-engine calls, built on veRL
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Train your Agent model via our easy and efficient framework
verl-agent is an extension of veRL for training LLM/VLM agents via RL; it is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Code & Dataset for Paper: "Better Process Supervision with Bi-directional Rewarding Signals"
Recommends new arXiv papers matching your interests daily, based on your Zotero library.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Recipes to train reward models for RLHF.
Get your documents ready for gen AI
Awesome-Paper-list: Visualization meets LLM
[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO on 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
A generative speech model for daily dialogue.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
✨✨Latest Advances on Multimodal Large Language Models
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor