Skip to content
View yaqingwang's full-sized avatar
😀
Focusing
😀
Focusing

Block or report yaqingwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,946 873 Updated Feb 3, 2026

OO for LLMs

Python 892 71 Updated Feb 4, 2026

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

433 23 Updated Mar 8, 2025

The Abstraction and Reasoning Corpus

JavaScript 4,717 700 Updated Apr 4, 2025

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

729 27 Updated Dec 8, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,314 1,110 Updated Jan 27, 2026

An MBTI Exploration of Large Language Models

Python 525 24 Updated Feb 2, 2024

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Python 220 23 Updated Feb 21, 2023

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Python 1,397 279 Updated May 4, 2020

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Python 469 71 Updated Nov 7, 2025

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python 609 46 Updated Oct 29, 2024
JavaScript 12 3 Updated Jul 27, 2023

Robust recipes to align language models with human and AI preferences

Python 5,489 467 Updated Sep 8, 2025

Paper List of Pre-trained Foundation Recommender Models

366 27 Updated Aug 12, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,887 596 Updated May 3, 2024

DeepSeek Coder: Let the Code Write Itself

Python 22,738 2,728 Updated Nov 11, 2025

SAIL: Search Augmented Instruction Learning

Python 158 15 Updated Jul 22, 2025

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

Python 116 10 Updated Oct 23, 2023

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1,534 148 Updated Aug 11, 2025
Jupyter Notebook 6 1 Updated Jan 24, 2023

Reverse Instructions to generate instruction tuning data with corpus examples

216 9 Updated Mar 5, 2024

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

721 37 Updated Apr 7, 2024

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Python 509 49 Updated Oct 9, 2024

A programming framework for agentic AI

Python 54,287 8,179 Updated Jan 22, 2026

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 737 44 Updated Apr 10, 2024

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,593 99 Updated Jun 3, 2025
Python 130 6 Updated Jul 8, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,847 234 Updated Aug 11, 2024

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

Python 324 30 Updated Oct 22, 2024
Next