Starred repositories

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,641 838 Updated Dec 18, 2025

OO for LLMs

Python 883 70 Updated Dec 18, 2025

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

430 23 Updated Mar 8, 2025

The Abstraction and Reasoning Corpus

JavaScript 4,677 700 Updated Apr 4, 2025

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

699 25 Updated Dec 8, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,038 1,096 Updated Dec 22, 2025

An MBTI Exploration of Large Language Models

Python 514 23 Updated Feb 2, 2024

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Python 217 23 Updated Feb 21, 2023

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Python 1,394 278 Updated May 4, 2020

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Python 448 66 Updated Nov 7, 2025

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python 603 44 Updated Oct 29, 2024
JavaScript 12 4 Updated Jul 27, 2023

Robust recipes to align language models with human and AI preferences

Python 5,454 468 Updated Sep 8, 2025

Paper List of Pre-trained Foundation Recommender Models

366 27 Updated Aug 12, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,838 583 Updated May 3, 2024

DeepSeek Coder: Let the Code Write Itself

Python 22,533 2,687 Updated Nov 11, 2025

SAIL: Search Augmented Instruction Learning

Python 158 15 Updated Jul 22, 2025

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

Python 115 10 Updated Oct 23, 2023

ChatArena (or Chat Arena) provides multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

Python 1,524 148 Updated Aug 11, 2025
Jupyter Notebook 6 1 Updated Jan 24, 2023

Reverse Instructions to generate instruction tuning data with corpus examples

216 9 Updated Mar 5, 2024

A collection of awesome-prompt-datasets and awesome-instruction-datasets for training ChatLLM models such as ChatGPT. It gathers a wide variety of instruction datasets for training ChatLLM models.

714 37 Updated Apr 7, 2024

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Python 504 49 Updated Oct 9, 2024

A programming framework for agentic AI

Python 52,757 8,019 Updated Oct 8, 2025

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 733 43 Updated Apr 10, 2024

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,585 101 Updated Jun 3, 2025
Python 130 6 Updated Jul 8, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,816 233 Updated Aug 11, 2024

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

Python 323 29 Updated Oct 22, 2024