Yaqing Wang yaqingwang

😀

Focusing

Research Scientist @google-deepmind, PhD @Purdue, #DataMining, #NLP, #AI, #Efficiency, @MSFTResearch, @Amazon.

58 followers · 71 following

Starred repositories

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python · 8,641 stars · 838 forks · Updated Dec 18, 2025

google / langfun

OO for LLMs

Python · 883 stars · 70 forks · Updated Dec 18, 2025

facebookresearch / MovieGenBench

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

430 stars · 23 forks · Updated Mar 8, 2025

fchollet / ARC-AGI

The Abstraction and Reasoning Corpus

JavaScript · 4,677 stars · 700 forks · Updated Apr 4, 2025

MME-Benchmarks / Video-MME

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

699 stars · 25 forks · Updated Dec 8, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,038 stars · 1,096 forks · Updated Dec 22, 2025

nuster1128 / LLM_Agent_Memory_Survey

449 stars · 9 forks · Updated Jul 28, 2025

PKU-YuanGroup / Machine-Mindset

An MBTI Exploration of Large Language Models

Python · 514 stars · 23 forks · Updated Feb 2, 2024

ezelikman / STaR

Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Python · 217 stars · 23 forks · Updated Feb 21, 2023

facebookresearch / end-to-end-negotiator

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Python · 1,394 stars · 278 forks · Updated May 4, 2020

OSU-NLP-Group / TravelPlanner

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Python · 448 stars · 66 forks · Updated Nov 7, 2025

choosewhatulike / trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python · 603 stars · 44 forks · Updated Oct 29, 2024

juliamarkel / GPTeach

JavaScript · 12 stars · 4 forks · Updated Jul 27, 2023

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python · 5,454 stars · 468 forks · Updated Sep 8, 2025

westlake-repl / Recommendation-Systems-without-Explicit-ID-Features-A-Literature-Review

Paper List of Pre-trained Foundation Recommender Models

366 stars · 27 forks · Updated Aug 12, 2024

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python · 8,838 stars · 583 forks · Updated May 3, 2024

deepseek-ai / DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Python · 22,533 stars · 2,687 forks · Updated Nov 11, 2025

luohongyin / SAIL

SAIL: Search Augmented Instruction Learning

Python · 158 stars · 15 forks · Updated Jul 22, 2025

joeljang / RLPHF

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

Python · 115 stars · 10 forks · Updated Oct 23, 2023

Farama-Foundation / chatarena

ChatArena (or Chat Arena) is a set of multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

Python · 1,524 stars · 148 forks · Updated Aug 11, 2025