Stars
Recipe2Plan: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions(EMNLP 2025 Findings)
Code for 🌍 UI-Simulator: LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training
[ACL'25] Read it in Two Steps: Translating Extremely Low-Resource Languages with Code-Augmented Grammar Books
[EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
Paper list for Efficient Reasoning.
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
A brief and partial summary of RLHF algorithms.
A curated list of papers on LLMs and agents for scientific research and development
A bibliography and survey of the papers surrounding o1
[EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science
GPT4 based personalized ArXiv paper assistant bot
Source code and data for DiNeR: a Large Realistic Dataset for Evaluating Compositional Generalization(EMNLP 2023 main conference paper)
Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly
Repository for the Paper "Multi-LoRA Composition for Image Generation"
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)
[ACL 2023] Reasoning with Language Model Prompting: A Survey
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)
Train transformer language models with reinforcement learning.
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
A library for advanced large language model reasoning