Yapei Chang lilakk

✨ PhD student at University of Maryland, College Park

35 followers · 6 following

Starred repositories

thinkwee / AwesomeOPD

Awesome List for On-Policy Distillation

657 stars · 11 forks · Updated Jun 13, 2026

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python · 5,699 stars · 795 forks · Updated Aug 20, 2025

NVIDIA-NeMo / Gym

Evaluate and improve models and agents using environments

Python · 991 stars · 191 forks · Updated Jun 19, 2026

SKYLENAGE-AI / SWE-CI

SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration

Python · 172 stars · 21 forks · Updated Jun 10, 2026

ishepard / pydriller

Python Framework to analyse Git repositories

Python · 962 stars · 155 forks · Updated Dec 28, 2025

SWE-bench / SWE-smith

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python · 680 stars · 120 forks · Updated Jun 15, 2026

commit-0 / commit0

Commit0: Library Generation from Scratch

Python · 192 stars · 20 forks · Updated Feb 24, 2026

lilakk / how2everything

Official code for "How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs"

Python · 24 stars · 3 forks · Updated Feb 10, 2026

taylorai / lm-deluge

Utilities for batched LLM calls with retries

Python · 51 stars · 2 forks · Updated Jun 12, 2026

test-time-training / e2e

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python · 621 stars · 47 forks · Updated Feb 15, 2026

CatalaLang / catala

Programming language for literate programming law specification

OCaml · 2,322 stars · 102 forks · Updated Jun 19, 2026

allenai / infinigram-api

Python · 101 stars · 16 forks · Updated Jun 12, 2026

lilakk / BLEUBERI

Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"

Python · 32 stars · 2 forks · Updated Jun 5, 2025

SimengSun / L0-reasoning-bench

Python · 9 stars · 1 fork · Updated May 31, 2025

chtmp223 / CLIPPER

CLIPPER: Compression enables long-context synthetic data generation [COLM '25]

Python · 10 stars · 1 fork · Updated Apr 16, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python · 1,828 stars · 83 forks · Updated May 11, 2025

hemingkx / Awesome-Efficient-Reasoning

Paper list for Efficient Reasoning.

889 stars · 45 forks · Updated May 29, 2026

THU-KEG / Agentic-Reward-Modeling

[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Python · 132 stars · 6 forks · Updated Jun 11, 2025

mbzuai-oryx / Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

Python · 2,451 stars · 164 forks · Updated Apr 6, 2026

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript · 19,148 stars · 1,965 forks · Updated Apr 11, 2026

OpenHands / OpenHands

🙌 OpenHands: AI-Driven Development

Python · 77,747 stars · 9,881 forks · Updated Jun 19, 2026

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · 72,301 stars · 8,848 forks · Updated Jun 17, 2026

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python · 13,174 stars · 1,585 forks · Updated Feb 27, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python · 26,329 stars · 2,444 forks · Updated Apr 2, 2026

atfortes / Awesome-LLM-Reasoning

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,638 stars · 212 forks · Updated Apr 20, 2026

THUDM / LongBench

LongBench v2 and LongBench (ACL '25 & '24)

Python · 1,196 stars · 136 forks · Updated Jan 15, 2025

nightdessert / Retrieval_Head

Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"

Python · 240 stars · 26 forks · Updated Aug 2, 2024

allenai / dolma

Data and tools for generating and inspecting OLMo pre-training data.

Python · 1,511 stars · 194 forks · Updated Nov 5, 2025

allenai / papermage

Library supporting NLP and CV research on scientific papers

Python · 797 stars · 65 forks · Updated Nov 8, 2024

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python · 155,928 stars · 10,837 forks · Updated May 26, 2026

Starred topics

Natural language processing

Machine learning

Deep learning