Starred repositories

Awesome List for On-Policy Distillation

657 11 Updated Jun 13, 2026

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 5,699 795 Updated Aug 20, 2025

Evaluate and improve models and agents using environments

Python 991 191 Updated Jun 19, 2026

SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration

Python 172 21 Updated Jun 10, 2026

Python Framework to analyse Git repositories

Python 962 155 Updated Dec 28, 2025

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 680 120 Updated Jun 15, 2026

Commit0: Library Generation from Scratch

Python 192 20 Updated Feb 24, 2026

Official code for "How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs"

Python 24 3 Updated Feb 10, 2026

utilities for batched llm calls with retries

Python 51 2 Updated Jun 12, 2026

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 621 47 Updated Feb 15, 2026

Programming language for literate programming law specification

OCaml 2,322 102 Updated Jun 19, 2026
Python 101 16 Updated Jun 12, 2026

Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"

Python 32 2 Updated Jun 5, 2025
Python 9 1 Updated May 31, 2025

CLIPPER: Compression enables long-context synthetic data generation [COLM '25]

Python 10 1 Updated Apr 16, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,828 83 Updated May 11, 2025

Paper list for Efficient Reasoning.

889 45 Updated May 29, 2026

[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Python 132 6 Updated Jun 11, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 2,451 164 Updated Apr 6, 2026

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 19,148 1,965 Updated Apr 11, 2026

🙌 OpenHands: AI-Driven Development

Python 77,747 9,881 Updated Jun 19, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,301 8,848 Updated Jun 17, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 13,174 1,585 Updated Feb 27, 2026

Fully open reproduction of DeepSeek-R1

Python 26,329 2,444 Updated Apr 2, 2026

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,638 212 Updated Apr 20, 2026

LongBench v2 and LongBench (ACL 25'&24')

Python 1,196 136 Updated Jan 15, 2025

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 240 26 Updated Aug 2, 2024

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,511 194 Updated Nov 5, 2025

library supporting NLP and CV research on scientific papers

Python 797 65 Updated Nov 8, 2024

Python tool for converting files and office documents to Markdown.

Python 155,928 10,837 Updated May 26, 2026