TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25

Python 95 9 Updated Jun 16, 2025
Python 53 5 Updated Aug 24, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,967 326 Updated Jan 14, 2026

[COLING 2025] Automated Molecular Concept Generation and Labeling with Large Language Models

Python 3 Updated Dec 29, 2024

Source code of Multi-Token Assisted Decoding

Python 11 Updated Apr 11, 2025

Official codebase for the Scattered Forest Search: Smarter Code Space Exploration and Inference Scaling with LLMs

Jupyter Notebook 10 1 Updated Feb 20, 2025

DataSciBench: An LLM Agent Benchmark for Data Science (Findings of ACL 2026)

Python 62 8 Updated Jan 21, 2026

RL Scaling and Test-Time Scaling (ICML'25)

116 1 Updated Jan 23, 2025

The website of paper "Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search"

JavaScript 3 1 Updated Apr 10, 2025

Repository for Data Distillation for Offline Reinforcement Learning

Python 9 Updated Aug 2, 2024

Sci-BeRT model for paper reference source tracing. Submission for 2024 PST-KDD Cup.

Jupyter Notebook 3 Updated Jun 15, 2024

Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"

Python 107 23 Updated Jul 2, 2024

LLM101n: Let's build a Storyteller

37,337 2,050 Updated Aug 1, 2024

Course project for CS 145 - KDD 2024 AQA Challenge

Python 2 1 Updated Jun 13, 2024
Python 1 Updated Jun 12, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 706 49 Updated Jan 20, 2025

The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"

Jupyter Notebook 18 2 Updated Aug 13, 2024

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 68 4 Updated May 31, 2024

Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering

Jupyter Notebook 55 10 Updated Nov 13, 2024

The official Meta Llama 3 GitHub site

Python 29,288 3,528 Updated Jan 26, 2025

🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"

Python 118 18 Updated Apr 6, 2025

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 7 4 Updated Oct 26, 2023

Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).

Python 88 8 Updated Aug 12, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,245 105 Updated May 8, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,887 236 Updated Aug 11, 2024

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)

Python 88 11 Updated Feb 25, 2024

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 32 1 Updated Dec 13, 2023

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,504 263 Updated Feb 8, 2026

This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"

Python 153 16 Updated May 30, 2025

Learning to Group Auxiliary Datasets for Molecule (NeurIPS 2023)

Python 18 Updated Dec 19, 2023