acbull

🦉

goo-goo-goo　

Ziniu Hu acbull

🦉

goo-goo-goo　

https://acbull.github.io/

274 followers · 19 following

Achievements

x3 x3

Achievements

x3 x3

Highlights

Stars

THUDM / TreeRL

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25

Python 95 9 Updated Jun 16, 2025

Rafa-zy / QLASS

Python 53 5 Updated Aug 24, 2025

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,967 326 Updated Jan 14, 2026

ziminz19 / AutoMolCo

[COLING 2025] Automated Molecular Concept Generation and Labeling with Large Language Models

Python 3 Updated Dec 29, 2024

ZongyueQin / MTAD

Source code of Multi-Token Assisted Decoding

Python 11 Updated Apr 11, 2025

codespace-optimization / sfs

Official codebase for the Scattered Forest Search: Smarter Code Space Exploration and Inference Scaling with LLMs

Jupyter Notebook 10 1 Updated Feb 20, 2025

THUDM / DataSciBench

DataSciBench: An LLM Agent Benchmark for Data Science (Findings of ACL 2026)

Python 62 8 Updated Jan 21, 2026

THUDM / T1

RL Scaling and Test-Time Scaling (ICML'25)

116 1 Updated Jan 23, 2025

llm-strategist / llm-strategist.github.io

The website of paper "Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search"

JavaScript 3 1 Updated Apr 10, 2025

ggflow123 / DDRL

Repository for Data Distillation for Offline Reinforcement Learning

Python 9 Updated Aug 2, 2024

wzsmith / cs145-pst

Sci-BeRT model for paper reference source tracing. Submission for 2024 PST-KDD Cup.

Jupyter Notebook 3 Updated Jun 15, 2024

yecchen / MIRAI

Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"

Python 107 23 Updated Jul 2, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

37,337 2,050 Updated Aug 1, 2024

the-catalyst / KDD_AQA

Course project for CS 145 - KDD 2024 AQA Challenge

Python 2 1 Updated Jun 13, 2024

rizvi-ha / team2_gcn

Python 1 Updated Jun 12, 2024

THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 706 49 Updated Jan 20, 2025

HenryCai11 / LLM-Self-Control

The official repo of paper "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller"

Jupyter Notebook 18 2 Updated Aug 13, 2024

yihedeng9 / STIC

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 68 4 Updated May 31, 2024

embedded-robotics / path-rag

Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering

Jupyter Notebook 55 10 Updated Nov 13, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,288 3,528 Updated Jan 26, 2025

camel-ai / agent-trust

🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"

Python 118 18 Updated Apr 6, 2025

ZongyueQin / HLSyn

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 7 4 Updated Oct 26, 2023

yjhuangcd / rule-guided-music

Official code for Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion (ICML 2024, Oral).

Python 88 8 Updated Aug 12, 2024

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,245 105 Updated May 8, 2024

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,887 236 Updated Aug 11, 2024

THUDM / SciGLM

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)

Python 88 11 Updated Feb 25, 2024

UCLA-DM / HLSyn

Forked from ZongyueQin/HLSyn

HLSyn benchmark for paper "Towards a Comprehensive Benchmark for FPGA Targeted High-Level Synthesis"

Python 32 1 Updated Dec 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ziniu Hu acbull

Achievements

Achievements

Highlights

Block or report acbull

Stars

THUDM / TreeRL

Rafa-zy / QLASS

deepseek-ai / DualPipe

ziminz19 / AutoMolCo

ZongyueQin / MTAD

codespace-optimization / sfs

THUDM / DataSciBench

THUDM / T1

llm-strategist / llm-strategist.github.io

ggflow123 / DDRL

wzsmith / cs145-pst

yecchen / MIRAI

karpathy / LLM101n

the-catalyst / KDD_AQA

rizvi-ha / team2_gcn

THUDM / ReST-MCTS

HenryCai11 / LLM-Self-Control

yihedeng9 / STIC

embedded-robotics / path-rag

meta-llama / llama3

camel-ai / agent-trust

ZongyueQin / HLSyn

yjhuangcd / rule-guided-music

uclaml / SPIN

eric-mitchell / direct-preference-optimization

THUDM / SciGLM

UCLA-DM / HLSyn

THUDM / AgentBench

jonathanmli / Avalon-LLM

Graph-and-Geometric-Learning / MolGroup