b1tx

2 followers · 14 following

Stars

ZeweiYu1 / ARLCP

Official implementation of the paper [ICLR2026] Stop Unnecessary Reflection: Training LRMs for Efficient Reasoning with Adaptive Reflection and Length Coordinated Penalty

Python 5 Updated Mar 3, 2026

thu-nics / R2R

[NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"

Python 86 13 Updated Apr 2, 2026

z-lab / paroquant

[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Python 177 15 Updated Apr 6, 2026

AI9Stars / AStar-Thought

[NeurIPS 2025] A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

Python 13 1 Updated Mar 20, 2026

yu-lin-li / ReBalance

[ICLR 2026] Efficient Reasoning with Balanced Thinking

Python 108 5 Updated Mar 18, 2026

datawhalechina / hello-agents

📚 "Building Agents from Scratch": a from-the-ground-up tutorial on agent principles and practice

Python 33,838 3,897 Updated Mar 30, 2026

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 55,633 9,705 Updated Feb 11, 2026

jingyaogong / minimind

🚀🚀 [LLM] Train a 64M-parameter GPT completely from scratch in just 2h!

Python 45,742 5,602 Updated Apr 4, 2026

lucky-aeon / AgentX

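AgentX aims to let complete beginners build their own Agent through natural language, with no barrier to entry. AgentX uses a self-developed MCP gateway and a model high-availability component to achieve high availability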

Java 642 109 Updated Mar 18, 2026

wyf3 / llm_related

Reproductions of LLM-related algorithms, plus some study notes

Python 3,219 431 Updated Mar 21, 2026

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python 26,461 2,442 Updated Apr 2, 2026

bytedance / deer-flow

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 58,380 7,311 Updated Apr 6, 2026

sinaptik-ai / pandas-ai

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 23,443 2,298 Updated Oct 28, 2025

openinterpreter / open-interpreter

A natural language interface for computers

Python 63,009 5,440 Updated Feb 9, 2026

eosphoros-ai / DB-GPT

Open-source agentic AI data assistant for the next generation of AI + Data products.

Python 18,447 2,606 Updated Apr 3, 2026

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,020 1,583 Updated Feb 27, 2026

WujiangXu / A-mem-sys

A-MEM: Agentic Memory for LLM Agents

Python 312 48 Updated Mar 15, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,460 3,577 Updated Apr 3, 2026

chunhuizhang / llm_rl

llm & rl

Jupyter Notebook 281 28 Updated Oct 24, 2025

2dust / v2rayN

A GUI client for Windows, Linux, and macOS that supports Xray, sing-box, and others

C# 100,863 14,469 Updated Apr 6, 2026

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 15,932 3,786 Updated Apr 6, 2026

ZJU-REAL / Self-Braking-Tuning

[NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604

Python 55 Updated Nov 4, 2025

flagos-ai / awesome-LLM-driven-kernel-generation

Review automated kernel generation in the era of LLMs

161 8 Updated Mar 26, 2026

YuxuanJiang1 / DRP

Python 12 1 Updated Feb 24, 2026

PercyHayes / UCAS_class_collection

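Course materials from the UCAS Yanqi Lake campus for 2024~2025, including Reinforcement Learning, Intelligent Computing Systems, Pattern Recognition, Matrix Analysis and Applications, Principles and Algorithms of Artificial Intelligence, and Natural Language Processing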

Python 41 Updated Sep 22, 2025

Replicable-MARL / MARLlib

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 1,298 194 Updated Nov 28, 2024

LiveCodeBench / LiveCodeBench

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Python 835 185 Updated Jul 16, 2025

Zengwh02 / ASAP

Pruning the Unsurprising: Efficient LLM Reasoning via First-Token Surprisal

Python 13 1 Updated Jan 8, 2026

AgenticIR-Lab / OThink-R1

This is the official code for OThink-R1 project.

Python 22 5 Updated Jun 19, 2025

staymylove / COT_Compresstion_via_Step_entropy

Python 22 Updated Aug 8, 2025