
Official implementation of the paper [ICLR2026] Stop Unnecessary Reflection: Training LRMs for Efficient Reasoning with Adaptive Reflection and Length Coordinated Penalty

Python 5 Updated Mar 3, 2026

[NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"

Python 86 13 Updated Apr 2, 2026

[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Python 177 15 Updated Apr 6, 2026

[NeurIPS 2025] A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings

Python 13 1 Updated Mar 20, 2026

[ICLR 2026] Efficient Reasoning with Balanced Thinking

Python 108 5 Updated Mar 18, 2026

📚 "Building Agents from Scratch": a from-scratch tutorial on agent principles and practice

Python 33,838 3,897 Updated Mar 30, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 55,633 9,705 Updated Feb 11, 2026

🚀🚀 "Large Models": Train a 64M-parameter GPT completely from scratch in just 2 hours!

Python 45,742 5,602 Updated Apr 4, 2026

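AgentX aims to let beginners build their own Agent through natural language, with no barrier to entry. AgentX uses a self-developed MCP gateway and model high-availability components to achieve high availability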

Java 642 109 Updated Mar 18, 2026

Reproductions of LLM-related algorithms, plus some study notes

Python 3,219 431 Updated Mar 21, 2026

🤗 smolagents: a barebones library for agents that think in code.

Python 26,461 2,442 Updated Apr 2, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 58,380 7,311 Updated Apr 6, 2026

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 23,443 2,298 Updated Oct 28, 2025

A natural language interface for computers

Python 63,009 5,440 Updated Feb 9, 2026

Open-source agentic AI data assistant for the next generation of AI + Data products.

Python 18,447 2,606 Updated Apr 3, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 13,020 1,583 Updated Feb 27, 2026

A-MEM: Agentic Memory for LLM Agents

Python 312 48 Updated Mar 15, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,460 3,577 Updated Apr 3, 2026

llm & rl

Jupyter Notebook 281 28 Updated Oct 24, 2025

A GUI client for Windows, Linux, and macOS that supports Xray, sing-box, and others

C# 100,863 14,469 Updated Apr 6, 2026

Ongoing research training transformer models at scale

Python 15,932 3,786 Updated Apr 6, 2026

[NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604

Python 55 Updated Nov 4, 2025

Review automated kernel generation in the era of LLMs

161 8 Updated Mar 26, 2026
Python 12 1 Updated Feb 24, 2026

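Course materials from the Yanqi Lake campus of the University of Chinese Academy of Sciences (UCAS) for 2024~2025, including Reinforcement Learning, Intelligent Computing Systems, Pattern Recognition, Matrix Analysis and Applications, Principles and Algorithms of Artificial Intelligence, and Natural Language Processing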

Python 41 Updated Sep 22, 2025

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 1,298 194 Updated Nov 28, 2024

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Python 835 185 Updated Jul 16, 2025

Pruning the Unsurprising: Efficient LLM Reasoning via First-Token Surprisal

Python 13 1 Updated Jan 8, 2026

This is the official code for OThink-R1 project.

Python 22 5 Updated Jun 19, 2025