smj0

6 followers · 25 following

Stars

alchaincyf / nuwa-skill

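The next employee you distill doesn't have to be a colleague. Distill anyone's way of thinking: mental models, decision heuristics, and expression DNA. Distill how anyone thinks.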

Python 10,547 1,759 Updated Apr 13, 2026

ChinaSiro / claude-code-sourcemap

TypeScript 8,858 14,454 Updated Mar 31, 2026

garrytan / gstack

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 72,430 10,211 Updated Apr 15, 2026

jx453331958 / openclaw-autobackup

Auto-backup tool for AI agent workspaces — syncs files to Git with scheduled backups, web dashboard & Telegram notifications. Built with Go.

Shell 5 1 Updated Mar 22, 2026

NVlabs / GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 441 31 Updated Feb 17, 2026

UmeanNever / RankSurprisalRatio

[ACL 2026 Main] Official Repo for Paper "Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment"

Python 14 Updated Mar 25, 2026

GMLR-Penn / Multiplex-Thinking

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Python 117 10 Updated Apr 1, 2026

Kun-Xiang / AtomThink

[TPAMI 2026] Official Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"

Python 65 Updated Nov 18, 2025

Yuan-lab-LLM / Yuan3.0

Yuan3.0: Mixture-of-Experts (MoE) Language Model

Python 182 30 Updated Apr 7, 2026

lblankl / Token-Assorted

Python 7 Updated Apr 23, 2025

Linn3a / siren

Official implementation of Selective Entropy Regularization (SIREN), proposed in the paper 'Rethinking Entropy Regularization in Large Reasoning Models'.

Python 31 Updated Dec 10, 2025

chenzhiling9954 / Critical-Tokens-Matter

Python 48 2 Updated May 25, 2025

wjw136 / SynAdapt_Review

Code and datasets for review of "SynAdapt: Learning Adaptive Reasoning in Large Language Models via Synthetic Continuous Chain-of-Thought"

Python 3 Updated Sep 23, 2025

yuleiqin / RAIF

A Recipe for Building LLM Reasoners to Solve Complex Instructions

Python 31 Updated Oct 9, 2025

seal-rg / recurrent-pretraining

Pretraining and inference code for a large-scale depth-recurrent language model

Python 872 78 Updated Dec 29, 2025

build-with-groq / g1

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,195 360 Updated Dec 30, 2025

MikeGu721 / GAPO

Python 9 Updated Apr 2, 2025

GraphPKU / number_cookbook

Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.

Python 20 1 Updated Mar 31, 2025

VainF / Thinkless

[NeurIPS 2025] Thinkless: LLM Learns When to Think

Python 257 20 Updated Sep 26, 2025

alperengozeten / CoT2

Official Repository for "Continuous Chain of Thought Enables Parallel Exploration and Reasoning"

Python 10 Updated Feb 22, 2026

xuyige / SoftCoT

[ACL 2025] SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs, and the preprint SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning

Python 85 15 Updated May 30, 2025

eric-ai-lab / Soft-Thinking

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 330 40 Updated Jan 26, 2026

zjunlp / LightThinker

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

Python 151 6 Updated Apr 7, 2026

zhenyi4 / codi

Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"

Python 84 14 Updated Dec 15, 2025

digailab / awesome-llm-implicit-reasoning

115 12 Updated Jan 11, 2026

selectstar-ai / CAC-CoT

Connector-Aware Compact CoT (Synthetic Method For Reasoning Data)

Python 2 1 Updated Dec 30, 2025

InternLM / SIM-CoT

[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"

Python 195 12 Updated Apr 13, 2026

hemingkx / TokenSkip

[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Python 213 18 Updated Nov 30, 2025

THU-KEG / VerIF

[EMNLP 2025] Verification Engineering for RL in Instruction Following

Python 53 2 Updated Mar 30, 2026

multimodal-art-projection / REER_DeepWriter

Forked from HaozheH3/REER_DeepWriter

REverse-Engineered Reasoning for Open-Ended Generation

Python 95 7 Updated Sep 10, 2025