Skip to content
View smj0's full-sized avatar

Block or report smj0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

你想蒸馏的下一个员工,何必是同事。蒸馏任何人的思维方式——心智模型、决策启发式、表达DNA。Distill how anyone thinks.

Python 10,547 1,759 Updated Apr 13, 2026

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript 72,430 10,211 Updated Apr 15, 2026

Auto-backup tool for AI agent workspaces — syncs files to Git with scheduled backups, web dashboard & Telegram notifications. Built with Go.

Shell 5 1 Updated Mar 22, 2026

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 441 31 Updated Feb 17, 2026

[ACL 2026 Main] Official Repo for Paper "Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment“

Python 14 Updated Mar 25, 2026

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Python 117 10 Updated Apr 1, 2026

[TPAMI 2026] Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"

Python 65 Updated Nov 18, 2025

Yuan3.0: Mixture-of-Experts (MoE) Language Model

Python 182 30 Updated Apr 7, 2026
Python 7 Updated Apr 23, 2025

Official implementation of Selective Entropy Regularization (SIREN), proposed by paper 'Rethinking Entropy Regularization in Large Reasoning Models'.

Python 31 Updated Dec 10, 2025

Code and Datasets for reviewing of "SynAdapt: Learning Adaptive Reasoning in Large Language Models via Synthetic Continuous Chain-of-Thought"

Python 3 Updated Sep 23, 2025

A Recipe for Building LLM Reasoners to Solve Complex Instructions

Python 31 Updated Oct 9, 2025

Pretraining and inference code for a large-scale depth-recurrent language model

Python 872 78 Updated Dec 29, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,195 360 Updated Dec 30, 2025
Python 9 Updated Apr 2, 2025

Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.

Python 20 1 Updated Mar 31, 2025

[NeurIPS 2025] Thinkless: LLM Learns When to Think

Python 257 20 Updated Sep 26, 2025

Official Repository for "Continuous Chain of Thought Enables Parallel Exploration and Reasoning"

Python 10 Updated Feb 22, 2026

ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning

Python 85 15 Updated May 30, 2025

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 330 40 Updated Jan 26, 2026

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

Python 151 6 Updated Apr 7, 2026

Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"

Python 84 14 Updated Dec 15, 2025

Connector-Aware Compact CoT (Synthetic Method For Reasoning Data)

Python 2 1 Updated Dec 30, 2025

[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"

Python 195 12 Updated Apr 13, 2026

[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Python 213 18 Updated Nov 30, 2025

[EMNLP 2025] Verification Engineering for RL in Instruction Following

Python 53 2 Updated Mar 30, 2026

REverse-Engineered Reasoning for Open-Ended Generation

Python 95 7 Updated Sep 10, 2025
Next