Skip to content
View bys0318's full-sized avatar

Highlights

  • Pro

Organizations

@THU-KEG @THUDM

Block or report bys0318

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 4,522 2,854 Updated Mar 30, 2026

Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click

TypeScript 13,511 2,151 Updated Mar 31, 2026

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

65 5 Updated Mar 14, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

43,362 4,134 Updated Mar 26, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 342,738 67,728 Updated Mar 31, 2026

Ongoing research training transformer models at scale

Python 15,869 3,772 Updated Mar 31, 2026

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python 572 49 Updated Nov 4, 2025

🌿 DeepPrune: Parallel Scaling without Inter-trace Redundancy

Python 21 Updated Oct 10, 2025
Python 99 9 Updated Feb 11, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,448 493 Updated Mar 31, 2026

[SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling

Python 47 3 Updated Sep 26, 2025

🚀🚀 Efficient implementations of Native Sparse Attention

Python 941 15 Updated Sep 29, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 4,252 263 Updated Mar 27, 2026

GLM-SIMPLE-EVALS: The evaluation repository for the GLM-4.5 series of models by Z.ai.

Python 39 7 Updated Oct 17, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 4,307 443 Updated Feb 1, 2026

[COLM 2024] A Survey on Deep Learning for Theorem Proving

221 16 Updated May 28, 2025
Python 43 1 Updated Feb 22, 2026

An efficient implementation of the NSA (Native Sparse Attention) kernel

Python 132 5 Updated Jun 24, 2025
Python 723 20 Updated Feb 5, 2026

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 44,959 8,135 Updated Mar 29, 2026

Scaling RL on advanced reasoning models

Python 678 42 Updated Oct 20, 2025

slime is an LLM post-training framework for RL Scaling.

Python 5,051 677 Updated Mar 29, 2026
Python 34 3 Updated Jun 5, 2025

[SIGGRAPH 2025] PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer

Python 386 16 Updated May 13, 2025
Python 19 1 Updated Jun 29, 2025

ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括359个大模型,覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3-max、qwen3.5-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.5、ernie4.5、…

5,780 234 Updated Mar 22, 2026

[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring

Python 276 21 Updated Jul 6, 2025
Next