Skip to content
View bys0318's full-sized avatar

Highlights

  • Pro

Organizations

@THU-KEG @THUDM

Block or report bys0318

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 4,975 3,310 Updated Apr 27, 2026

Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click

TypeScript 16,486 3,097 Updated Apr 28, 2026

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

92 8 Updated Mar 14, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

47,426 4,644 Updated Apr 20, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 365,431 74,866 Updated Apr 28, 2026

Ongoing research training transformer models at scale

Python 16,173 3,879 Updated Apr 28, 2026

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python 579 49 Updated Nov 4, 2025

🌿 DeepPrune: Parallel Scaling without Inter-trace Redundancy

Python 21 Updated Apr 20, 2026
Python 101 10 Updated Feb 11, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,832 528 Updated Apr 27, 2026

[SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling

Python 48 3 Updated Apr 17, 2026

🚀🚀 Efficient implementations of Native Sparse Attention

Python 675 15 Updated Sep 29, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 4,345 268 Updated Apr 8, 2026

GLM-SIMPLE-EVALS: The evaluation repository for the GLM-4.5 series of models by Z.ai.

Python 39 7 Updated Oct 17, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 4,332 452 Updated Feb 1, 2026

[COLM 2024] A Survey on Deep Learning for Theorem Proving

223 17 Updated May 28, 2025
Python 46 1 Updated Apr 12, 2026

An efficient implementation of the NSA (Native Sparse Attention) kernel

Python 133 5 Updated Jun 24, 2025
Python 727 20 Updated Feb 5, 2026

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 53,949 9,791 Updated Apr 25, 2026

Scaling RL on advanced reasoning models

Python 677 42 Updated Oct 20, 2025

slime is an LLM post-training framework for RL Scaling.

Python 5,500 754 Updated Apr 28, 2026
Python 34 3 Updated Jun 5, 2025

[SIGGRAPH 2025] PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer

Python 387 16 Updated Apr 17, 2026
Python 19 1 Updated Jun 29, 2025

ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括374个大模型,覆盖chatgpt、gpt-5.4、谷歌gemini-3.1-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3.6-max、qwen3.6-plus、百川、讯飞星火、商汤senseChat等商用模型, 以及step3.5-flash、kimi-k2.6、ernie4.5、Mini…

5,929 241 Updated Apr 26, 2026
Next