- Hong Kong
- 07:27 (UTC +08:00)
- https://www.cse.cuhk.edu.hk/~zhpei23/
Stars
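Sharing AI Infra knowledge and coding exercises: getting started with the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, large-model fundamentals 🧠, AI software and hardware 🔧, and more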
OpenSquilla — Token-Efficient AI Agent with same budget, higher intelligence density
Elevate your AI research writing, no more tedious polishing ✨
Academic Research Skills for Claude Code: research → write → review → revise → finalize
Open-source harness distillation: frontier teachers improve prompts, tools, validators, skills, and runtime policies around weaker models.
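Distills a doctoral supervisor's ten years of research experience into directly callable AI skills. From idea conception to paper submission, your AI research co-advisor.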
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. …
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
Train transformer language models with reinforcement learning.
Headless Slay the Spire 2 CLI — play the full game from a terminal.
DFlash: Block Diffusion for Flash Speculative Decoding
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
The implementation for the paper, FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers.
[ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis
Proactive Inference for Efficient Mixture-of-Experts
SCOPE: Self-evolving Context Optimization via Prompt Evolution - A framework for automatic prompt optimization
OpenClaw-RL: Train any agent simply by talking
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
LOFT: A 1 Million+ Token Long-Context Benchmark
Official JAX implementation of End-to-End Test-Time Training for Long Context
Collection of advice for prospective and current PhD students
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
[ASPDAC23] High Dimensional Yield Estimation using Shrinkage Deep Features and Maximization of Integral Entropy Reduction