liuqh16

🐶

Qihan Liu liuqh16

🐶

Ph.D student from Department of Automation, Tsinghua University.

58 followers · 18 following

Tsinghua University
Beijing
15:03 (UTC +08:00)
https://orcid.org/0000-0001-6637-8346

Achievements

Highlights

Lists (9)

Sort

Stars

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 48,699 6,773 Updated Mar 21, 2026

aiming-lab / MetaClaw

🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Python 2,305 246 Updated Mar 20, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 3,946 385 Updated Mar 22, 2026

7hinkDifferent / agent-cracker

breakdown of popular coding agents

Shell 2 Updated Mar 19, 2026

volcengine / OpenViking

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need th…

Python 17,608 1,204 Updated Mar 22, 2026

zeroclaw-labs / zeroclaw

Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀

Rust 28,310 3,863 Updated Mar 22, 2026

sipeed / picoclaw

Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity

Go 25,735 3,549 Updated Mar 22, 2026

qwibitai / nanoclaw

A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps,, has memory, scheduled jobs, and runs dir…

TypeScript 24,797 7,690 Updated Mar 21, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 329,057 63,891 Updated Mar 22, 2026

yiwenlu66 / PiloTY

PiloTY: AI pilot for PTY operations via MCP - enables AI agents to control interactive terminals like a human

Python 30 4 Updated Mar 11, 2026

san-tian / miniclaw

一个mini实现 demo for clawdbot

TypeScript 2 1 Updated Feb 8, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 2,966 357 Updated Mar 22, 2026

ulab-uiuc / LLMRouter

LLMRouter: An Open-Source Library for LLM Routing

Python 1,534 141 Updated Mar 17, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 12,352 1,760 Updated Nov 3, 2025

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,959 993 Updated Mar 20, 2026

datawhalechina / handy-ollama

动手学Ollama，CPU玩转大模型部署，在线阅读地址：https://datawhalechina.github.io/handy-ollama/

Jupyter Notebook 2,294 288 Updated Jan 15, 2026

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,193 67 Updated Nov 9, 2025

jannerm / diffuser

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Python 1,255 198 Updated Jul 18, 2024

deepseek-ai / awesome-deepseek-integration

Integrate the DeepSeek API into popular software

35,996 3,998 Updated Feb 23, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,526 1,005 Updated Feb 6, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,956 2,416 Updated Nov 24, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,353 443 Updated Mar 9, 2026

kubeflow / kubeflow

Machine Learning Toolkit for Kubernetes

15,526 2,614 Updated Jan 5, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,097 3,477 Updated Mar 21, 2026

deepseek-ai / DeepSeek-R1

91,970 11,748 Updated Jun 27, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,965 1,581 Updated Feb 27, 2026

FellouAI / eko

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

TypeScript 4,900 438 Updated Mar 3, 2026

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,883 842 Updated May 29, 2022

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 28,318 2,628 Updated Mar 21, 2026

instadeepai / Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 887 117 Updated Feb 26, 2026

Qihan Liu liuqh16

Highlights

Lists (9)

CodeLib

Courses

Diffusion

JAX

LLM

MARL

MBRL

RL

Tools

Stars