
AI agents running research on single-GPU nanochat training automatically

Python 48,699 6,773 Updated Mar 21, 2026

🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Python 2,305 246 Updated Mar 20, 2026

OpenClaw-RL: Train any agent simply by talking

Python 3,946 385 Updated Mar 22, 2026

breakdown of popular coding agents

Shell 2 Updated Mar 19, 2026

OpenViking is an open-source context database designed specifically for AI agents (such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need th…

Python 17,608 1,204 Updated Mar 22, 2026

Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀

Rust 28,310 3,863 Updated Mar 22, 2026

Tiny, Fast, and Deployable anywhere — automate the mundane, unleash your creativity

Go 25,735 3,549 Updated Mar 22, 2026

A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps, has memory, scheduled jobs, and runs dir…

TypeScript 24,797 7,690 Updated Mar 21, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 329,057 63,891 Updated Mar 22, 2026

PiloTY: AI pilot for PTY operations via MCP - enables AI agents to control interactive terminals like a human

Python 30 4 Updated Mar 11, 2026

A mini implementation demo for clawdbot

TypeScript 2 1 Updated Feb 8, 2026

Post-training with Tinker

Python 2,966 357 Updated Mar 22, 2026

LLMRouter: An Open-Source Library for LLM Routing

Python 1,534 141 Updated Mar 17, 2026

Nano vLLM

Python 12,352 1,760 Updated Nov 3, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,959 993 Updated Mar 20, 2026

Hands-on Ollama: deploy large models on a CPU. Read online at https://datawhalechina.github.io/handy-ollama/

Jupyter Notebook 2,294 288 Updated Jan 15, 2026

Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,193 67 Updated Nov 9, 2025

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Python 1,255 198 Updated Jul 18, 2024

Integrate the DeepSeek API into popular software

35,996 3,998 Updated Feb 23, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,526 1,005 Updated Feb 6, 2026

Fully open reproduction of DeepSeek-R1

Python 25,956 2,416 Updated Nov 24, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,353 443 Updated Mar 9, 2026

Machine Learning Toolkit for Kubernetes

15,526 2,614 Updated Jan 5, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,097 3,477 Updated Mar 21, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 12,965 1,581 Updated Feb 27, 2026

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

TypeScript 4,900 438 Updated Mar 3, 2026

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,883 842 Updated May 29, 2022

A generative world for general-purpose robotics & embodied AI learning.

Python 28,318 2,628 Updated Mar 21, 2026

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Python 887 117 Updated Feb 26, 2026