-
Zhejiang University
- China
- cu-oh-2.github.io
- https://codeforces.com/profile/Cu_OH_2
Stars
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
⭐️ A cross-platform CLI All-in-One assistant tool for Claude Code, Codex & Gemini CLI.
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode, openclaw & Gemini CLI.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Deep dive into Claude Code internals — architecture, agent loop, context engineering, and more. / 深入解析 Claude Code 源码:架构、Agent 循环、上下文工程、工具系统等
Build your own Claude Code from scratch. 🔍 Claude Code 开源了 50 万行代码,读不动?用 ~4000 行 TypeScript / Python 从零复现核心架构,11 章分步教程带你理解 coding agent 精髓
📚 A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for streaming video.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory 是一个 集…
A feature-rich command-line audio/video downloader
A Direct IP multiplayer mod for Slay the Spire 2. Say goodbye to complex platform network issues and connect directly with your friends via IP address to climb the Spire together!
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
Writing AI Conference Papers: A Handbook for Beginners
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
The paper list of "Memory in the Age of AI Agents: A Survey"
[Awesome] 🔥🔥🔥 Latest Papers, Codes and Datasets on Streaming / Online Video Understanding
一个自动化的每日学术播客生成器,支持直接从 Hugging Face Paper 上获取每日论文以及用户额外输入论文,生成图文并茂的高质量论文播客。
[NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
A high-throughput and memory-efficient inference and serving engine for LLMs
revolutionary new technology that turns any image into obama