cs-qyzhang

张丘洋 cs-qyzhang

华中科技大学计算机系统结构博士在读

70 followers · 59 following

Huazhong University of Science and Technology
Wuhan, China
https://jianyue.tech

Achievements

Highlights

Starred repositories

467 results for source starred repositories

Clear filter

ModelEngine-Group / unified-cache-management

Persist and reuse KV Cache to speedup your LLM.

Python 106 34 Updated Nov 6, 2025

thunlp / InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 387 36 Updated Apr 20, 2024

InferenceMAX / InferenceMAX

Python 319 39 Updated Nov 6, 2025

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…

TypeScript 31,386 6,095 Updated Nov 6, 2025

Mega4alik / ollm

Python 2,104 178 Updated Nov 4, 2025

morgen52 / webanns

Try the demo of WebANNS on our GitHub page!

C++ 12 Updated Jul 14, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,951 1,294 Updated Nov 3, 2025

state-spaces / mamba

Mamba SSM architecture

Python 16,339 1,481 Updated Oct 10, 2025

NVlabs / GatedDeltaNet

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 357 21 Updated Sep 15, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,758 293 Updated Nov 6, 2025

marius-team / quake

Query-Adaptive Vector Search

C++ 60 12 Updated Nov 3, 2025

spylang / spy

SPy language

Python 625 35 Updated Nov 4, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 21,565 1,851 Updated Nov 4, 2025

microsoft / poml

Prompt Orchestration Markup Language

TypeScript 4,715 248 Updated Oct 21, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 49,890 6,162 Updated Nov 6, 2025

Azure / AzurePublicDataset

Microsoft Azure Traces

Jupyter Notebook 1,019 167 Updated Oct 20, 2025

NetX-lab / Ayo

[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 48 5 Updated Aug 5, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,112 1,905 Updated Nov 1, 2025

coze-dev / coze-studio

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 18,350 2,550 Updated Nov 5, 2025

jlowin / fastmcp

🚀 The fast, Pythonic way to build MCP servers and clients

Python 20,010 1,460 Updated Nov 6, 2025

lancedb / lancedb

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

Rust 7,907 637 Updated Nov 5, 2025

TreeAI-Lab / Awesome-KV-Cache-Management

This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding code links.

233 7 Updated Jul 29, 2025

mit-han-lab / omniserve

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 775 53 Updated Mar 6, 2025