Skip to content
View stzoozz's full-sized avatar
👩‍🚒
👩‍🚒

Highlights

  • Pro

Block or report stzoozz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The agent that grows with you

Python 65,972 8,825 Updated Apr 12, 2026

Code Repository of Evaluating Quantized Large Language Models

Python 134 10 Updated Sep 8, 2024

推荐算法实战(Recommend algorithm)

Jupyter Notebook 238 38 Updated Jun 29, 2025

Classic papers and resources on recommendation

Python 3,526 815 Updated Oct 16, 2025

A Lighting Pytorch Framework for Recommendation Models, Easy-to-use and Easy-to-extend.

Jupyter Notebook 988 136 Updated Apr 10, 2026

Autonomous Agents (LLMs) research papers. Updated Daily.

1,214 94 Updated Apr 8, 2026

vLLM Documentation in Chinese Simplified / vLLM 中文文档

TypeScript 169 21 Updated Mar 5, 2026

Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.

Python 158 23 Updated Jun 4, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 24,876 3,915 Updated Mar 6, 2026

Agent Skills as a Memory Layer

TypeScript 3,314 311 Updated Apr 11, 2026

A repo for llm on ncnn

C++ 216 23 Updated Apr 3, 2026

A smart, powerful, and beautiful excalidraw drawing tool.Draw Professional Charts with Natural Language

JavaScript 3,096 380 Updated Jan 22, 2026

Project for ECE143, Data Visualization

Jupyter Notebook 3 Updated Dec 3, 2025

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

3,289 227 Updated Apr 10, 2026

Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory

Python 266 37 Updated Mar 12, 2026

⚡A CLI tool for code structural search, lint and rewriting. Written in Rust

Rust 13,390 342 Updated Apr 11, 2026
Python 15 Updated Nov 20, 2025

LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 1,102 176 Updated Apr 12, 2026

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

Python 699 77 Updated Apr 1, 2026

Memory Agent monorepo

Python 86 12 Updated Oct 9, 2025

Fast low-bit matmul kernels in Triton

Python 443 33 Updated Apr 4, 2026

Awesome list for LLM quantization

Python 412 23 Updated Oct 11, 2025

[ACL 2025] Graph-guided agentic framework for code localization https://arxiv.org/abs/2503.09089

Python 603 94 Updated Aug 17, 2025
JavaScript 51 4 Updated Jun 21, 2025

Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model smaller while preserving accuracy.

Python 609 50 Updated Feb 23, 2026

A Lightweight LLM Post-Training Library

Python 2,216 277 Updated Apr 10, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,480 503 Updated Apr 12, 2026

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,421 86 Updated Apr 21, 2025

FlashInfer: Kernel Library for LLM Serving

Python 5,375 890 Updated Apr 11, 2026
Next