AndroidSheepy
  • USTC, intern@MBZUAI
  • Abu Dhabi, UAE

Highlights

  • Pro

SWE-bench: Can Language Models Resolve Real-world GitHub Issues?

Python 4,809 845 Updated Apr 1, 2026

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,334 143 Updated Apr 28, 2026

Presentation Slides for Developers

TypeScript 46,080 2,036 Updated Apr 28, 2026

Elevate your AI research writing, no more tedious polishing ✨

20,116 1,607 Updated Mar 25, 2026

Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.

542 34 Updated Mar 16, 2025

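AIInfra (AI infrastructure) covers the AI system stack, from low-level hardware such as chips up to the upper-layer software stack that supports training and inference of large AI models.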

Jupyter Notebook 6,882 896 Updated Dec 22, 2025

LaTeX template for USTC thesis

TeX 2,089 447 Updated Apr 16, 2026

Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.

Python 81 6 Updated Apr 21, 2026

Kimina-Prover RL pipeline

Python 11 1 Updated Aug 14, 2025

Kimina Lean server (+ client SDK)

Python 191 29 Updated Jan 11, 2026

Major CS conference publication stats (including accepted and submitted) by year.

Python 178 13 Updated Dec 23, 2025

slime is an LLM post-training framework for RL Scaling.

Python 5,510 756 Updated Apr 28, 2026

Serverless LLM Serving for Everyone.

Python 676 71 Updated Apr 24, 2026

Official repository for the EMNLP 2025 paper "Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency".

Jupyter Notebook 14 Updated Sep 18, 2025

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,348 326 Updated Jan 14, 2026

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,562 229 Updated Dec 15, 2025

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

Python 290 44 Updated Aug 28, 2025

A PyTorch native platform for training generative AI models

Python 5,277 799 Updated Apr 28, 2026

An interference-aware scheduler for fine-grained GPU sharing

Python 162 28 Updated Nov 26, 2025

NVIDIA Linux open GPU kernel module source

C 16,941 1,671 Updated Apr 3, 2026

Easy and Efficient dLLM Fine-Tuning

Python 250 15 Updated Mar 2, 2026

LM engine is a library for pretraining/finetuning LLMs

Python 165 29 Updated Apr 26, 2026

LaTeX Template for Statement of Purpose (SoP)

TeX 149 22 Updated Oct 28, 2022

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 459 45 Updated Feb 11, 2026

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Python 87 25 Updated Dec 7, 2025
Cuda 639 107 Updated Apr 27, 2026

Pie: Programmable LLM Serving

Python 150 17 Updated Apr 27, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,111 275 Updated Apr 28, 2026

Kernels, of the mega variety :)

Python 715 56 Updated Apr 28, 2026