Skip to content
View AndroidSheepy's full-sized avatar
  • USTC, intern@MBZUAI
  • Abu Dhabi, UAE

Highlights

  • Pro

Block or report AndroidSheepy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Keep updating.

538 33 Updated Mar 16, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 6,617 872 Updated Dec 22, 2025

LaTeX template for USTC thesis

TeX 2,040 444 Updated Mar 30, 2026

Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.

Python 71 4 Updated Apr 3, 2026

Kimina-Prover RL pipeline

Python 10 1 Updated Aug 14, 2025

Kimina Lean server (+ client SDK)

Python 191 29 Updated Jan 11, 2026

Major CS conference publication stats (including accepted and submitted) by year.

Python 176 12 Updated Dec 23, 2025

slime is an LLM post-training framework for RL Scaling.

Python 5,112 689 Updated Apr 3, 2026

Serverless LLM Serving for Everyone.

Python 667 69 Updated Mar 6, 2026

Official repository for the EMNLP 2025 paper "Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency".

Jupyter Notebook 14 Updated Sep 18, 2025

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,219 306 Updated Jan 14, 2026

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,551 227 Updated Dec 15, 2025

[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2

Python 288 42 Updated Aug 28, 2025

A PyTorch native platform for training generative AI models

Python 5,206 772 Updated Apr 4, 2026

An interference-aware scheduler for fine-grained GPU sharing

Python 161 28 Updated Nov 26, 2025

NVIDIA Linux open GPU kernel module source

C 16,853 1,652 Updated Apr 3, 2026

Easy and Efficient dLLM Fine-Tuning

Python 238 14 Updated Mar 2, 2026

LM engine is a library for pretraining/finetuning LLMs

Python 162 28 Updated Apr 4, 2026

LaTeX Template for Statement of Purpose (SoP)

TeX 148 21 Updated Oct 28, 2022

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 449 42 Updated Feb 11, 2026

[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference

Python 84 23 Updated Dec 7, 2025
C++ 630 107 Updated Mar 31, 2026

Pie: Programmable LLM Serving

Python 141 17 Updated Apr 4, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,046 262 Updated Apr 4, 2026

Kernels, of the mega variety :)

Python 699 54 Updated Apr 1, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,307 852 Updated Mar 22, 2026
C++ 33 2 Updated Jul 17, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 37,699 16,767 Updated Apr 4, 2026
Cuda 32 1 Updated Apr 2, 2025
Next