Stars
Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey". Continuously updated.
AIInfra (AI infrastructure) refers to the full AI system stack, from low-level hardware such as chips up to the software layers that support training and inference of large AI models.
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
project-numina / kimina-prover-rl
Forked from verl-project/verl. Kimina-Prover RL pipeline.
Kimina Lean server (+ client SDK)
Major CS conference publication stats (including accepted and submitted) by year.
slime is an LLM post-training framework for RL Scaling.
Serverless LLM Serving for Everyone.
Official repository for the EMNLP 2025 paper "Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency".
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2
A PyTorch native platform for training generative AI models
An interference-aware scheduler for fine-grained GPU sharing
NVIDIA Linux open GPU kernel module source
LM engine is a library for pretraining and fine-tuning LLMs.
LaTeX Template for Statement of Purpose (SoP)
dInfer: An Efficient Inference Framework for Diffusion Language Models
[SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.