Stars
Fully open reproduction of DeepSeek-R1
verl: Volcano Engine Reinforcement Learning for LLMs
slime is an LLM post-training framework for RL scaling.
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
MoBA: Mixture of Block Attention for Long-Context LLMs
Large World Model -- Modeling Text and Video with Millions Context
Fully Open Framework for Democratized Multimodal Training
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Triton-based implementation of Sparse Mixture of Experts.
Code release for paper "Test-Time Training Done Right"
Ring attention implementation with flash attention
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Ongoing research training transformer models at scale
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Hackable and optimized Transformers building blocks, supporting a composable construction.
Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Automatically split your PyTorch models on multiple GPUs for training & inference
A PyTorch native platform for training generative AI models
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Fast and memory-efficient exact attention
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)