Skip to content
View loofahcus's full-sized avatar
  • Beijing, China
  • 00:41 (UTC +08:00)

Block or report loofahcus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,511 182 Updated Dec 22, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,785 236 Updated Dec 22, 2025

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 3,995 719 Updated Dec 18, 2025

The LLM Evaluation Framework

Python 12,685 1,121 Updated Dec 21, 2025

ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802

Python 97 7 Updated Aug 18, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 180 18 Updated Jun 27, 2025
Python 547 56 Updated Jul 11, 2024

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 943 48 Updated Mar 19, 2025

Efficient Mixture of Experts for LLM Paper List

Python 151 6 Updated Sep 28, 2025

Awesome LLM Books: Curated list of books on Large Language Models

1,523 206 Updated Oct 24, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,499 463 Updated Aug 2, 2025

GPU operators for sparse tensor operations

Python 35 1 Updated Mar 11, 2024

Benchmarking Benchmark Leakage in Large Language Models

JavaScript 58 3 Updated May 20, 2024

A resource repository for machine unlearning in large language models

515 31 Updated Dec 17, 2025

LLM Unlearning

Python 178 20 Updated Oct 20, 2023

An Extensible Deep Learning Library

Python 2,303 392 Updated Dec 11, 2025

2025年12月更新,目前国内可用Docker镜像源汇总,DockerHub国内镜像加速列表,🚀DockerHub镜像加速器

6,913 329 Updated Dec 16, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,929 922 Updated Dec 15, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,506 1,533 Updated Apr 24, 2025

Fully open data curation for reasoning models

Python 2,173 182 Updated Dec 2, 2025

交易模块

Python 7,559 1,733 Updated Sep 10, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,433 1,689 Updated Sep 24, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,108 336 Updated Dec 20, 2025

A Telegram bot to recommend arXiv papers

Python 289 24 Updated Nov 10, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 35,953 4,244 Updated Dec 22, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,105 69 Updated Feb 7, 2025
Python 969 111 Updated Jan 23, 2025

O1 Replication Journey

2,003 63 Updated Jan 14, 2025

RLHF implementation details of OAI's 2019 codebase

Python 197 12 Updated Jan 14, 2024
Next