Skip to content
View loofahcus's full-sized avatar
  • Beijing, China
  • 18:15 (UTC +08:00)

Block or report loofahcus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,708 230 Updated Oct 14, 2025

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 3,749 675 Updated Oct 11, 2025

The LLM Evaluation Framework

Python 11,962 1,045 Updated Nov 5, 2025

ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802

Python 96 7 Updated Aug 18, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 171 17 Updated Jun 27, 2025
Python 545 55 Updated Jul 11, 2024

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 918 47 Updated Mar 19, 2025

Efficient Mixture of Experts for LLM Paper List

Python 142 5 Updated Sep 28, 2025

Awesome LLM Books: Curated list of books on Large Language Models

1,060 161 Updated Oct 24, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,376 450 Updated Aug 2, 2025

GPU operators for sparse tensor operations

Python 35 1 Updated Mar 11, 2024

Benchmarking Benchmark Leakage in Large Language Models

JavaScript 55 3 Updated May 20, 2024

A resource repository for machine unlearning in large language models

503 29 Updated Jul 20, 2025

LLM Unlearning

Python 177 20 Updated Oct 20, 2023

An Extensible Deep Learning Library

Python 2,280 383 Updated Nov 4, 2025

2025年11月更新,目前国内可用Docker镜像源汇总,DockerHub国内镜像加速列表,🚀DockerHub镜像加速器

6,318 298 Updated Oct 22, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,842 896 Updated Sep 30, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,352 1,523 Updated Apr 24, 2025

Fully open data curation for reasoning models

Python 2,132 176 Updated Sep 3, 2025

交易模块

Python 7,433 1,691 Updated Sep 10, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,183 1,665 Updated Sep 24, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,737 290 Updated Nov 3, 2025

A Telegram bot to recommend arXiv papers

Python 287 24 Updated Apr 12, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,498 3,758 Updated Nov 2, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,104 71 Updated Feb 7, 2025
Python 963 110 Updated Jan 23, 2025

O1 Replication Journey

2,001 63 Updated Jan 14, 2025

RLHF implementation details of OAI's 2019 codebase

Python 193 12 Updated Jan 14, 2024

A flexible and efficient training framework for large-scale alignment tasks

Python 436 36 Updated Oct 23, 2025
Next