SuperCB
🏠 Working from home
  • rednote-hilab
  • Beijing

Starred repositories

277 stars written in Python

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,228 3,678 Updated Jul 4, 2024

Fast and memory-efficient exact attention

Python 20,376 2,118 Updated Nov 5, 2025

Build Real-Time Knowledge Graphs for AI Agents

Python 19,893 1,871 Updated Nov 6, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,880 3,291 Updated Nov 7, 2025

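Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow: use ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses.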

Python 19,102 1,949 Updated Apr 4, 2024

Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

Python 19,074 1,992 Updated Oct 24, 2025

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,941 1,877 Updated Jul 15, 2025

Inference code for CodeLlama models

Python 16,359 1,935 Updated Aug 12, 2024

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 15,991 557 Updated Nov 1, 2025

FauxPilot - an open-source alternative to GitHub Copilot server

Python 14,759 634 Updated Apr 9, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,625 2,257 Updated Nov 2, 2025

Python Implementation of Reinforcement Learning: An Introduction

Python 14,399 4,958 Updated Aug 9, 2024

Ongoing research training transformer models at scale

Python 14,113 3,249 Updated Nov 7, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,110 969 Updated Nov 3, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,794 3,692 Updated Nov 6, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,365 1,523 Updated Apr 24, 2025

An open-source tool-augmented conversational language model from Fudan University

Python 12,061 1,138 Updated Jul 13, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,946 932 Updated Mar 11, 2025

Retrieval and Retrieval-augmented LLMs

Python 10,793 804 Updated Oct 22, 2025

Large Language Model Text Generation Inference

Python 10,625 1,234 Updated Nov 6, 2025

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,137 1,226 Updated Aug 4, 2025

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 9,566 1,453 Updated Oct 26, 2025

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,509 705 Updated Sep 27, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,374 583 Updated Oct 28, 2024

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).

Python 8,914 833 Updated Nov 7, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,330 807 Updated Oct 31, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,994 716 Updated May 31, 2024

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,868 681 Updated Oct 11, 2025

A faster pytorch implementation of faster r-cnn

Python 7,842 2,322 Updated May 20, 2022

🚴 Call stack profiler for Python. Shows you why your code is slow!

Python 7,460 254 Updated Nov 3, 2025