Skip to content
View luofuli's full-sized avatar

Block or report luofuli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,612 191 Updated Aug 12, 2020

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,043 5,037 Updated Mar 28, 2026

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Python 320 28 Updated Dec 20, 2023
Jupyter Notebook 1,227 159 Updated Dec 22, 2025

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

6,396 578 Updated Sep 3, 2025

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

6,567 1,055 Updated Nov 11, 2025

Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch

Python 28 2 Updated Mar 28, 2026
Python 4,417 481 Updated Jul 31, 2025

一键拥有你自己的 ChatGPT 网页服务。 One-Click to deploy your own ChatGPT web UI.(基于 langchain 实现的插件版本 Plugin version implemented based on langchain)

TypeScript 1,208 346 Updated Feb 24, 2026

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 1,000 60 Updated Dec 6, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,722 194 Updated Jun 25, 2024

Repository for NPHardEval, a quantified-dynamic benchmark of LLMs

Jupyter Notebook 64 3 Updated Mar 26, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,816 470 Updated Oct 14, 2025

DeepSeek LLM: Let there be answers

Makefile 6,787 1,060 Updated Feb 4, 2024
Python 39 142 Updated May 22, 2025

A curated list of open-source projects related to DeepSeek Coder

770 210 Updated Nov 11, 2025

DeepSeek Coder: Let the Code Write Itself

Python 22,965 2,748 Updated Nov 11, 2025

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Python 3,009 358 Updated Apr 22, 2025

Simple demonstration of a cjk tokenizer.

Python 5 1 Updated Sep 11, 2023

A series of large language models developed by Baichuan Intelligent Technology

Python 4,117 292 Updated Nov 8, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 182,900 46,220 Updated Mar 28, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,094 2,908 Updated Mar 26, 2026

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Python 2,682 342 Updated Jul 25, 2023

TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.

Python 910 206 Updated Jun 26, 2024

Parameter Efficient Transfer Learning with Diff Pruning

Python 74 9 Updated Feb 3, 2021

MEND: Fast Model Editing at Scale

Python 257 33 Updated Aug 30, 2023
4 1 Updated Mar 6, 2022

EsViT: Efficient self-supervised Vision Transformers

Python 413 41 Updated Aug 28, 2023

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,370 894 Updated Dec 17, 2024

Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace, SG-HMC and more

Jupyter Notebook 1,963 305 Updated Oct 20, 2023
Next