Stars
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
SGLang is a high-performance serving framework for large language models and multimodal models.
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
一键拥有你自己的 ChatGPT 网页服务。 One-Click to deploy your own ChatGPT web UI.(基于 langchain 实现的插件版本 Plugin version implemented based on langchain)
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
DeepSeek LLM: Let there be answers
A curated list of open-source projects related to DeepSeek Coder
DeepSeek Coder: Let the Code Write Itself
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
A series of large language models developed by Baichuan Intelligent Technology
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.
Parameter Efficient Transfer Learning with Diff Pruning
EsViT: Efficient self-supervised Vision Transformers
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace, SG-HMC and more