[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 2,924 286 Updated Sep 4, 2025

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,891 540 Updated Nov 7, 2025

henrywoo / pyllama

LLaMA: Open and Efficient Foundation Language Models

Python 2,801 307 Updated Nov 8, 2023

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,689 290 Updated Aug 14, 2024

wzhe06 / SparrowRecSys

A Deep Learning Recommender System

Python 2,671 866 Updated Jun 2, 2024

BBuf / tvm_mlir_learn

compiler learning resources collect.

Python 2,582 359 Updated Mar 19, 2025

InternLM / HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Python 2,440 179 Updated Aug 13, 2025

opendilab / PPOxFamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

Python 2,425 204 Updated Mar 13, 2025

OpenBMB / CPM-Bee

百亿参数的中英文双语基座大模型

Python 2,420 179 Updated Jul 28, 2023

shenweichen / DeepMatch

A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.

Python 2,386 546 Updated Apr 26, 2025

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,378 278 Updated Oct 30, 2025

dvmazur / mixtral-offloading

Run Mixtral-8x7B models in Colab or consumer desktops

Python 2,326 231 Updated Apr 8, 2024

X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,258 129 Updated May 30, 2025

severian42 / GraphRAG-Local-UI

GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.

Python 2,256 284 Updated Nov 9, 2024

TigerResearch / TigerBot

TigerBot: A multi-language multi-task LLM

Python 2,256 189 Updated Dec 28, 2024

AkariAsai / self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 2,234 210 Updated May 25, 2024

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,229 68 Updated May 21, 2025

IST-DASLab / gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,214 182 Updated Mar 27, 2024

guyulongcs / Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising

Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Rei…

Python 2,182 275 Updated Nov 7, 2025