Skip to content
View bbx8216's full-sized avatar
🌨️
🌨️

Organizations

@CUAI-CAU @GDGoC-CAU @All-Bareumi @HiK-Hi-Korea @SSLAB-CAU-FPGA @2023-2-Design-Pattern @SSLAB-for-MPS-scheduling

Block or report bbx8216

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
C++ 20 12 Updated Jan 21, 2026

A curated list of 120+ LLM libraries category wise.

10,077 1,600 Updated Mar 28, 2026

AIOS: AI Agent Operating System

Python 5,480 752 Updated Jan 22, 2026

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,137 237 Updated Oct 16, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,824 469 Updated Oct 14, 2025

GeminiFS: A Companion File System for GPUs

C++ 73 11 Updated Feb 18, 2025

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 33,114 3,973 Updated Mar 25, 2026

Magnum IO community repo

C++ 115 19 Updated Mar 23, 2026

A Survey on Multimodal Retrieval-Augmented Generation

502 26 Updated Feb 20, 2026
Cuda 218 71 Updated Mar 28, 2026

Build userspace NVMe drivers and storage applications with CUDA support

C 423 55 Updated Dec 18, 2023

XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.

Python 188 15 Updated May 3, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,797 1,026 Updated Mar 30, 2026

NVIDIA GPUDirect Storage Driver

C 340 55 Updated Mar 18, 2026

cuVS - a library for vector search and clustering on the GPU

Cuda 727 179 Updated Apr 9, 2026

Retrieval and Retrieval-augmented LLMs

Python 11,512 852 Updated Apr 1, 2026

ESPN: Embedding from Storage Pipelined Network. GDS implementation for multi-vector embedding retrieval and bindings.

C++ 13 1 Updated May 18, 2024

Web-scale retrieval for knowledge-intensive NLP

Python 553 27 Updated Dec 6, 2022

Vector Index Benchmark for Embeddings (VIBE) is an extensible benchmark for approximate nearest neighbor search methods, or vector indexes, using modern embedding datasets.

Python 36 6 Updated Mar 23, 2026

A library for efficient similarity search and clustering of dense vectors.

C++ 39,659 4,323 Updated Apr 9, 2026
Python 14 4 Updated Jun 25, 2025

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Python 225 18 Updated Dec 16, 2025

Benchmark baseline for retrieval qa applications

Jupyter Notebook 121 14 Updated Apr 14, 2024

Official repository of the MIRAGE benchmark

Python 202 25 Updated Feb 6, 2026

Comprehensive benchmark for RAG

Jupyter Notebook 279 35 Updated Jun 14, 2025

Text describing xv6 on RISC-V

TeX 869 193 Updated Sep 2, 2025

Commentary for xv6-public

Perl 272 68 Updated Aug 10, 2020

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,818 15,354 Updated Apr 9, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,061 657 Updated Apr 9, 2026
Next