Skip to content
View loloxwg's full-sized avatar
🤙
Focusing
🤙
Focusing

Organizations

@doocs @infiniflow @KipData

Block or report loloxwg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

597 results for source starred repositories
Clear filter

VRAFT is a framework written in C++ that implements RAFT protocol and SEDA architecture. Based on VRAFT, distributed software can be developed easily, such as vectordb and distributed storage system.

Jupyter Notebook 11 2 Updated Sep 24, 2024

高性能QUANTAXIS交易所以及自研数据库

Rust 34 15 Updated Nov 5, 2025

Democratizing large model inference and training on any device.

Rust 165 13 Updated Oct 31, 2025

C++ implementation of a fast hash map and hash set using robin hood hashing

C++ 1,414 135 Updated Nov 2, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,235 420 Updated Nov 6, 2025

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Python 4,742 1,292 Updated Nov 6, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,918 314 Updated Nov 6, 2025

An transformer based LLM. Written completely in Rust

Rust 2,948 243 Updated Oct 10, 2025

AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops with specific patterns.

Python 233 43 Updated Nov 7, 2025

🚀 GizmoSQL — High-Performance SQL Server

C++ 223 22 Updated Nov 7, 2025

Embeddable Postgres with real-time, reactive bindings.

TypeScript 13,151 314 Updated Nov 7, 2025

Integrates DuckDB with Google BigQuery, allowing direct querying and management of BigQuery datasets

C++ 140 7 Updated Nov 2, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,800 7,535 Updated Jun 4, 2025

A Unified and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for 🤗Diffusers.

Python 528 20 Updated Nov 7, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,918 694 Updated Nov 7, 2025

Official Repository of "LLM × DATA" Survey Paper

536 53 Updated Nov 2, 2025

Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.

Rust 1,285 43 Updated Nov 7, 2025

Source code for iCache-HPCA'23

Python 50 3 Updated Apr 22, 2023

WiredTiger's source tree

C 2,344 409 Updated Nov 7, 2025

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 528 50 Updated Sep 13, 2025
Rust 18 1 Updated Nov 7, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 719 81 Updated Apr 6, 2025

Inference engine from scratch

C++ 18 5 Updated Jan 1, 2025

🐸 Read Frog - Open Source Immersive Translate | 🐸 陪读蛙 - 开源沉浸式翻译

TypeScript 2,831 158 Updated Nov 6, 2025

The lance extensions for DuckDB enable reading and writing of lance tables.

C++ 15 Updated Aug 13, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,449 11,111 Updated Nov 7, 2025

Diffusion model(SD,Flux,Wan,Qwen Image,...) inference in pure C/C++

C++ 4,529 439 Updated Nov 7, 2025

DINOv2 inference engine written in C/C++ using ggml and OpenCV.

C++ 80 6 Updated May 6, 2025

Eliminates delay when activating caps lock on macOS OSX

Swift 841 19 Updated Jun 18, 2025
Next