Skip to content
View hhy3's full-sized avatar
  • Hilbert Space
  • 22:51 (UTC +08:00)
  • LinkedIn in/zhwangcs

Organizations

@milvus-io

Block or report hhy3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI training and inference, such as FP8 row-wise quantization and …

Python 49 21 Updated Feb 7, 2026
C++ 54 4 Updated Feb 6, 2026

[NeurIPS 2025] Official PyTorch implementation of paper "Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression".

6 Updated Oct 24, 2025

Trainable fast and memory-efficient sparse attention

Python 532 49 Updated Feb 1, 2026

⚡ Faster similarity search with PDX: A vertical data layout for vectors

C++ 71 8 Updated Jan 15, 2026

Super fast K-Means for High-Dimensional vectors on CPUs (x86, ARM) and GPUs — for Python and C++. Up to 10x faster clustering of embeddings than FAISS and Scikit-Learn

C++ 14 Updated Feb 2, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 641 31 Updated Feb 7, 2026

Moonshot's most powerful model

797 75 Updated Jan 31, 2026

Learning TileLang with 10 puzzles!

Python 118 12 Updated Jan 30, 2026

OpenViking is an open-source context database designed specifically for AI Agents. OpenViking unifies the management of context (memory, resources, and skills) that Agents need through a file syste…

Python 1,027 96 Updated Feb 7, 2026

Lean 4 programming language and theorem prover

Lean 7,261 750 Updated Feb 7, 2026

LEMUR: Learned Multi-Vector Retrieval

Python 24 3 Updated Jan 28, 2026

Multi-AI adversarial PR review tool

TypeScript 9 4 Updated Jan 29, 2026

FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels

Python 98 42 Updated Feb 5, 2026

High Performance LLM Inference Operator Library

C++ 697 56 Updated Feb 5, 2026

Our first fully AI generated deep learning system

Python 492 30 Updated Feb 2, 2026

Algorithm powering the For You feed on X

Rust 15,065 2,618 Updated Jan 20, 2026

A lightweight, lightning-fast, in-process vector database

C++ 358 20 Updated Feb 6, 2026

Label Filtering Vector Similarity Search

C++ 3 Updated Dec 11, 2025

ray connector to milvus storage

Python 5 4 Updated Jan 14, 2026

Jasper is an approximate nearest neighbors search index built for GPUs. Using the batch-parallel tiling scheme from Manohar et al. and custom-built search kernels, Jasper provides state of the art …

Cuda 6 1 Updated Jan 18, 2026

The lance extensions for DuckDB enable reading and writing of lance tables.

C++ 70 7 Updated Feb 6, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 3,596 244 Updated Jan 14, 2026

mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations

Python 64 2 Updated Jan 12, 2026

High-Performance Embeddable Vector Database with Document Storage, Hybrid Search, and Filtering

C++ 21 Updated Feb 3, 2026

A collection of daily coding challenges designed to help you master idiomatic Go through deliberate, repetitive practice.

Go 2,093 154 Updated Jan 23, 2026

A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ...

Rust 31,008 2,912 Updated Feb 6, 2026

A cloud native embedded storage engine built on object storage.

Rust 2,701 189 Updated Feb 6, 2026
Next