Skip to content
View hhy3's full-sized avatar
  • Hilbert Space
  • 02:48 (UTC +08:00)
  • LinkedIn in/zhwangcs

Organizations

@milvus-io

Block or report hhy3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

程序员延寿指南 | A programmer's guide to live longer

34,606 2,368 Updated May 19, 2025

S3 vector database for LLM Agents and RAG.

Python 55 4 Updated Dec 12, 2025

Accelerating MoE with IO and Tile-aware Optimizations

Python 334 14 Updated Dec 18, 2025

[SIGMOD2026] Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search with Task-Centric Benchmarks

C++ 7 Updated Dec 16, 2025

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,211 112 Updated Aug 16, 2025
Python 1,415 102 Updated Dec 18, 2025

SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open sour…

TypeScript 24,980 1,862 Updated Dec 19, 2025

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

Python 4,067 250 Updated Dec 19, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 725 20 Updated Dec 19, 2025

Light Video Generation Inference Framework

Python 1,236 78 Updated Dec 19, 2025

Adamas: Hadamard Sparse Attention for Efficient Long-context Inference

Cuda 10 1 Updated Nov 25, 2025

Cookbook of SGLang - Recipe

JavaScript 38 6 Updated Dec 18, 2025

DS SERVE: The Largest Open Vector Store over Pretain Data; A Framework for Efficient and Scalable Neural Retrieval

Python 27 4 Updated Dec 12, 2025

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 451 19 Updated Dec 8, 2025

Ada-ef — Adaptive efSearch for HNSW-based vector search

C++ 4 Updated Dec 9, 2025
C++ 5 Updated Dec 3, 2025

train a model on huchenfeng dataset

Jupyter Notebook 48 2 Updated Dec 8, 2025

The ultimate training toolkit for finetuning diffusion models

Python 8,340 977 Updated Dec 19, 2025

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 168 22 Updated Dec 19, 2025

A framework for efficient model inference with omni-modality models

Python 1,003 136 Updated Dec 19, 2025

NVIDIA cuTile learn

Python 130 Updated Dec 9, 2025

Helpful kernel tutorials and examples for tile-based GPU programming

Python 456 22 Updated Dec 19, 2025

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,629 83 Updated Dec 19, 2025

A comprehensive guide for beginners in the field of data management and artificial intelligence.

506 21 Updated Apr 8, 2025

Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”

Python 81 1 Updated Dec 19, 2025
Python 170 28 Updated Dec 7, 2025

Milvus web documents and contents

MDX 134 132 Updated Dec 15, 2025

Official inference repo for FLUX.2 models

Python 1,241 62 Updated Dec 1, 2025

Classic papers and resources on recommendation

Python 3,466 815 Updated Oct 16, 2025
Next