Skip to content
View zhaijiaqi's full-sized avatar

Highlights

  • Pro

Block or report zhaijiaqi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 17 2 Updated May 13, 2026

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Python 530 73 Updated Jun 8, 2026

A list of works on video generation towards world model

496 10 Updated Mar 21, 2026
Python 2 1 Updated Nov 23, 2025

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 427 46 Updated Aug 13, 2024
Python 337 16 Updated Apr 24, 2026

Productive, portable, and performant GPU programming in Python.

C++ 28,250 2,386 Updated Jun 9, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,978 79,307 Updated Jun 16, 2026

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 682 46 Updated Mar 6, 2026
C++ 2 Updated Dec 9, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,506 604 Updated Jun 16, 2026

Numerical linear algebra software package

C++ 603 118 Updated Jun 16, 2026

Mirror of https://gitlab.com/petsc/petsc

C 519 213 Updated Jun 16, 2026

Large-scale sparse Conjugate Gradient (CG) solvers on High Bandwidth Memory (HBM) FPGAs

C++ 9 1 Updated Jul 26, 2024

Implementation of ConjugateGradients method using C and Nvidia CUDA

Python 53 7 Updated Jun 21, 2022

Design preconditioners with a CNN to accelerate the conjugate gradient method.

Python 27 8 Updated Jul 2, 2025

GPU-accelerated linear solvers based on the conjugate gradient (CG) method, supporting NVIDIA and AMD GPUs with GPU-aware MPI, NCCL, RCCL or NVSHMEM

C 15 6 Updated Mar 14, 2026

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 7,350 953 Updated Dec 22, 2025

[OSDI 2025] DecDEC: A Systems Approach to Advancing Low‑Bit LLM Quantization

Python 24 3 Updated Jan 29, 2026
Python 154 32 Updated Jun 24, 2024

[VLDB 25] Maximum Inner Product is Query-Scaled Nearest Neighbor

C++ 40 4 Updated Oct 31, 2025

A vector indexing library to bring fast, fresh and filtered search to your database

Rust 1,850 428 Updated Jun 16, 2026
Python 32 11 Updated Jun 22, 2025

Navigating Spreading-out Graph For Approximate Nearest Neighbor Search

C++ 734 165 Updated Sep 26, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,266 1,154 Updated Jun 16, 2026

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 1 Updated Aug 18, 2025

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 51,290 10,747 Updated Jun 16, 2026
Next