jyx-whu
  • Peking University
  • Guangdong
  • 10:17 (UTC +08:00)


[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design

Python 132 14 Updated Jun 27, 2023

PyTorch native quantization and sparsity for training and inference

Python 2,859 530 Updated Jun 15, 2026

Sparse Inferencing for transformer based LLMs

Python 220 12 Updated Mar 25, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,884 2,468 Updated Jun 16, 2026

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

716 26 Updated Apr 15, 2026

LLM KV cache compression made easy

Python 1,113 154 Updated Jun 10, 2026

🚀 Efficient implementations for emerging model architectures

Python 5,224 558 Updated Jun 11, 2026

A machine learning accelerator core designed for energy-efficient AI at the edge.

Emacs Lisp 2,388 295 Updated Jun 15, 2026

Synthesisable SystemVerilog implementation of a Transformer Decoder block

SystemVerilog 4 Updated May 1, 2026

Parameterised AXI4 crossbar interconnect in SystemVerilog — N masters, M slaves, round-robin arbitration, ID-based response routing

SystemVerilog 1 Updated Mar 19, 2026

Simple, safe way to store and distribute tensors

Rust 3,774 328 Updated Jun 15, 2026

Unified KV cache management for multi-task VLA inference.

Python 8 Updated May 12, 2026

A very simple and easy to understand RISC-V core.

C 1,482 240 Updated Nov 9, 2023

Vision–Language–Action models for Autonomous Driving (VLA4AD) resources, serving as the companion repository to the survey paper “A Survey on Vision–Language–Action Models for Autonomous Driving”.

605 53 Updated Nov 20, 2025

The official NaplesPU hardware code repository

SystemVerilog 31 5 Updated Jul 27, 2019

A curated list of academic papers and resources on Vision-Language-Action (VLA) and World Action Models (WAM)

Python 30 1 Updated Jun 15, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,899 1,908 Updated Jun 16, 2026

🏆 OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond — redefining the accuracy-efficiency Pareto front for X-LLMs KV quantization.

C++ 135 13 Updated May 21, 2026
Python 82 7 Updated May 27, 2026

⚡ Clash for Lab is a proxy tool designed for lab environments: no sudo privileges required, with a clean one-command script install

Shell 366 19 Updated Feb 1, 2026

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving (ICLR 2026)

Python 377 29 Updated Feb 11, 2026

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 60,635 5,005 Updated Jun 11, 2026

An agentic skills framework & software development methodology that works.

Shell 228,865 20,357 Updated Jun 16, 2026

A list of recent papers on token compression for ViTs and VLMs

921 43 Updated Jun 2, 2026

From Automated Idea Factory to Realization

Shell 1,118 91 Updated Jun 13, 2026

F1: A Vision Language Action Model Bridging Understanding and Generation to Actions

Python 201 14 Updated Jan 2, 2026

A curated collection of papers on Vision-Language-Action (VLA) models for autonomous driving and robotics

Python 16 1 Updated Jun 13, 2026

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 575 177 Updated May 14, 2026

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.

Python 243 10 Updated Sep 13, 2025

Index of hardware design repositories — CPUs, arithmetic units, SoC design, HDL, and power electronics

HTML 3 1 Updated May 1, 2026