Lists (1)
Sort Name ascending (A-Z)
Stars
Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]
This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"
A high-throughput and memory-efficient inference and serving engine for LLMs
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding
Implementation for paper "Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models".
本项目主要包含的是ZJUSE大二大三的一些专业课的学习笔记、课件以及部分本人完成的作业,但仅供学习参考使用,请勿用于不正当用途!
C语言教程+博客+代码演示+课程设计。 帮助初学者更好的理解 C 难点,提升代码量! For beginners:C tuition/self-learning