Skip to content
View RewindL's full-sized avatar
  • Alibaba Cloud Intelligence Group
  • Hangzhou, China

Block or report RewindL

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

SCSS 16,815 6,460 Updated Apr 8, 2026
Python 458 37 Updated Apr 14, 2026

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 545 29 Updated Mar 13, 2026

Official Implementation of MARS

Python 24 Updated Apr 9, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 358,591 72,916 Updated Apr 16, 2026

Fast and memory-efficient exact kmeans

Python 536 27 Updated Mar 26, 2026

🚀 Efficient implementations for emerging model architectures

Python 4,895 497 Updated Apr 16, 2026
Python 41 3 Updated Mar 6, 2026

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

Python 704 78 Updated Apr 1, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,897 5,399 Updated Apr 16, 2026

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

Python 2,128 183 Updated Mar 14, 2026

VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs

Python 60 2 Updated Mar 24, 2026
Python 28 3 Updated Jan 5, 2026

Official implementation of "Pyramid Texture Filtering"

MATLAB 30 4 Updated Jun 1, 2023

Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature convergence and unlock greater RL potential.

Python 31 2 Updated Oct 10, 2025

Nano vLLM

Python 12,939 1,939 Updated Apr 13, 2026

[EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models

Python 109 13 Updated Apr 10, 2026

Official Repo of paper "QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression".

Python 12 2 Updated Nov 11, 2025

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

3,066 212 Updated Mar 10, 2026

[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models

Python 77 Updated Oct 10, 2025
Python 46 2 Updated Sep 27, 2025

Code for paper: Optimizing Length Compression in Large Reasoning Models

Python 28 4 Updated Oct 20, 2025

This is the open-source code for TokenCarve.

Python 26 3 Updated Jan 23, 2026

[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search

17 Updated Jan 24, 2026

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 4,027 561 Updated Apr 15, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,083 269 Updated Apr 16, 2026
Python 14 Updated May 27, 2025

[ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Python 52 1 Updated Feb 12, 2026

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

Python 152 6 Updated Apr 7, 2026
Next