A paper list of recent works on token compression for ViT and VLM

786 36 Updated Dec 18, 2025

Toolkit for Prompt Compression

Python 284 8 Updated Feb 11, 2025

[EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Python 5,693 339 Updated Oct 28, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,750 1,070 Updated Dec 19, 2025

Advancing AI by embracing human-likeness for better AI understanding, human–AI collaboration, and social simulation, bridging technology and genuine human experience.

Python 57 5 Updated Nov 26, 2025

Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467

Python 301 27 Updated Feb 14, 2025

The repo for In-context Autoencoder

Jupyter Notebook 157 19 Updated May 11, 2024

[R]einforcement [L]earning from [M]odel-rewarded [T]hinking - code for the paper "Language Models That Think, Chat Better"

Python 122 6 Updated Oct 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,237 7,787 Updated Dec 19, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,311 78 Updated Mar 6, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,413 231 Updated Nov 12, 2025

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 478 30 Updated Mar 19, 2024

Open-source Traditional Chinese handwriting dataset.

Jupyter Notebook 222 37 Updated May 20, 2021

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Python 166 10 Updated Jun 13, 2024

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 23,919 3,527 Updated Nov 13, 2025

A jailbreak prompt that can universally attack strong leading LLMs.

Python 8 2 Updated Nov 28, 2025

Let your Claude be able to think

TypeScript 16,616 1,965 Updated Nov 4, 2025

An end-to-end signature verification system to extract, clean and verify signatures in documents. Signatures are detected using YOLOv5. Noise is cleaned using a CycleGAN approach and verified. Kera…

Jupyter Notebook 192 72 Updated May 8, 2024
Python 5 Updated Mar 3, 2025

Muon is Scalable for LLM Training

1,384 78 Updated Aug 3, 2025

Fixes frequent GitHub access failures in mainland China by modifying the hosts file; updated daily

Java 2,249 156 Updated Dec 19, 2025

[TIP'24] Official PyTorch implementation of Concept Activation-Guided Contrast Learning.

Python 4 Updated Dec 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,782 12,068 Updated Dec 19, 2025
Jupyter Notebook 9 2 Updated Jul 17, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 157,903 13,966 Updated Dec 19, 2025

SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

Python 1,208 242 Updated Dec 16, 2025

Automatic architecture search and hyperparameter optimization for PyTorch

Python 2,511 302 Updated Apr 9, 2024

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Python 504 49 Updated Oct 9, 2024

NASLib is a Neural Architecture Search (NAS) library for facilitating NAS research for the community by providing interfaces to several state-of-the-art NAS search spaces and optimizers.

Python 575 124 Updated Nov 11, 2024

Automated Deep Learning: Neural Architecture Search Is Not the End (a curated list of AutoDL resources and an in-depth analysis)

Python 2,326 319 Updated Sep 26, 2022