Skip to content
View nliu-25's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report nliu-25

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

(NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"

Python 36 3 Updated Mar 22, 2025

Official implementation of FullMatch (CVPR2023)

Python 45 3 Updated Jul 14, 2025

This is the code for Knowledge-Guided Adversarial Training(KGAT)

Python 162 Updated Mar 27, 2026

Fast Multimodal LLM on Mobile Devices

C++ 1,511 198 Updated Apr 30, 2026

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,510 410 Updated Nov 11, 2025

自动化所硕博论文模板

TeX 46 12 Updated Mar 6, 2018

Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models

Python 8 Updated Mar 14, 2026

MCP for xiaohongshu.com

Go 13,624 2,070 Updated May 15, 2026

xiaohongshu-skills

Python 1,287 185 Updated May 15, 2026

A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io

Rust 73,393 4,766 Updated May 16, 2026

PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding

Python 19 2 Updated May 13, 2026

Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.

3,391 219 Updated May 11, 2026

Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"

Python 94 2 Updated Feb 13, 2026

The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

Python 121 3 Updated Jul 1, 2025

Official repository for VisionZip (CVPR 2025)

Python 429 26 Updated Jul 21, 2025

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 1,127 132 Updated Oct 7, 2024

Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.

164 12 Updated Mar 2, 2026

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

351 21 Updated May 17, 2026

A paper list of some recent works about Token Compress for Vit and VLM

902 42 Updated May 13, 2026

The implement of paper "Asymmetric Contextual Modulation for Infrared Small Target Detection" in Pytorch

Python 60 9 Updated Dec 4, 2020
Python 25 Updated Apr 5, 2026

Pytorch implementation of "EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation"

Python 188 20 Updated May 9, 2026

行业内领先的报告集合 行业 员工 金融 个税 福利薪酬 领导力 财富 会议 报告&工具

Batchfile 77 46 Updated Mar 15, 2019

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 9,675 933 Updated May 17, 2026

AIGCPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。

Vue 4,982 749 Updated May 9, 2026

✨✨Latest Advances on Multimodal Large Language Models

17,798 1,123 Updated May 1, 2026

Collection of AWESOME vision-language models for vision tasks

3,117 233 Updated Oct 14, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 2,264 150 Updated Mar 25, 2026
Next