Skip to content
View gi2wzh's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report gi2wzh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.

Python 46 2 Updated Nov 16, 2024

Official implement of CIKM2025: 《UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion》

Python 18 2 Updated Sep 17, 2025

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 525 29 Updated Dec 18, 2025

LLM inference in C/C++

C++ 9 Updated Sep 9, 2025

Official PyTorch Implementation of Correlation Verification for Image Retrieval, CVPR 2022 (Oral Presentation)

Python 191 13 Updated Aug 21, 2023

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 7,573 838 Updated Dec 23, 2025

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,303 132 Updated Dec 13, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,474 335 Updated Dec 22, 2025

gguf (GPT-Generated Unified Format) connector

Python 47 10 Updated Dec 24, 2025

RepVGG: Making VGG-style ConvNets Great Again

Python 3,445 433 Updated Feb 10, 2023

Nano vLLM

Python 10,142 1,272 Updated Nov 3, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,489 981 Updated Aug 12, 2024

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,023 66 Updated Dec 15, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 66,839 9,545 Updated Dec 23, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 19,017 1,301 Updated Oct 21, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,516 58 Updated Jun 14, 2025

ClashX 是一个以Clash为内核的MAC系统翻墙工具。

Swift 576 271 Updated Feb 21, 2024

OpenMMLab Model Deployment Framework

Python 3,091 690 Updated Sep 30, 2024

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Python 2,911 376 Updated Dec 25, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,084 525 Updated May 5, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 514 20 Updated Nov 5, 2025

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,818 138 Updated Oct 27, 2025

将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调

Python 479 48 Updated Sep 8, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,527 185 Updated Dec 26, 2025

Some out-of-the-box hooks for pre-commit

Python 6,252 769 Updated Dec 22, 2025

A PyTorch-based knowledge distillation toolkit for natural language processing

Python 1,690 247 Updated May 8, 2023

Text-audio foundation model from Boson AI

Python 7,772 578 Updated Sep 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,802 2,901 Updated Dec 26, 2025
Next