gi2wzh

🏠

Working from home

Wang Zhihao gi2wzh

🏠

Working from home

0 followers · 7 following

Achievements

Lists (11)

Sort

Stars

JUNJIE99 / VISTA_Evaluation_FineTuning

Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.

Python 46 2 Updated Nov 16, 2024

qzp2018 / UniECS

Official implement of CIKM2025: 《UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion》

Python 18 2 Updated Sep 17, 2025

iSEE-Laboratory / LLMDet

(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 525 29 Updated Dec 18, 2025

jina-ai / llama.cpp

Forked from ggml-org/llama.cpp

LLM inference in C/C++

C++ 9 Updated Sep 9, 2025

sungonce / CVNet

Official PyTorch Implementation of Correlation Verification for Image Retrieval, CVPR 2022 (Oral Presentation)

Python 191 13 Updated Aug 21, 2023

CVHub520 / X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 7,573 838 Updated Dec 23, 2025

Intellindust-AI-Lab / DEIMv2

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,303 132 Updated Dec 13, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,474 335 Updated Dec 22, 2025

calcuis / gguf-connector

gguf (GPT-Generated Unified Format) connector

Python 47 10 Updated Dec 24, 2025

DingXiaoH / RepVGG

RepVGG: Making VGG-style ConvNets Great Again

Python 3,445 433 Updated Feb 10, 2023

Tencent-Hunyuan / HunyuanOCR

Python 1,344 106 Updated Dec 4, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 10,142 1,272 Updated Nov 3, 2025

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,489 981 Updated Aug 12, 2024

IDEA-Research / Rex-Omni

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,023 66 Updated Dec 15, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 66,839 9,545 Updated Dec 23, 2025

datalab-to / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 19,017 1,301 Updated Oct 21, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,516 58 Updated Jun 14, 2025