Skip to content
View jcwang0602's full-sized avatar
👋
Working
👋
Working

Highlights

  • Pro

Block or report jcwang0602

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 22 Updated Mar 24, 2026

Flash Attention implementatio with attention score

Python 8 Updated Mar 14, 2026

[CVPR2026]RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation

31 Updated Feb 22, 2026
Python 17 Updated Mar 20, 2026

R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning.

Python 66 5 Updated May 14, 2025

Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]

Jupyter Notebook 43 Updated Jun 29, 2025

BARE: Towards Bias-Aware and Reasoning-Enhanced One-Tower Visual Grounding

Python 7 Updated Dec 30, 2025

VPTracker: Global Vision-Language Tracking via Visual Prompt and MLLM

Python 13 Updated Mar 10, 2026

A multi-platform proxy client based on ClashMeta,simple and easy to use, open-source and ad-free.

Dart 34,880 2,124 Updated Mar 25, 2026
Python 9 1 Updated Apr 1, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,475 1,317 Updated Apr 1, 2026

[nature biomedical engineering 2025] Official code for paper: A generalist foundation model and database for open-world medical image segmentation (MedSegX)

Python 83 11 Updated Oct 2, 2025

Vision-Language based Visual Object Tracking

Python 30 1 Updated Oct 10, 2025

MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder

Python 51 4 Updated Aug 16, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,938 769 Updated Sep 22, 2025

Progressive Language-guided Visual Learning for Multi-Task Visual Grounding

Python 13 Updated May 9, 2025

The official sources for the RDKit library

HTML 3,371 996 Updated Apr 1, 2026

[ICME 2025] Overcoming Feature Contamination by Unidirectional Information Modeling for Vision-Language Tracking

Python 3 Updated Mar 22, 2025

[ICME 2025] A Simple and Better Baseline for Visual Grounding

Python 3 Updated May 2, 2025
Python 11 Updated Aug 20, 2025

[NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"

Python 271 11 Updated Nov 5, 2025

[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion

Python 101 4 Updated Oct 29, 2025
Python 100 9 Updated Dec 17, 2024

[TPAMI 2025] Towards Visual Grounding: A Survey

Shell 300 26 Updated Nov 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,364 8,442 Updated Apr 1, 2026
TeX 710 120 Updated Nov 11, 2025
Python 23 1 Updated Aug 20, 2024

The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"

Python 47 5 Updated Nov 4, 2024

Script for download the dataset 'ChestX-ray8'

Python 6 Updated Mar 8, 2021

[ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance

Python 104 7 Updated Feb 6, 2026
Next