Yue Zhou zytx121

🎯

Focusing

Associate Professor at DREAMS@ECNU

139 followers · 38 following

BUPT -> SJTU -> NTU -> ECNU
China
01:43 (UTC +08:00)
zhouyue.space

Achievements

x3 x3 x2

Lists (1)

🚀 My stack

Stars

285 results for source starred repositories

madderscientist / GratisHub_issue_blog

Build a personal blog site or data showcase using GitHub Issues and GitHub Pages. Adapts to multiple screen sizes.

Dart 1 1 Updated Oct 10, 2025

PhoenixZ810 / MM-HELIX

Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Python 73 1 Updated Oct 19, 2025

madderscientist / issueStat

A GitHub Actions workflow that automatically counts open issues and their labels and saves the statistics to a tag message so they can be requested later.

JavaScript 1 Updated Sep 29, 2025

madderscientist / JEapp

je sheet music library, mobile app

Dart 9 Updated Oct 10, 2025

bingoogolapple / bga_issue_blog

Uses Flutter or the Vue stack (Vue + VueRouter + Vuex + Axios) to fetch GitHub Issues and, combined with GitHub Pages, build a personal blog site; supports GitHub login and comments

Dart 273 54 Updated Dec 10, 2024

VisionXLab / avi-math

Multimodal Mathematical Reasoning Embedded in Aerial Vehicle Imagery: Benchmarking, Analysis, and Exploration

Python 10 1 Updated Sep 15, 2025

whu-pzhang / ASANet

ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification

Python 49 2 Updated Dec 5, 2024

madderscientist / timbreAMT

explore AMT from the perspective of timbre

Jupyter Notebook 8 2 Updated Jun 26, 2025

Thinklab-SJTU / Bench2Drive-VL

Adapting VLMs to Bench2Drive.

Python 163 20 Updated Oct 12, 2025

VisionXLab / AirSpatialBot

[TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval

Python 21 Updated Aug 24, 2025

VisionXLab / mllm-mmrotate

[IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.

Jupyter Notebook 85 6 Updated Jul 3, 2025

madderscientist / je_score_operator

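[Numbered musical notation tools] je numbered-notation (jianpu) processing tools, including transposition, playback, score creation, and MIDI extraction (conversion) and production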

JavaScript 71 10 Updated Sep 20, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,673 366 Updated Oct 21, 2025

The-AI-Alliance / GEO-Bench-VLM

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

Python 84 6 Updated Jul 1, 2025

VisionXLab / GeoGround

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding

72 2 Updated May 10, 2025

mc-lan / Text4Seg

[ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation

Python 149 3 Updated Sep 15, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,897 946 Updated Nov 5, 2025

szx2015 / spider_autohome_data

Data from Autohome (汽车之家) covering car brands, series, models, and more

3 1 Updated Sep 16, 2023

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,255 419 Updated Nov 3, 2025

gastruc / osv5m

Python 172 14 Updated May 6, 2024

zilunzhang / StreetCLIP-Repoduce

Python 12 4 Updated Jul 1, 2024

zilunzhang / Awesome-Geoguesser

Summary of Geoguesser Models / Agents

5 Updated Jun 27, 2024

zou-group / textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,063 253 Updated Jul 25, 2025

OpenGVLab / PIIP

[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)

Python 105 5 Updated Aug 5, 2025

OpenGVLab / OmniCorpus

[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 400 6 Updated May 5, 2025

VisionXLab / STAR-MMRotate

[TPAMI] Oriented object detection on STAR dataset.

Python 83 5 Updated Feb 3, 2025

Zhuzi24 / STAR-MMDetection

3 Updated Jul 2, 2024

zytx121 / Awesome-VLGFM

A Survey on Vision-Language Geo-Foundation Models (VLGFMs)

175 7 Updated May 24, 2025

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,900 176 Updated May 26, 2025

penghao-wu / vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 680 42 Updated Jan 7, 2024