NROwind

Zhihong 陈 NROwind

6 followers · 10 following

Achievements

Stars

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 81,532 11,854 Updated Mar 26, 2026

wanshuiyin / Auto-claude-code-research-in-sleep

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 9,697 936 Updated May 17, 2026

EIT-NLP / MCMR

Python 6 Updated Mar 7, 2026

XMUDeepLIT / UME-R1

The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).

Python 59 3 Updated Feb 25, 2026

FireRedTeam / FireRed-Image-Edit

FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…

Python 1,213 74 Updated Apr 3, 2026

ZY0025 / GRLM

Python 36 4 Updated Apr 9, 2026

RuijieZhu94 / StatisticalLearning_USTC

Statistical Learning course in USTC. 中科大统计学习（刘东）课程复习资料。

TeX 62 10 Updated Jan 9, 2024

Qinying-Liu / Awesome-omni-modal-understanding

Collection of papers about video-audio understanding

25 1 Updated Dec 26, 2025

Jinghaoleven / RLFR

Official implementation of RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Python 47 1 Updated Nov 15, 2025

NROwind / OpenGPT-4o-Image

A Comprehensive Dataset for Advanced Image Generation and Editing}

32 2 Updated Oct 2, 2025

QwenLM / Qwen3-Embedding

Python 1,928 122 Updated Sep 30, 2025

IDEA-Research / Rex-Thinker

[ICLR-2026] Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 149 7 Updated Jun 30, 2025

Visual-Agent / DeepEyes

Python 1,211 76 Updated Nov 20, 2025

ligeng0197 / Awesome-Thinking-With-Images

Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grained visual understanding".

113 2 Updated Aug 21, 2025

zhaochen0110 / Awesome_Think_With_Images

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,456 45 Updated Mar 9, 2026

QQ-MM / QQMM-embed

Python 24 1 Updated Oct 16, 2025

360CVGroup / FG-CLIP

New generation of CLIP with strong fine grained discrimination capability, ICML2026 and ICML2025

Python 754 36 Updated May 8, 2026

XMUDeepLIT / LLaVE

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

Python 77 3 Updated May 23, 2025

GAIR-NLP / anole

[Extended verision ICLR 2025 Blog Track] Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 838 50 Updated Jun 16, 2025

JiuhaiChen / BLIP3o

Official implementation of BLIP3o-Series

Python 1,654 78 Updated Nov 29, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,574 66 Updated Jun 14, 2025