Wuhan University
- Singapore
Stars
EmDash is a full-stack TypeScript CMS based on Astro; the spiritual successor to WordPress
📖 This is a repository for organizing papers, code, and other resources related to Visual Reinforcement Learning.
[CVPR 2026] Official code for "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
[NeurIPS2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"
[CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models
Think Twice to See More: Iterative Visual Reasoning in Medical VLMs
A High-Quality Diabetic Retinopathy Pixel-Level Annotation Dataset
🍒 This is the mobile version of Cherry Studio.
[CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".
[ICLR2025] LLaVA-HR: High-Resolution Large Language-Vision Assistant
Code for the paper "ViConEx-Med: Visual Concept Explainability via Multi-Concept Token Transformer for Medical Image Analysis", 2025.
A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
[ICCV'25 Highlight] Derm1M: A Million‑Scale Vision‑Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology
[NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution
A list of papers about concept bottleneck models (CBMs)
Recommends new arXiv papers matching your interests daily, based on your Zotero library.
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models.
IELTS Vocabulary Zhenjing (雅思词汇真经), IELTS grammar, Listening 179, Reading 538 synonym substitutions, and more. Everything from preparing for my IELTS exam.