Official PyTorch implementation of "Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models"

Python 12 Updated Dec 5, 2025

ScanNet / ScanNet

C 2,188 366 Updated Nov 3, 2025

matterport / habitat-matterport-3dresearch

579 43 Updated Mar 25, 2023

pkunlp-icler / FastV

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 534 22 Updated Jan 4, 2025

oyt9306 / RePIC

[NeurIPS 2025] We propose a first RL-based personalized image captioning framework with well-defined verifiable rewards.

Python 10 Updated Nov 17, 2025

coli-saar / grpo-prm

Code for the paper "GRPO is Secretly a Process Reward Model": https://arxiv.org/abs/2509.21154

Python 5 Updated Oct 1, 2025

lillian039 / VARC

Python 164 8 Updated Nov 26, 2025

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 5,018 463 Updated Dec 16, 2025

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,283 728 Updated Dec 21, 2025

deepcs233 / Visual-CoT

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 413 21 Updated Dec 22, 2024

Yushi-Hu / VisualSketchpad

Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jupyter Notebook 272 15 Updated Aug 5, 2025

zhangquanchen / 3DThinker

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Python 110 4 Updated Dec 9, 2025

allenai / aokvqa

Official repository for the A-OKVQA dataset

Python 106 14 Updated May 8, 2024

datalev001 / DeepSeek-TS

Python 63 8 Updated Feb 3, 2025

UMass-Embodied-AGI / 3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 1,164 71 Updated Jun 6, 2024

evelinehong / 3D-CLR-Official

Forked from zsh2000/3D-CLR

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Python 84 4 Updated Jan 20, 2024

Jiho Choi JihoChoi

Highlights

Organizations

Lists (24)

🤖 AI

💯 Algorithm

🔍 BigQuery

🔖

📎 CLIP / VLM

Data Mining

👁️‍🗨️ Vision

Game Bot

🧑‍💻 Git

🌐 GNN

👨 Personal Web Templates

💬 NLP

💻 nodesktop

🧊 object-centric learning

📖 Open Vocabulary

🎑 Scene Graph

📜 Templates

⚙️ Setup, dotfile

🎇 Part Segmentation

⭐ Hetero GNN / CL

🖥️ Ubuntu

Visualization

VLM Bias

🎲 Wordle

Starred repositories

self-attention