-
Visual-RFT Public
Forked from Liuziyu77/Visual-RFT
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'
-
zhaoyangli-1.github.io Public template
Forked from academicpages/academicpages.github.io
Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.
HTML MIT License Updated Oct 17, 2025
-
ORIC Public
Evaluate LVLMs’ robustness to context-incongruent object recognition and hallucination errors.
Python MIT License Updated Oct 16, 2025
-
lvlm-interpret Public
Forked from IntelLabs/lvlm-interpret
Jupyter Notebook Apache License 2.0 Updated Oct 16, 2025
-
VL-InterpreT Public
Forked from IntelLabs/VL-InterpreT
Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers
Python MIT License Updated Oct 13, 2025
-
drug-drug-interaction Public
Forked from youweiliang/drugchat
-
diffusion_policy Public
Forked from real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
Python MIT License Updated Dec 24, 2024
-
s2v-dagger Public
Forked from tongzhoumu/s2v-dagger
Code for "When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?"
Python MIT License Updated Dec 19, 2024
-
open_flamingo Public
Forked from mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Python MIT License Updated Aug 21, 2024
-
POPE Public
Forked from RUCAIBox/POPE
The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models"
Python MIT License Updated Mar 25, 2024
-
matlab-dockerfile Public
Forked from mathworks-ref-arch/matlab-dockerfile
Create a docker container that contains a MATLAB install
Python Other Updated Feb 22, 2024
-
VIMABench Public
Forked from vimalabs/VIMABench
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Python MIT License Updated Jan 25, 2024
-
LLaVA Public
Forked from haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Python Apache License 2.0 Updated Nov 8, 2023
-
llava-docker Public
Forked from VikingDadMedic/llava-docker
Docker image for LLaVA: Large Language and Vision Assistant
Shell GNU General Public License v3.0 Updated Nov 3, 2023
-
alfworld Public
Forked from lz1oceani/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Python MIT License Updated Oct 25, 2023
-
alfred Public
Forked from lz1oceani/alfred
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
C MIT License Updated Oct 24, 2023
-
reflexion Public
Forked from noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Python MIT License Updated Oct 12, 2023
-
SA-1B-Downloader Public
Forked from KKallidromitis/SA-1B-Downloader
Simple script to parallelize download and extract files for SA-1B Dataset.
Python MIT License Updated Aug 28, 2023
-
CARLA-Team-Code Public
Forked from Slijeff/CARLA-Team-Code
Central code repository for CARLA research
Python Updated Jan 3, 2023