Hyungwook Choi GoGiants1

😄

@wafflestudio

19 followers · 63 following

Seoul National University
Seoul

Achievements

x3 x2

Lists (9)

🚀 My stack

PIAA

4 repositories

🧪 Interview

2 repositories

🚂 Training

10 repositories

🦄 Multi-modal

58 repositories

Stars

HorizonWind2004 / reconstruction-alignment

[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Lear…

Python · 404 stars · 17 forks · Updated May 23, 2026

FreedomIntelligence / ShareGPT-4o-Image

Python · 284 stars · 11 forks · Updated Jul 22, 2025

ethz-spylab / modal-aphasia

Jupyter Notebook · 5 stars · Updated Feb 13, 2026

CURRENTF / Uni-X

[ICLR 2026] Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models

Python · 10 stars · Updated Apr 1, 2026

NVlabs / cosmos-policy

Cosmos Policy

Python · 807 stars · 79 forks · Updated Jan 23, 2026

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python · 6,443 stars · 762 forks · Updated Mar 23, 2025

Gumpest / SparseVLMs

[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "SparseVLM+: Visual Token Sparsification with Improved Text-Vis…

Python · 265 stars · 22 forks · Updated Dec 22, 2025

bronyayang / Law_of_Vision_Representation_in_MLLMs

[COLM'25] Official implementation of the Law of Vision Representation in MLLMs

Python · 177 stars · 8 forks · Updated Oct 6, 2025

xuanyuzhang21 / RALI

[ICLR 2026 Oral] Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment

Python · 37 stars · 1 fork · Updated Feb 14, 2026

NousResearch / hermes-agent

The agent that grows with you

Python · 195,322 stars · 34,295 forks · Updated Jun 16, 2026

moojink / openvla-oft

Forked from openvla/openvla

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python · 1,256 stars · 182 forks · Updated Sep 9, 2025

smanchajm / Jacquenetta-beamer-theme

Modern, minimalist LaTeX Beamer theme for scientific presentations.

TeX · 10 stars · 2 forks · Updated Apr 7, 2026

VoltAgent / awesome-design-md

A collection of DESIGN.md files based on analyses of popular brand design systems. Drop one into your project and let coding agents generate a matching UI.

90,727 stars · 10,768 forks · Updated Jun 16, 2026

wanshuiyin / Auto-claude-code-research-in-sleep

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python · 12,192 stars · 1,121 forks · Updated Jun 15, 2026

StigLidu / GradAlign

Python · 10 stars · Updated Mar 4, 2026

multica-ai / andrej-karpathy-skills

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

176,862 stars · 18,055 forks · Updated Apr 20, 2026

shawn0728 / Unify-Agent

🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.

Python · 82 stars · 4 forks · Updated May 2, 2026

OpenDCAI / OpenWorldLib

Unified Codebase for Advanced World Models.

Python · 821 stars · 43 forks · Updated Jun 11, 2026

OpenDCAI / DataFlex

Data-centric LLM training with dynamic sample selection, domain mixture optimization, and example reweighting inside the LLaMA-Factory training loop.

Python · 1,020 stars · 129 forks · Updated Jun 1, 2026

Osilly / Vision-R1

[ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incen…

Python · 1,382 stars · 27 forks · Updated Mar 20, 2026