GoGiants1
  • Seoul National University
  • Seoul

Organizations

@wafflestudio @wafflestudio18-5


[ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Lear…

Python 404 17 Updated May 23, 2026
Jupyter Notebook 5 Updated Feb 13, 2026

[ICLR 2026] Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models

Python 10 Updated Apr 1, 2026

Cosmos Policy

Python 807 79 Updated Jan 23, 2026

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 6,443 762 Updated Mar 23, 2025

[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "SparseVLM+: Visual Token Sparsification with Improved Text-Vis…

Python 265 22 Updated Dec 22, 2025

[COLM'25] Official implementation of the Law of Vision Representation in MLLMs

Python 177 8 Updated Oct 6, 2025

[ICLR 2026 Oral] Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment

Python 37 1 Updated Feb 14, 2026

The agent that grows with you

Python 195,322 34,295 Updated Jun 16, 2026

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 1,256 182 Updated Sep 9, 2025

Modern, minimalist LaTeX Beamer theme for scientific presentations.

TeX 10 2 Updated Apr 7, 2026

A collection of DESIGN.md files analyzing popular brand design systems. Drop one into your project and let coding agents generate a matching UI.

90,727 10,768 Updated Jun 16, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 12,192 1,121 Updated Jun 15, 2026
Python 10 Updated Mar 4, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

176,862 18,055 Updated Apr 20, 2026

🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.

Python 82 4 Updated May 2, 2026

Unified Codebase for Advanced World Models.

Python 821 43 Updated Jun 11, 2026

Data-centric LLM training with dynamic sample selection, domain mixture optimization, and example reweighting inside the LLaMA-Factory training loop.

Python 1,020 129 Updated Jun 1, 2026

[ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incen…

Python 1,382 27 Updated Mar 20, 2026

Drawing Bayesian networks, graphical models, tensors, technical frameworks, and illustrations in LaTeX.

TeX 2,018 189 Updated May 26, 2025

[ACMMM 2024] AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception

105 4 Updated Jan 19, 2025

Spec-driven development (SDD) for AI coding assistants.

TypeScript 55,155 3,866 Updated Jun 13, 2026

The open source coding agent.

TypeScript 175,272 21,266 Updated Jun 16, 2026

Interactively explore unstructured datasets from your dataframe.

TypeScript 1,259 89 Updated Jun 15, 2026

Give your agents the power of the Hugging Face ecosystem

Python 10,680 702 Updated Jun 16, 2026

Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"

Jupyter Notebook 167 10 Updated Oct 23, 2024
Python 11,559 789 Updated Feb 9, 2026

The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"

Python 640 43 Updated Mar 4, 2026