Skip to content
View rootyJeon's full-sized avatar

Block or report rootyJeon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.

Python 1,207 80 Updated Jun 13, 2026
Python 280 17 Updated Jun 1, 2026

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 287 16 Updated Mar 21, 2026

Open-source unified multimodal model

Python 6,016 533 Updated May 4, 2026

A paper list of Awesome Latent Space.

912 36 Updated Jun 13, 2026

A list of awesome Robotics resources

6,661 1,032 Updated Sep 22, 2024

🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.

183 8 Updated Jun 14, 2026

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

Python 1,790 60 Updated Jun 13, 2026

Official codebase for the paper Latent Visual Reasoning

Python 166 9 Updated Oct 22, 2025

A paper list for spatial reasoning

755 42 Updated Jan 19, 2026

A paper list of some recent works about Token Compress for Vit and VLM

921 43 Updated Jun 2, 2026
Jupyter Notebook 541 68 Updated Sep 28, 2025

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

1,024 47 Updated Sep 27, 2025

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python 7,345 1,260 Updated Jun 12, 2026

Famous Vision Language Models and Their Architectures

Markdown 1,265 53 Updated Jan 11, 2026

Tips for Writing a Research Paper using LaTeX

TeX 3,782 414 Updated May 4, 2023

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 12,160 1,066 Updated Mar 8, 2026

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Python 721 86 Updated Apr 15, 2022

[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Python 329 13 Updated Dec 21, 2025

the resources I use to learn computer science in my spare time

4,739 413 Updated Feb 14, 2023

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,953 884 Updated Jul 18, 2024

Main Web Site (Online Books)

HTML 10,426 1,002 Updated Apr 1, 2026

EPFL Course - Optimization for Machine Learning - CS-439

Jupyter Notebook 1,458 349 Updated Jun 5, 2026

Gaussian Splatting from VGGSfM and Mast3r, and their comparison

Python 231 7 Updated Aug 23, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 19,353 2,477 Updated May 30, 2026

Summer 2026 software engineering, data science, AI, quant, product management, and hardware internship postings. Updated daily by Simplify and Pitt CSC.

Python 44,927 3,181 Updated Jun 15, 2026

2024 Gaussian Splatting Paper List(Arxiv)

307 15 Updated Jan 1, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 16,306 1,563 Updated Jan 19, 2025
Next