Skip to content
View JosephPai's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report JosephPai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 13 Updated Jun 3, 2026

DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.

Go 21,981 1,312 Updated Jun 14, 2026

Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities.

Python 17,621 2,607 Updated Jun 12, 2026

A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.

Python 287 24 Updated May 13, 2026

Github repository for World-VLA-Loop.

JavaScript 29 3 Updated Feb 25, 2026

EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models

HTML 86 1 Updated Dec 17, 2025

[CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation.

Python 84 1 Updated Feb 26, 2026

VCode: SVG as Symbolic Visual Representation

Python 134 6 Updated Feb 21, 2026

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,481 106 Updated Jan 31, 2025

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,729 113 Updated Jan 6, 2026

Automatically hold idle GPU.

Python 78 1 Updated Nov 9, 2025
Python 633 65 Updated Aug 28, 2025

LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]

Python 194 5 Updated Mar 12, 2026

Digital Mind Extension

JavaScript 7,527 1,110 Updated Oct 26, 2025

[ICCV 2025] Balanced Image Stylization with Style Matching Score

Python 70 2 Updated Mar 9, 2026

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 4,156 510 Updated Mar 23, 2026

[ICCVW 2025] This repository includes latest papers, projects and datasets on GenAI for Cel-Animation. Accepted by ICCV 2025 AISTORY Workshop.

204 6 Updated Jan 13, 2026
Jupyter Notebook 145 8 Updated Jun 20, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 2,299 157 Updated Apr 13, 2026

Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

Python 308 15 Updated Apr 23, 2025
Python 60 1 Updated Apr 28, 2025

ICML 2025 - Impossible Videos

Python 83 8 Updated Jul 23, 2025

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

Python 124 12 Updated Jul 27, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,256 2,856 Updated Mar 5, 2026

[ICML 2026]A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation

Python 90 Updated Sep 27, 2025

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

Jupyter Notebook 10,191 661 Updated Jun 13, 2026

FQGAN: Factorized Visual Tokenization and Generation

Python 59 3 Updated Mar 29, 2025

[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model

Python 107 5 Updated Mar 24, 2025
Next