Organizations

@Westlake-AI

Implementation of RankE: End-to-End Discrete Text-to-Image Post-Training via Rank-Consistent Alignment

Python 20 Updated May 27, 2026

Free open-source AI text humanizer to convert AI-generated content into undetectable, human-like writing. Bypass Turnitin, GPTZero, and all major AI detectors. No sign-up required. Try our unlimite…

Python 1,234 62 Updated Jun 8, 2026

Cognitive runtime for language models with memory, metacognition, multimodal channels, native plugins, and a self-evolving Executive.

Rust 401 2 Updated May 30, 2026

YAML-native agent workflow execution engine, written in Rust

Rust 1,235 8 Updated Apr 28, 2026

[RSS26'] Welcome to Psi-Zero, a Humanoid VLA towards Universal Humanoid Intelligence.

Python 2,630 69 Updated Jun 14, 2026

LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs

Python 39 Updated Apr 2, 2026

🦞+🔬 NanoResearch: The Autonomous AI Research Assistant

Python 1,504 99 Updated May 26, 2026
Python 11,544 786 Updated Feb 9, 2026

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 8,073 1,803 Updated May 26, 2025

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

JavaScript 32 1 Updated Jan 9, 2026

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Python 60 Updated Nov 27, 2025

The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Jupyter Notebook 961 60 Updated Dec 20, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,960 208 Updated Jun 6, 2026
Python 74 14 Updated Dec 8, 2025
Python 1,225 107 Updated Jan 25, 2026
Python 8 1 Updated Jul 21, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)

Python 1,650 87 Updated Feb 14, 2026

The Abstraction and Reasoning Corpus

JavaScript 4,782 715 Updated Apr 4, 2025

This is a repo to track the latest autoregressive visual generation papers.

430 6 Updated Jun 25, 2025

[CVPR 2026] GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

HTML 18 Updated Apr 1, 2026

a family of versatile and state-of-the-art video tokenizers.

Python 451 20 Updated Sep 1, 2025

Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).

Python 105 3 Updated Feb 11, 2025

[NeurIPS'2025] Official implementation of MGUP, a momentum-gradient greedy alignment update policy for stochastic optimization.

Python 9 Updated Oct 20, 2025

🌟 [Survey] A curated collection of research papers, models, and resources tracing the evolution from specialized models to unified world models.

143 12 Updated Mar 19, 2026

Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications

433 35 Updated Oct 22, 2025

Code for explaining and evaluating late chunking (chunked pooling)

Python 521 48 Updated Dec 23, 2024

[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

Python 21 1 Updated Feb 27, 2026

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,898 402 Updated Mar 27, 2026

Public release of the code for "Accelerating Vision Transformers with Adaptive Patches"

Python 110 11 Updated May 6, 2026

Being-VL-0.5: Unified Multimodal Understanding via Byte-Pair Visual Encoding (ICCV 2025, Highlight)

Python 52 4 Updated Dec 22, 2025