Skip to content
View SOTAMak1r's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report SOTAMak1r

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images …

Python 27,909 2,476 Updated Jun 15, 2026

World Model Self-Distillation project website

14 2 Updated Jun 15, 2026

Ideogram 4: Open image model at the forefront of design

Python 2,063 202 Updated Jun 4, 2026

From Automated Idea Factory to Realization

Shell 1,118 91 Updated Jun 13, 2026

🎥 [Awesome] Egocentric / First-Person Video Datasets 📚 Papers, Benchmarks & Resources for Ego Vision

144 4 Updated Jun 15, 2026

Our inference and training framework to run on the Cosmos Models

Python 242 34 Updated Jun 15, 2026

Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

612 7 Updated May 28, 2026

Simulations and identifiability proof for LeJEPA

Python 110 12 Updated May 27, 2026

[AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices

Python 120 1 Updated Nov 30, 2025

Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**

Python 403 42 Updated Apr 11, 2026

Geo-Align: Video Generation Alignment via Metric Geometry Reward

Python 29 Updated May 25, 2026

Awesome List for On-Policy Distillation

640 10 Updated Jun 13, 2026

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Python 738 36 Updated Jun 3, 2026

Lens is a 3.8B-parameter text-to-image diffusion model that achieves quality competitive with and in several cases surpassing models like FLUX and SD3, while requiring significantly less training c…

Python 242 17 Updated May 25, 2026

HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.

Python 1,383 127 Updated May 27, 2026

Official Implemenation for RAEv2: Improved Baselines with Representation Autoencoders

Python 267 10 Updated May 21, 2026

repository for training action-conditioned latent diffusion world models for robot video generation

Python 66 2 Updated May 29, 2026

[Arxiv 2026] ReactiveGWM: Steering NPC in Reactive Game World Models

Python 74 8 Updated Jun 15, 2026

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Python 215 9 Updated May 30, 2026

[ICML 2026] Orienting Latent Actions for Video World Modeling

Python 108 1 Updated Apr 20, 2026

Flow Map OPD for AnyStep Video Diffusion

Python 366 8 Updated May 23, 2026

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Python 374 26 Updated May 2, 2026

[ICML2026] Official Implementation of AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in Unified Multimodal Models via Decompositional Verifiable Reward

Python 52 Updated Jun 15, 2026
Python 879 61 Updated May 18, 2026

A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models

Python 596 10 Updated Jun 15, 2026

Prompt as Code | GPT-Image2 工业级提示词引擎与模板库,470+ 个案例逆向工程,20+ 套工业级模板,并提炼出Skills,持续更新中

JavaScript 7,559 988 Updated Jun 10, 2026

Heuristic Learning Blog Post

Python 564 58 Updated May 25, 2026

Official Repo of "D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models"

Python 247 7 Updated May 22, 2026

[Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

TeX 117 5 Updated Jun 9, 2026
Next