🎯
Focusing
  • Hangzhou
  • 04:08 (UTC +08:00)

Starred repositories

Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Python 8 Updated Nov 4, 2025

Awesome-Paper-list: Visualization meets LLM

55 2 Updated Sep 28, 2025

MotionStream: Real-Time Video Generation with Interactive Motion Controls

85 2 Updated Nov 4, 2025

This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"

Python 36 Updated Oct 30, 2025

https://little-misfit.github.io/GRAG-Image-Editing/

Python 83 1 Updated Nov 5, 2025

Mastering Atari with Discrete World Models

Python 970 206 Updated Jan 21, 2023
49 2 Updated Oct 31, 2025

Code and training scripts for FlexOlmo

Python 113 13 Updated Nov 4, 2025
Python 1,512 65 Updated Oct 28, 2025
Python 325 18 Updated May 31, 2025

Live evaluation of trading agents

Python 23 Updated Nov 4, 2025

Defeating the Training-Inference Mismatch via FP16

Python 119 10 Updated Oct 31, 2025

[arXiv 2025] Generative View Stitching

Python 69 3 Updated Nov 5, 2025

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

636 15 Updated Nov 5, 2025
Python 626 49 Updated Apr 12, 2025

OmniNWM: Omniscient Navigation World Models for Autonomous Driving

208 1 Updated Oct 30, 2025
Python 2,465 237 Updated Jul 16, 2025

Native Multimodal Models are World Learners

Python 1,131 39 Updated Nov 5, 2025

Official repo for: Epipolar Geometry Improves Video Generation Models

Python 52 4 Updated Oct 28, 2025

Official implementation of "Understanding multi-view transformers" (ICCV 2025 E2E3D Workshop)

13 Updated Aug 19, 2025

The OpenEXR project provides the specification and reference implementation of the EXR file format, the professional-grade image storage format of the motion picture industry.

C 1,744 658 Updated Nov 5, 2025

Minimal and annotated implementations of key ideas from modern deep learning research.

Python 1,200 97 Updated Sep 28, 2025

Devkit and documentation for the NVIDIA Physical AI Autonomous Vehicles Dataset

100 2 Updated Oct 28, 2025

PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning

Jupyter Notebook 216 14 Updated Jun 21, 2024

NOF0 - Open-source AI trading arena

Go 2,599 407 Updated Nov 3, 2025

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Python 375 33 Updated Oct 30, 2025

[NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting

Jupyter Notebook 61 2 Updated Jun 18, 2025

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,573 234 Updated Jun 14, 2024