Skip to content
View trqminh's full-sized avatar

Highlights

  • Pro

Organizations

@aioz-ai

Block or report trqminh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

1,507 61 Updated Dec 18, 2025

Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…

JavaScript 8,163 832 Updated Sep 8, 2025
Python 13 2 Updated Dec 4, 2025
Lua 1 Updated Dec 15, 2025

Pytorch implementation for MeanFlow

Jupyter Notebook 272 23 Updated Jul 30, 2025

[CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform

Python 65 1 Updated Jun 11, 2025
Python 9 1 Updated May 24, 2025

A simple UI for SAM 2. Give an input path to a directory of video frames, and the script will let you look through the frames, plot points, then feed the points into SAM 2.

Python 5 Updated Sep 9, 2024
Python 20 3 Updated Nov 11, 2024

Generate videos that interpolate between two given images

Python 102 9 Updated Aug 9, 2023

A summary of related works about flow matching, stochastic interpolants

615 18 Updated Mar 25, 2025

[ISBI 2024] An implementation of SAM3D which adapts Segment Anything Model for Volumetric Medical Image Segmentation

Python 82 11 Updated May 28, 2024

[WACV 2024] Decoding Radiologists’ Intense Focus for Accurate CXR Diagnoses: A Controllable & Interpretable AI System

Python 7 Updated Oct 7, 2024

[Remote Sensing] AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

Python 67 7 Updated Apr 23, 2024

[ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

Python 141 12 Updated Aug 19, 2024

Refine high-quality datasets and visual AI models

Python 10,163 693 Updated Dec 21, 2025

[CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".

Python 60 3 Updated Mar 4, 2023

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python 929 53 Updated Jul 6, 2024

CEDNet: A Cascade Encoder-Decoder Network for Dense Prediction (Pattern Recognition 2024)

Python 134 3 Updated Dec 17, 2024
Python 2 Updated Sep 15, 2020

AIOZ-GDANCE: a large-scale dataset & baseline for music-driven group dance generation. (CVPR 2023)

Python 96 20 Updated Nov 25, 2025

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 20 14 Updated May 7, 2022

An example of the DINO detector using C++ and the Libtorch library

C++ 1 Updated Dec 28, 2022

[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding

4 1 Updated Dec 7, 2022

[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

Jupyter Notebook 68 6 Updated Feb 16, 2024

Tensors, for human consumption

Jupyter Notebook 1,337 22 Updated Nov 17, 2025

Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.

Python 47 14 Updated Apr 6, 2021
Next