-
HCMC University of Technology
- Saigon Metropolitan Area
- @anhduyle1603
- https://scholar.google.com/citations?user=VSj_iOQAAAAJ&hl=vi
Lists (32)
Sort Name ascending (A-Z)
3D-Large World Model
ASR
audio+visual generation
AWS
book
Conferences
course
diffusion
diffusion+RL
Framework
GAN
GNN
Hand written generation
LLM
Machine Learning
Math Expression Regconition
ML for production
neuroscience
NLP
Object Detection
Opensource alternative
Python
quantum computing
Reinforcement Learning
Research Tip
speedup training
SSL
Text Recognition
tools
Transformer
vision-language-learning
Vision Transformer
Starred repositories
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
A list of works on video generation towards world model
Study resources for learning quantum computing
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Mathematical principles and theoretical discussions of diffusion models
text window manager, shell multiplexer, integrated DevOps environment
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
Streamlit Component to quickly create Interactive Flow Diagrams using React Flow
[DEIMv2] Real Time Object Detection Meets DINOv3
Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.
An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data
[ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.
Awesome Unified Multimodal Models
🚀 An awesome list of curated Nano Banana pro prompts and examples. Your go-to resource for mastering prompt engineering and exploring the creative potential of the Nano banana pro(Nano banana 2) AI…
A pipeline parallel training script for diffusion models.
A Reproduction of GDM's Nested Learning Paper
Agentic AI system to solve Kaggle Competitions
Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling [ICCV 2025] Official PyTorch implementation
OCR model that handles complex tables, forms, handwriting with full layout.
Fast and differentiable MS-SSIM and SSIM for pytorch.
YOLO-UniOW: Efficient Universal Open-World Object Detection
CommonForms — open models to auto-detect PDF form fields