Stars
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
This is a personal learning repository for the book Hands-On Generative AI with Transformers and Diffusion Models. Here you'll find hands-on projects, solutions, and experiments with Generative AI…
Deep Attentional Guided Image Filtering, Winner solution for ICMR 2021 Real DSR Challenge (IEEE TNNLS 2023)
Visualizer for neural network, deep learning and machine learning models
An open source iOS framework for GPU-based image and video processing
UyaliBeautyFaceSDK supports beauty filters, LUTs (Look-Up Tables), facial adjustments such as enlarging eyes and slimming faces, and millisecond-level facial recognition and tracking.UyaliBeautyFac…
MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills [SIGGRAPH 2025]
Official inference repo for FLUX.1 models
collection of diffusion model papers categorized by their subareas
AAAI2026 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning
Implementation of Attention-based Deep Multiple Instance Learning in PyTorch
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
A library for efficient similarity search and clustering of dense vectors.
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
"Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis" (ECCV 2024)
(ICCV 2025) GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
The official Open-Asset-Importer-Library Repository. Loads 40+ 3D-file-formats into one unified and clean data structure.