Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
High-Resolution Image Synthesis with Latent Diffusion Models
Official Code for DragGAN (SIGGRAPH 2023)
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
OpenMMLab Detection Toolbox and Benchmark
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Data Apps & Dashboards for Python. No JavaScript Required.
Rembg is a tool to remove image backgrounds
State-of-the-Art Text Embeddings
An open source implementation of CLIP.
Generate 3D objects conditioned on text or images
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
A collaboration-friendly studio for NeRFs
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Pythonic AI generation of images and videos
Stable Diffusion built into Blender
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
The ultimate training toolkit for finetuning diffusion models