Lists (1)
Sort Name ascending (A-Z)
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
High-Resolution Image Synthesis with Latent Diffusion Models
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
🔊 Text-Prompted Generative Audio Model
Official Code for DragGAN (SIGGRAPH 2023)
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
The swiss army knife of lossless video/audio editing
OpenMMLab Detection Toolbox and Benchmark
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Data Apps & Dashboards for Python. No JavaScript Required.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Rembg is a tool to remove images background
State-of-the-Art Text Embeddings
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Instant neural graphics primitives: lightning fast NeRF and more
End-to-End Object Detection with Transformers
An open source implementation of CLIP.
The official Open-Asset-Importer-Library Repository. Loads 40+ 3D-file-formats into one unified and clean data structure.
Generate 3D objects conditioned on text or images
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
PyTorch code and models for the DINOv2 self-supervised learning method.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
A collaboration friendly studio for NeRFs