Starred repositories
A latent text-to-image diffusion model
Google Research
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
StableLM: Stability AI Language Models
This repository contains the source code for the paper First Order Motion Model for Image Animation
This repository contains implementations and illustrative code to accompany DeepMind publications
This repository contains demos I made with the Transformers library by HuggingFace.
LAVIS - A One-stop Library for Language-Vision Intelligence
Code release for NeRF (Neural Radiance Fields)
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…
A unified framework for 3D content generation.
Official Code for Stable Cascade
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
A scikit-learn compatible neural network library that wraps PyTorch
[ICCV 2019] Monocular depth estimation from a single image
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Debugging, monitoring and visualization for Python Machine Learning and Data Science
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Massively parallel rigidbody physics simulation on accelerator hardware.
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022