Stars
- All languages
- AGS Script
- ANTLR
- ASL
- Arduino
- Assembly
- Batchfile
- C
- C#
- C++
- CMake
- COBOL
- CSS
- Clojure
- CoffeeScript
- Common Lisp
- Crystal
- Cuda
- Cython
- Dart
- Dockerfile
- Eagle
- Elixir
- F*
- Fortran
- G-code
- GAP
- GDScript
- GLSL
- Go
- HTML
- Haskell
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- KiCad Layout
- Kotlin
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Mathematica
- Max
- NASL
- Nim
- Nunjucks
- Objective-C
- Objective-C++
- OpenSCAD
- PHP
- PLSQL
- Pascal
- Perl
- PowerShell
- Prolog
- Puppet
- PureScript
- Python
- QML
- R
- Red
- Rich Text Format
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Stan
- Starlark
- Svelte
- Swift
- SystemVerilog
- Tcl
- TeX
- TypeScript
- V
- VHDL
- Vala
- Verilog
- Vim Script
- Vue
- Zig
- nesC
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🔊 Text-Prompted Generative Audio Model
Google Research
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A game theoretic approach to explain the output of any machine learning model.
Python programs, usually short, of considerable difficulty, to perfect particular skills.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A High-Quality Real Time Upscaler for Anime Video
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filte…
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Data and code behind the articles and graphics at FiveThirtyEight
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
This repository contains the source code for the paper First Order Motion Model for Image Animation
A multi-voice TTS system trained with an emphasis on quality
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
High-Resolution Image Synthesis with Latent Diffusion Models
Draw pretty maps from OpenStreetMap data! Built with osmnx +matplotlib + shapely
PyTorch code and models for the DINOv2 self-supervised learning method.
Foundational Models for State-of-the-Art Speech and Text Translation
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Code release for NeRF (Neural Radiance Fields)
Official inference library for Mistral models
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Reference PyTorch implementation and models for DINOv3