Starred repositories
Examples and guides for using the OpenAI API
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🔊 Text-Prompted Generative Audio Model
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
A simple screen-parsing tool toward a pure vision-based GUI agent
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Examples and guides for using the Gemini API
StableLM: Stability AI Language Models
This repository contains the source code for the paper First Order Motion Model for Image Animation
A multi-voice TTS system trained with an emphasis on quality
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Foundational Models for State-of-the-Art Speech and Text Translation
AirLLM: 70B inference with a single 4GB GPU
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Reference PyTorch implementation and models for DINOv3
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Silero Models: pre-trained text-to-speech models made embarrassingly simple
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
CoTracker is a model for tracking any point (pixel) on a video.
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
[ICCV 2019] Monocular depth estimation from a single image
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer