Stars
- All languages
- Assembly
- Astro
- Bikeshed
- C
- C#
- C++
- CMake
- CSS
- Clojure
- CodeQL
- CoffeeScript
- Crystal
- Cuda
- Dart
- Dockerfile
- EJS
- Elixir
- Erlang
- GLSL
- Go
- Groovy
- HTML
- Haskell
- Haxe
- JSON
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- Makefile
- Markdown
- Meson
- Mojo
- Mustache
- Nim
- Nunjucks
- Nushell
- OCaml
- Objective-C
- Objective-C++
- Objective-J
- PHP
- PLpgSQL
- Perl
- Prolog
- Python
- QML
- Raku
- Roff
- Ruby
- Rust
- SAS
- SCSS
- SVG
- Scala
- Shell
- Solidity
- Starlark
- Svelte
- Swift
- SystemVerilog
- TeX
- TypeScript
- Vim Script
- Vue
- WebAssembly
- Zig
A latent text-to-image diffusion model
Examples and guides for using the OpenAI API
12 Lessons to Get Started Building AI Agents
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🔊 Text-Prompted Generative Audio Model
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Google Research
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
The fastai book, published as Jupyter Notebooks
A simple screen parsing tool towards pure vision based GUI agent
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A guidance language for controlling large language models.
Anthropic's educational courses
Instruct-tune LLaMA on consumer hardware
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Data and code behind the articles and graphics at FiveThirtyEight
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
StableLM: Stability AI Language Models
This repository contains the source code for the paper First Order Motion Model for Image Animation
A multi-voice TTS system trained with an emphasis on quality
FinRL®: Financial Reinforcement Learning. 🔥
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
High-Resolution Image Synthesis with Latent Diffusion Models
Draw pretty maps from OpenStreetMap data! Built with osmnx +matplotlib + shapely
Foundational Models for State-of-the-Art Speech and Text Translation