Stars
Examples and guides for using the OpenAI API
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
12 Lessons to Get Started Building AI Agents
Google Research
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Anthropic's Interactive Prompt Engineering Tutorial
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Anthropic's educational courses
Natural Language Processing Tutorial for Deep Learning Researchers
Companion webpage to the book "Mathematics For Machine Learning"
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
PyTorch code and models for the DINOv2 self-supervised learning method.
Foundational Models for State-of-the-Art Speech and Text Translation
This repository contains demos I made with the Transformers library by HuggingFace.
LAVIS - A One-stop Library for Language-Vision Intelligence
Official inference library for Mistral models
A collection of pre-trained, state-of-the-art models in the ONNX format
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Lab Materials for MIT 6.S191: Introduction to Deep Learning
AirLLM 70B inference with single 4GB GPU
This project reproduces the book Dive Into Deep Learning (https://d2l.ai/), adapting the code from MXNet into PyTorch.
Documentation for Google's Gen AI site - including the Gemini API and Gemma
Machine Learning Course, Sharif University of Technology
Collection of notebook guides created by the Brev.dev team!
Text To Video Synthesis Colab