Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 23,128 2,597 Updated Mar 3, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

Jupyter Notebook 18,789 1,701 Updated Jan 30, 2026

mml-book / mml-book.github.io

Companion webpage to the book "Mathematics For Machine Learning"

Jupyter Notebook 15,264 2,743 Updated Mar 13, 2025

WongKinYiu / yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 14,120 4,392 Updated Aug 19, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,599 1,198 Updated Mar 12, 2026

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,194 1,100 Updated Nov 18, 2024

microsoft / computervision-recipes

Best Practices, code samples, and documentation for Computer Vision.

Jupyter Notebook 9,836 1,206 Updated Feb 16, 2024

DmitryUlyanov / deep-image-prior

Image restoration with neural networks but without learning.

Jupyter Notebook 8,075 1,447 Updated Apr 27, 2023

open-mmlab / mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 7,409 1,100 Updated Aug 6, 2024

facebookresearch / DensePose

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

Jupyter Notebook 7,230 1,323 Updated Jan 18, 2023

probml / pyprobml

Python code for the "Probabilistic Machine Learning" book by Kevin Murphy

Jupyter Notebook 7,045 1,618 Updated Feb 26, 2026

cocodataset / cocoapi

COCO API - Dataset @ http://cocodataset.org/

Jupyter Notebook 6,366 3,754 Updated Apr 17, 2024

tensorflow / swift

Swift for TensorFlow

Jupyter Notebook 6,143 611 Updated Jan 12, 2022

snakers4 / silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,839 360 Updated Mar 27, 2026

ChaoningZhang / MobileSAM

This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,671 569 Updated Dec 19, 2025

pkmital / tensorflow_tutorials

From the basics to slightly more interesting applications of TensorFlow

Jupyter Notebook 5,664 1,165 Updated Dec 11, 2021

deep-learning-with-pytorch / dlwpt-code

Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.

Jupyter Notebook 5,204 2,155 Updated Jul 25, 2024

balancap / SSD-Tensorflow

Single Shot MultiBox Detector in TensorFlow

Jupyter Notebook 4,106 1,856 Updated Aug 12, 2021

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and of performing real-time speech generation.

Jupyter Notebook 3,966 323 Updated Jun 12, 2025

z-x-yang / Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,119 357 Updated Mar 13, 2026

kwea123 / nerf_pl

NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning

Jupyter Notebook 2,807 466 Updated Aug 3, 2023

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,801 251 Updated Dec 12, 2023

rpautrat / SuperPoint

Efficient neural feature detector and descriptor

Jupyter Notebook 2,404 468 Updated May 5, 2025

dscripka / openWakeWord

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Jupyter Notebook 2,029 248 Updated Dec 30, 2025

tinghuiz / SfMLearner

An unsupervised learning framework for depth and ego-motion estimation from monocular videos

Jupyter Notebook 2,015 555 Updated Oct 26, 2021

edyoda / data-science-complete-tutorial

For extensive instructor-led learning

Jupyter Notebook 1,820 767 Updated Oct 31, 2022

Kai Sun GPTAlgoPro

Starred repositories

GokuMohandas / Made-With-ML

google-research / google-research

datawhalechina / self-llm

fastai / fastai

facebookresearch / audiocraft