Starred repositories
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Examples and guides for using the OpenAI API
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Google Research
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A guidance language for controlling large language models.
Neural Networks: Zero to Hero
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)
QLoRA: Efficient Finetuning of Quantized LLMs
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
A collection of pre-trained, state-of-the-art models in the ONNX format
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Build your neural network easy and fast, 莫烦Python中文教学
A series of large language models trained from scratch by developers @01-ai
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)
Tutorials for creating and using ONNX models
This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, suc…
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[DEIMv2] Real Time Object Detection Meets DINOv3
《深度学习入门-基于Python的理论与实现》,包含源代码和高清PDF(带书签);慕课网imooc《深度学习之神经网络(CNN-RNN-GAN)算法原理-实战》;《菜菜的机器学习sklearn》
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
Official implementation of the paper "Watermark Anything with Localized Messages"
PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"