Stars
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Models and examples built with TensorFlow
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
A library for efficient similarity search and clustering of dense vectors.
Official Code for DragGAN (SIGGRAPH 2023)
程序员延寿指南 | A programmer's guide to live longer
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
OpenMMLab Detection Toolbox and Benchmark
📄 Awesome CV is LaTeX template for your outstanding job application
Official inference repo for FLUX.1 models
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Image-to-Image Translation in PyTorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Fast and memory-efficient exact attention
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
sketch + style = paints 🎨 (TOG2018/SIGGRAPH2018ASIA)
"🐈 nanobot: The Ultra-Lightweight OpenClaw"
Build and run Docker containers leveraging NVIDIA GPUs
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
✨✨Latest Advances on Multimodal Large Language Models
NVIDIA Linux open GPU kernel module source
Lets make video diffusion practical!
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".