Stars
Python - 100天从新手到大师
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
A latent text-to-image diffusion model
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
Learn OpenCV : C++ and Python Examples
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
This repository contains demos I made with the Transformers library by HuggingFace.
My blogs and code for machine learning. http://cnblogs.com/pinard
Inpaint anything using Segment Anything and inpainting models.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
YOLOv6: a single-stage object detection framework dedicated to industrial applications.
标注自己的数据集,训练、评估、测试、部署自己的人工智能算法
Debugging, monitoring and visualization for Python Machine Learning and Data Science
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
汉语自然语言处理视频教程-开源学习资料
🙄 Difficult algorithm, Simple code.
Implementation for <SphereFace: Deep Hypersphere Embedding for Face Recognition> in CVPR'17.
Data Augmentation For Object Detection
KITTI Object Visualization (Birdview, Volumetric LiDar point cloud )
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
CVPR 2022 HFGI: High-Fidelity GAN Inversion for Image Attribute Editing
Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA)