Highlights
- Pro
Stars
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
A high-throughput and memory-efficient inference and serving engine for LLMs
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days, follow your own pace. These videos m…
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
deep learning for image processing including classification and object-detection etc.
Fully open reproduction of DeepSeek-R1
Fast and memory-efficient exact attention
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
✨✨Latest Advances on Multimodal Large Language Models
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
Ongoing research training transformer models at scale
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide
A PyTorch implementation of the Transformer model in "Attention is All You Need".
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Solve Visual Understanding with Reinforced VLMs
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!