Highlights
- Pro
Stars
Ongoing research training transformer models at scale
A high-throughput and memory-efficient inference and serving engine for LLMs
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Fast and memory-efficient exact attention
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
Awesome weakly-supervised image semantic segmentation;scribble,bounding box, point, image tag, and heterogeneous of them. 2016-2025
Agent0 Series: Self-Evolving Agents from Zero Data
✨✨Latest Advances on Multimodal Large Language Models
OpenMMLab Foundational Library for Training Deep Learning Models
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Fully open reproduction of DeepSeek-R1
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
A comprehensive and up-to-date compilation of datasets, tools, methods, review papers, and competitions for remote sensing change detection.
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days, follow your own pace. These videos m…
A Change Detection Repo Standing on the Shoulders of Giants
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择ChatGPT/Claude/DeepSeek/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
NEO Series: Native Vision-Language Models from First Principles