Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
No fortress, purely open ground. OpenManus is Coming.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
OpenMMLab Detection Toolbox and Benchmark
Official inference repo for FLUX.1 models
Rembg is a tool to remove images background
PyTorch implementations of Generative Adversarial Networks.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
verl: Volcano Engine Reinforcement Learning for LLMs
Ongoing research training transformer models at scale
"RAG-Anything: All-in-One RAG Framework"
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
Multilingual Document Layout Parsing in a Single Vision-Language Model
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
General technology for enabling AI capabilities w/ LLMs and MLLMs
A python module to repair invalid JSON from LLMs
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"