[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,199 945 Updated Aug 12, 2024

Official inference repo for FLUX.1 models

Python 24,598 1,808 Updated Jul 31, 2025

"RAG-Anything: All-in-One RAG Framework"

Python 9,979 1,181 Updated Oct 20, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,583 562 Updated Oct 31, 2025

A Python module to repair invalid JSON from LLMs

Python 3,857 151 Updated Nov 1, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 50,669 8,847 Updated Nov 3, 2025

Python 8,128 571 Updated Nov 5, 2025

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,096 526 Updated May 9, 2024

A curated list of resources dedicated to table recognition

404 51 Updated Dec 12, 2024

Rembg is a tool to remove image backgrounds

Python 20,942 2,161 Updated Oct 25, 2025

A project for automating tasks on mobile phones

Python 1 Updated Sep 18, 2025

A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.

Python 2,793 571 Updated Nov 5, 2025

Backend service for xiaozhi-esp32 that helps you quickly set up an ESP32 device control server.

Python 1 1 Updated Sep 15, 2025

A lightweight LMM-based Document Parsing Model

Python 1 Updated Jun 26, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,747 133 Updated Apr 14, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,145 3,974 Updated Nov 4, 2025

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Python 22,297 2,317 Updated Apr 29, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,064 5,704 Updated Sep 10, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 33,072 4,129 Updated Aug 6, 2024

AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing output files at the original resolution. Runs locally, with no third-party API required.

Python 8,399 1,044 Updated Jun 26, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,165 342 Updated Jun 30, 2025

llm & rl

Jupyter Notebook 242 19 Updated Oct 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,134 2,428 Updated Nov 5, 2025

PyTorch implementations of Generative Adversarial Networks.

Python 17,316 4,096 Updated Jun 18, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,854 7,480 Updated Nov 5, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,375 1,112 Updated Nov 5, 2025

A collection of classic e-books on history, politics, psychology, philosophy, mathematics, and computer science (about 100,000 titles)

JavaScript 6,442 857 Updated Sep 28, 2023

✨✨Latest Advances on Multimodal Large Language Models

16,626 1,072 Updated Nov 4, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,502 46,096 Updated Nov 5, 2025

Ultralytics YOLO 🚀

Python 48,299 9,311 Updated Nov 5, 2025