Lists (14)
Sort Name ascending (A-Z)
Stars
A multi-platform proxy client based on ClashMeta,simple and easy to use, open-source and ad-free.
Retrieval and Retrieval-augmented LLMs
Nightly release of ControlNet 1.1
[ICCV2025] PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask
Code repository for Part Grouping Network, ECCV 2018
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)
MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
Textin xParse Web 端集成 - React
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
A curated list of the most impressive AI papers
🐧 一个更完整、更优雅的 Linux Clash / Mihomo 代理运行平台
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
Run Generative AI models with simple C++/Python API and using OpenVINO Runtime
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
Detect and recognize the faces from camera / 调用摄像头进行人脸识别,支持多张人脸同时识别
Honor of Kings AI Open Environment of Tencent
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …