Lists (15)
Sort Name ascending (A-Z)
Starred repositories
9
stars
written in Jupyter Notebook
Clear filter
🔊 Text-Prompted Generative Audio Model
A simple screen parsing tool towards pure vision based GUI agent
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.