Lists (18)
About Transformer & LLM
AI AGENT
Audio LLM
Avatar (digital human)
Document intelligence
Graph vis
to visualize graphs in the frontend
Image edit
image/video gen
Invoice Gen
Language learning assistant
LLM Reasoning
Low code
N2SQL/Data Analytics/Tabular
Non-LLM
Object detection/Computer Vision
OCR
python runtime
RAG
Including RAG, GraphRAG, function calling
Stars
The Cursor for Designers • An Open-Source AI-First Design tool • Visually build, style, and edit your React App with AI
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
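Bilibili video parsing and download tool. Supports 8K video, Hi-Res audio, Dolby Vision downloads, and batch parsing, with QR-code login and a persistent system tray icon.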
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Highly accurate text detection (OCR) JavaScript/TypeScript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
[VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Unleash Next-Level AI! 🚀 💻 Code Generation: DeepSeek r1 + Claude 3.7 Sonnet - Unparalleled Performance! 📝 Content Creation: DeepSeek r1 + Gemini 2.5 Pro - Superior Quality! 🔌 OpenAI-Compatible. 🌊 S…
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefN…
Rembg is a tool to remove image backgrounds
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
A tool to write JS libraries using AI. The first and only tool that uses ASTs to perform surgical changes to existing code files without mangling the code. https://aicoderproject.com/
A series of technical reports on Slow Thinking with LLM
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching