BUPT
Stars
A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress
Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory is a …
A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes…
Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Faster Whisper transcription with CTranslate2
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
The headless rich text editor framework for web artisans.
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
The mouse and trackpad utility for Mac.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A latent text-to-image diffusion model
High-Resolution Image Synthesis with Latent Diffusion Models
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A cross-platform protocol library to communicate with iOS devices
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Probably the fastest coroutine lib in the world!
Espressif IoT Development Framework. Official development framework for Espressif SoCs.