Stars
Storybook is the industry standard workshop for building, documenting, and testing UI components in isolation
Convert the model in PaddleOCR to ONNX format
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images int…
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, a…
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
PALLAIDIUM — a generative AI movie studio, seamlessly integrated into the Blender Video Editor (VSE), enabling end-to-end production from script to screen and back.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
Automatically Edits Videos and Uploads to Tiktok with CLI, Requests not Selenium.
A Powerful and All-in-One MQTT 5.0 client toolbox for Desktop, CLI and WebSocket.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
anhlbt / vanna
Forked from vanna-ai/vanna🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Discover Healthsearch: Unlocking Health with Semantic Search ✨
General-purpose AI designed for knowledge workers — creators, strategists, and operators — and individuals seeking AI systems they can truly control to help them get work done, with full flexibilit…
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
Draw a mockup and generate html for it
State-of-the-Art Embeddings, Retrieval, and Reranking
Apache Superset is a Data Visualization and Data Exploration Platform
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
INTERSPEECH 2019 Tutorial Materials
Material for the tutorial: "Deep Diving into GANs: from theory to production"
A list of papers on Generative Adversarial (Neural) Networks
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.