-
IMT Solutions
- TP HCM
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
The Web framework for perfectionists with deadlines.
The Python micro framework for building web applications.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
Scrapy, a fast high-level web crawling & scraping framework for Python.
all of the workflows of n8n i could find (also from the site itself)
We write your reusable computer vision tools. 💜
Instant voice cloning by MIT and MyShell. Audio foundation model.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Generative Models by Stability AI
Official inference repo for FLUX.1 models
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Universal LLM Deployment Engine with ML Compilation
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
Rembg is a tool to remove images background
🕵️♂️ Collect a dossier on a person by username from thousands of sites
Automate browser based workflows with AI
DALL·E Mini - Generate images from a text prompt
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
A Terminal Client for MySQL with AutoCompletion and Syntax Highlighting.