- Italy
- @MithrilMan
- @FabioAngela79
Starred repositories
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Generative AI extensions for onnxruntime
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Krita is a free and open source cross-platform application that offers an end-to-end solution for creating digital art files from scratch built on the KDE and Qt frameworks.
Humanizer meets all your .NET needs for manipulating and displaying strings, enums, dates, times, timespans, numbers and quantities
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
Integrate cutting-edge LLM technology quickly and easily into your apps
.NET news, announcements, release notes, and more!
We write your reusable computer vision tools. 💜
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Universal LLM Deployment Engine with ML Compilation
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.
Whisper.net. Speech to text made simple using Whisper Models
Multilingual Document Layout Parsing in a Single Vision-Language Model