- Italy
- @MithrilMan
- @FabioAngela79
Starred repositories
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
We write your reusable computer vision tools. 💜
State-of-the-art 2D and 3D Face Analysis Project
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Official inference repo for FLUX.1 models
Universal LLM Deployment Engine with ML Compilation
A TTS model capable of generating ultra-realistic dialogue in one pass.
A Conversational Speech Generation Model
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Hierarchical Reasoning Model Official Release
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
TripoSR: Fast 3D Object Reconstruction from a Single Image
Multilingual Document Layout Parsing in a Single Vision-Language Model
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.