MithrilMan

MithrilMan MithrilMan

Fabio Angela aka Mithril Man Full stack dev by trade and passion, too many years of exp to mention. Never stop learning!

19 followers · 0 following

Achievements

Starred repositories

49 stars written in Python

Clear filter

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,504 46,101 Updated Nov 6, 2025

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 157,866 29,300 Updated Oct 7, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 92,866 10,452 Updated Nov 6, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,320 5,733 Updated Aug 16, 2024

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,488 3,300 Updated Aug 17, 2024

roboflow / supervision

We write your reusable computer vision tools. 💜

Python 35,818 2,992 Updated Nov 5, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 26,949 5,813 Updated Sep 27, 2025

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,745 2,933 Updated Sep 2, 2024

squidfunk / mkdocs-material

Documentation that simply works

Python 25,062 3,946 Updated Nov 5, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 24,599 1,808 Updated Jul 31, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python 23,987 1,957 Updated Nov 3, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 21,565 1,851 Updated Nov 4, 2025

mkdocs / mkdocs

Project documentation with Markdown.

Python 21,245 2,555 Updated Oct 20, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,762 1,628 Updated Jul 6, 2025

meta-llama / codellama

Inference code for CodeLlama models

Python 16,358 1,935 Updated Aug 12, 2024

resemble-ai / chatterbox

SoTA open-source TTS

Python 14,428 1,942 Updated Sep 25, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 14,254 1,426 Updated May 27, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,537 1,988 Updated Nov 3, 2025

OpenTalker / SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,332 2,546 Updated Jun 26, 2024

Tencent-Hunyuan / Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,278 1,202 Updated Oct 28, 2025

sapientinc / HRM

Hierarchical Reasoning Model Official Release

Python 11,631 1,698 Updated Sep 9, 2025

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 10,905 990 Updated Nov 5, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,073 827 Updated Nov 3, 2025

Lightricks / LTX-Video

Official repository for LTX-Video

Python 8,702 797 Updated Oct 25, 2025

yangchris11 / samurai

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,988 479 Updated Mar 18, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,924 323 Updated Sep 30, 2025

VAST-AI-Research / TripoSR

TripoSR: Fast 3D Object Reconstruction from a Single Image

Python 5,858 715 Updated Aug 16, 2024

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,586 562 Updated Oct 31, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,252 454 Updated Oct 27, 2025

dnhkng / GLaDOS

This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.

Python 5,124 384 Updated Sep 19, 2025

MithrilMan MithrilMan

Starred repositories

talking-head

Deep learning

C#