natowi

I may be slow to respond.

160 followers · 11 following

07:42 (UTC +02:00)

Achievements

x2 x3 x2

Organizations

Starred repositories

okdalto / ComfyUI-PersonaLive

This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.

Python 120 16 Updated Jan 25, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,584 324 Updated May 26, 2026

Francis-Rings / FlashPortrait

[CVPR 2026] We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6× acceleration in inference…

Python 480 38 Updated Feb 21, 2026

Tongyi-MAI / Z-Image

Python 11,796 808 Updated Feb 9, 2026

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 50,993 5,689 Updated Jul 24, 2026

resemble-ai / chatterbox

SoTA open-source TTS

Python 25,745 3,428 Updated Jul 21, 2026

VaclavNezerka / Cloud2BIM

Codes for automatic point-cloud-to-BIM conversion

Python 122 30 Updated Apr 7, 2026

sisakat / e57inspector

Cross-platform E57 file viewer to list and view stored point clouds, images and metadata.

C++ 21 2 Updated Jul 11, 2026

Dijji / XstReader

Xst Reader is an open source viewer for Microsoft Outlook’s .ost and .pst files, written entirely in C#. To download an executable of the current version, go to the releases tab.

C# 680 134 Updated Sep 11, 2023

PozzettiAndrea / ComfyUI-SAM3DBody

ComfyUI wrapper for sam-3d-body

Python 322 31 Updated Jun 6, 2026

yulunwu0108 / NeuSurf

[AAAI'24] NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

Python 86 2 Updated Jan 6, 2025

nari-labs / dia2

TTS model capable of streaming conversational audio in realtime.

Python 1,160 99 Updated Nov 29, 2025

FORARTfe / HyMPS

HyMPS will be a platform-independent software suite for advanced audio/video content production.

323 20 Updated Jul 14, 2026

ThinkInAIXYZ / deepchat

🐬DeepChat - A smart assistant that connects powerful AI to your personal world

TypeScript 6,171 709 Updated Jul 29, 2026

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 13,842 1,596 Updated Jul 29, 2026

cjpais / Handy

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 27,785 2,414 Updated Jul 28, 2026

ashesbloom / LocalLens

Local Lens is a privacy-first, AI-powered photo organizer for your PC. Sort and group photos by faces, dates, and locations—all locally, with no cloud upload. Enjoy a modern, intuitive UI and keep …

Python 139 12 Updated Jul 24, 2026

alam00000 / bentopdf

The Privacy First PDF Toolkit

JavaScript 14,434 1,197 Updated Jul 27, 2026

Ircama / epson_print_conf

Epson Printer Configuration tool and waste ink counter resetter

Python 601 117 Updated Dec 30, 2025

JanuszBedkowski / mandeye_controller

C++ 114 15 Updated Jul 25, 2026

VAST-AI-Research / MIDI-3D

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 937 71 Updated Jun 12, 2025

Open3DVLab / GigaGS

[AAAI 2025] GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

151 3 Updated Sep 11, 2024

theialab / 3dgs-flats

3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction

Python 65 1 Updated Nov 26, 2025

QianyiWu / GSRec

🔄 [ECCV'24] PyTorch implementation of 'Surface Reconstruction from 3D Gaussian Splatting via Local Structural Hints'

Python 124 5 Updated Jan 19, 2026

doubletwisted / ComfyUI-Deadline-Plugin

ComfyUI plugin for submitting workflows to Thinkbox Deadline for distributed rendering

Python 34 4 Updated Jun 25, 2026

Billionmail / BillionMail

BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr

Go 15,370 1,665 Updated Jun 11, 2026

zhangganlin / vista-slam

[3DV 2026] ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 269 13 Updated Apr 5, 2026

vibevoice-community / VibeVoice

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Lists (7)

CameraCalibration

🔮 Future ideas

Meshroom

ML

NERF

RTI

sfm

Starred repositories

ocr-android

keypoint-detection

speech-enhancement

speech-synthesis

voice-synthesis

feature-detection

camera-model

noise-cancellation

structured-light

texture-mapping