natowi

I may be slow to respond.

natowi

I may be slow to respond.

158 followers · 11 following

14:02 (UTC +02:00)

Achievements

x2 x3 x2

Achievements

x2 x3 x2

Organizations

Lists (7)

Sort

Starred repositories

okdalto / ComfyUI-PersonaLive

This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.

Python 116 14 Updated Jan 25, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,533 319 Updated May 26, 2026

Francis-Rings / FlashPortrait

[CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference…

Python 476 39 Updated Feb 21, 2026

Tongyi-MAI / Z-Image

Python 11,589 790 Updated Feb 9, 2026

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 49,522 5,522 Updated May 6, 2026

resemble-ai / chatterbox

SoTA open-source TTS

Python 25,156 3,335 Updated Jun 10, 2026

VaclavNezerka / Cloud2BIM

Codes for automatic point-cloud-to-BIM conversion

Python 112 25 Updated Apr 7, 2026

sisakat / e57inspector

Cross-platform E57 file viewer to list and view stored point clouds, images and metadata.

C++ 21 2 Updated May 9, 2026

Dijji / XstReader

Xst Reader is an open source viewer for Microsoft Outlook’s .ost and .pst files, written entirely in C#. To download an executable of the current version, go to the releases tab.

C# 675 133 Updated Sep 11, 2023

PozzettiAndrea / ComfyUI-SAM3DBody

ComfyUI wrapper for sam-3d-body

Python 315 30 Updated Jun 6, 2026

yulunwu0108 / NeuSurf

[AAAI'24] NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

Python 86 2 Updated Jan 6, 2025

nari-labs / dia2

TTS model capable of streaming conversational audio in realtime.

Python 1,147 98 Updated Nov 29, 2025

FORARTfe / HyMPS

HyMPS will be a platform-indipendent software suite for advanced audio/video contents production.

317 18 Updated Jun 6, 2026

ThinkInAIXYZ / deepchat

🐬DeepChat - A smart assistant that connects powerful AI to your personal world

TypeScript 6,032 683 Updated Jun 20, 2026

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 13,091 1,500 Updated Jun 18, 2026

cjpais / Handy

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 24,396 2,056 Updated Jun 20, 2026

ashesbloom / LocalLens

Local Lens is a privacy-first, AI-powered photo organizer for your PC. Sort and group photos by faces, dates, and locations—all locally, with no cloud upload. Enjoy a modern, intuitive UI and keep …

Python 126 9 Updated May 23, 2026

alam00000 / bentopdf

The Privacy First PDF Toolkit

JavaScript 13,782 1,129 Updated Jun 20, 2026

Ircama / epson_print_conf

Epson Printer Configuration tool and waste ink counter resetter

Python 578 108 Updated Dec 30, 2025

JanuszBedkowski / mandeye_controller

C++ 111 15 Updated Jun 15, 2026

VAST-AI-Research / MIDI-3D

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 926 70 Updated Jun 12, 2025

Open3DVLab / GigaGS

[AAAI 2025] GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

151 3 Updated Sep 11, 2024

theialab / 3dgs-flats

3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction

Python 66 1 Updated Nov 26, 2025

QianyiWu / GSRec

🔄 [ECCV‘24] Pytorch implementation of 'Surface Reconstruction from 3D Gaussian Splatting via Local Structural Hints'

Python 123 5 Updated Jan 19, 2026

doubletwisted / ComfyUI-Deadline-Plugin

ComfyUI plugin for submitting workflows to Thinkbox Deadline for distributed rendering

Python 30 4 Updated Jun 15, 2026

Billionmail / BillionMail

BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr

Go 15,184 1,627 Updated Jun 11, 2026

zhangganlin / vista-slam

[3DV 2026] ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 258 13 Updated Apr 5, 2026

vibevoice-community / VibeVoice

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

natowi

Organizations

Lists (7)

CameraCalibration

🔮 Future ideas

Meshroom

ML

NERF

RTI

sfm

Starred repositories

ocr-android

keypoint-detection

speech-enhancement

speech-synthesis

voice-synthesis

feature-detection

camera-model

noise-cancellation

structured-light

texture-mapping