Skip to content
View natowi's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Organizations

@alicevision

Block or report natowi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.

Python 116 14 Updated Jan 25, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,533 319 Updated May 26, 2026

[CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference…

Python 476 39 Updated Feb 21, 2026
Python 11,589 790 Updated Feb 9, 2026

Open-Source Frontier Voice AI

Python 49,522 5,522 Updated May 6, 2026

SoTA open-source TTS

Python 25,156 3,335 Updated Jun 10, 2026

Codes for automatic point-cloud-to-BIM conversion

Python 112 25 Updated Apr 7, 2026

Cross-platform E57 file viewer to list and view stored point clouds, images and metadata.

C++ 21 2 Updated May 9, 2026

Xst Reader is an open source viewer for Microsoft Outlook’s .ost and .pst files, written entirely in C#. To download an executable of the current version, go to the releases tab.

C# 675 133 Updated Sep 11, 2023

ComfyUI wrapper for sam-3d-body

Python 315 30 Updated Jun 6, 2026

[AAAI'24] NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

Python 86 2 Updated Jan 6, 2025

TTS model capable of streaming conversational audio in realtime.

Python 1,147 98 Updated Nov 29, 2025

HyMPS will be a platform-indipendent software suite for advanced audio/video contents production.

317 18 Updated Jun 6, 2026

🐬DeepChat - A smart assistant that connects powerful AI to your personal world

TypeScript 6,032 683 Updated Jun 20, 2026

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 13,091 1,500 Updated Jun 18, 2026

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 24,396 2,056 Updated Jun 20, 2026

Local Lens is a privacy-first, AI-powered photo organizer for your PC. Sort and group photos by faces, dates, and locations—all locally, with no cloud upload. Enjoy a modern, intuitive UI and keep …

Python 126 9 Updated May 23, 2026

The Privacy First PDF Toolkit

JavaScript 13,782 1,129 Updated Jun 20, 2026

Epson Printer Configuration tool and waste ink counter resetter

Python 578 108 Updated Dec 30, 2025

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 926 70 Updated Jun 12, 2025

[AAAI 2025] GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

151 3 Updated Sep 11, 2024

3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction

Python 66 1 Updated Nov 26, 2025

🔄 [ECCV‘24] Pytorch implementation of 'Surface Reconstruction from 3D Gaussian Splatting via Local Structural Hints'

Python 123 5 Updated Jan 19, 2026

ComfyUI plugin for submitting workflows to Thinkbox Deadline for distributed rendering

Python 30 4 Updated Jun 15, 2026

BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr

Go 15,184 1,627 Updated Jun 11, 2026

[3DV 2026] ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 258 13 Updated Apr 5, 2026

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 1,127 429 Updated Jun 12, 2026

A collection of MCP servers.

89,543 11,886 Updated Jun 19, 2026

Model Context Protocol (MCP) that allows LLMs to use QGIS Desktop

Python 987 163 Updated Oct 1, 2025
Next