natowi
:octocat: I may be slow to respond.

Organizations

@alicevision

Starred repositories

This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.

Python 91 7 Updated Jan 25, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,318 279 Updated Jan 5, 2026

We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6× acceleration in inference speed.

Python 443 31 Updated Jan 10, 2026

Python 10,127 655 Updated Feb 9, 2026

Open-Source Frontier Voice AI

Python 23,307 2,559 Updated Feb 7, 2026

SoTA open-source TTS

Python 22,710 2,979 Updated Feb 3, 2026

Code for automatic point-cloud-to-BIM conversion

Python 78 21 Updated Feb 14, 2026

Cross-platform E57 file viewer to list and view stored point clouds, images and metadata.

C++ 17 1 Updated Oct 29, 2025

Xst Reader is an open source viewer for Microsoft Outlook’s .ost and .pst files, written entirely in C#. To download an executable of the current version, go to the releases tab.

C# 645 114 Updated Sep 11, 2023

ComfyUI wrapper for sam-3d-body

Python 269 24 Updated Feb 17, 2026

[AAAI'24] NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

Python 86 2 Updated Jan 6, 2025

TTS model capable of streaming conversational audio in realtime.

Python 1,067 89 Updated Nov 29, 2025

HyMPS will be a platform-independent software suite for advanced audio/video content production.

287 18 Updated Feb 17, 2026

🐬DeepChat - A smart assistant that connects powerful AI to your personal world

TypeScript 5,505 632 Updated Feb 17, 2026

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 10,370 1,166 Updated Feb 16, 2026

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 15,310 1,056 Updated Feb 17, 2026

Local Lens is a privacy-first, AI-powered photo organizer for your PC. Sort and group photos by faces, dates, and locations—all locally, with no cloud upload. Enjoy a modern, intuitive UI and keep …

CSS 96 3 Updated Dec 31, 2025

A Privacy First PDF Toolkit

JavaScript 11,472 894 Updated Feb 17, 2026

Epson Printer Configuration tool and waste ink counter resetter

Python 472 80 Updated Dec 30, 2025

[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation

Python 880 66 Updated Jun 12, 2025

[AAAI 2025] GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

148 3 Updated Sep 11, 2024

3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction

Python 57 Updated Nov 26, 2025

🔄 [ECCV'24] PyTorch implementation of 'Surface Reconstruction from 3D Gaussian Splatting via Local Structural Hints'

Python 119 5 Updated Jan 19, 2026

ComfyUI plugin for submitting workflows to Thinkbox Deadline for distributed rendering

Python 25 3 Updated Jan 14, 2026

BillionMail gives you open-source MailServer, NewsLetter, Email Marketing — fully self-hosted, dev-friendly, and free from monthly fees. Join the discord: https://discord.gg/asfXzBUhZr

Go 13,524 1,376 Updated Dec 30, 2025

[3DV 2026] ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

Python 225 9 Updated Feb 7, 2026

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 972 365 Updated Jan 23, 2026

A collection of MCP servers.

80,986 7,279 Updated Feb 17, 2026

Model Context Protocol (MCP) that allows LLMs to use QGIS Desktop

Python 785 116 Updated Oct 1, 2025