Durgesh Durgesh92

🎯

Focusing

26 followers · 40 following

Infolabs Global
Dubai

Stars

monkira99 / edge-lipsync-model

Python 1 Updated Jun 11, 2026

HKUDS / ViMax

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 10,404 1,517 Updated Jun 13, 2026

usemoss / moss

The retrieval layer for production AI systems. Lightning-fast (<10ms) search without vector databases. Built for browser, edge, on-device, and cloud.

Python 423 50 Updated Jun 18, 2026

OfekSaar1234 / Signify-RealTime-SignLanguage-Avatar

A standalone desktop/smartTV overlay that translates system audio into 3D Sign Language animation in real-time.

TypeScript 3 Updated May 26, 2026

sentiuminc / holler

Open-source American English TTS model. 6 voices and a high-performance inference library for Apple Silicon.

Python 17 2 Updated May 20, 2026

TencentARC / Pixal3D

[SIGGRAPH 2026] Pixal3D: Pixel-Aligned 3D Generation from Images

Python 1,784 165 Updated May 24, 2026

jasonkneen / tiny-world-builder

tiny-world-builder

JavaScript 1,075 143 Updated Jun 18, 2026

microsoft / TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 12,944 1,250 Updated Nov 5, 2025

Alibaba-Quark / LiveAvatar

[ECCV 2026] Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 2,169 242 Updated Jun 18, 2026

Lynpoint / CyberVerse

Self-hosted, real-time digital human agent platform. Build voice-first AI agents with WebRTC, persona memory, tools, RAG, and optional digital-human video.

Python 1,230 171 Updated Jun 17, 2026

flashrt-project / FlashRT

FlashRT is a high-performance realtime inference engine for small-batch, latency-sensitive AI workloads. The flagship integration is production VLA control for Pi0, Pi0.5, GROOT N1.6, and Pi0-FAST.…

C++ 357 41 Updated Jun 15, 2026

SakanaAI / kame

Python 92 14 Updated May 14, 2026

Magkino / vocoloco_tts

Browser-based text-to-speech powered by OmniVoice. Runs entirely locally via WebGPU and WebAssembly.

JavaScript 12 3 Updated Apr 12, 2026

suitenumerique / meet

Open source video conferencing app powered by LiveKit. Built with Django and React.

Python 2,106 241 Updated Jun 18, 2026

aloware / livekit-plugins-dtln

Self-hosted DTLN noise suppression plugin for LiveKit Agents — no cloud API, no per-minute fees

Python 42 10 Updated Apr 16, 2026

Scicom-AI-Enterprise-Organization / Multilingual-TTS

Building a truly open-source multilingual TTS, including the dataset, covering more than 150 languages with voice cloning.

Jupyter Notebook 55 4 Updated Apr 23, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 5,193 1,134 Updated Jun 18, 2026

CodeBySonu95 / VoxSherpa-TTS

🎙️ VoxSherpa TTS Offline Neural Text-to-Speech Engine for Android ⚡ Sherpa-ONNX powered 🔊 Natural voice synthesis 📱 Fully offline processing 🚀 No cloud • No limits

Java 132 22 Updated Jun 16, 2026

pnnbao97 / VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Chuyển văn bản thành giọng nói tiếng Việt • Text to speech tiếng Việt • TTS tiếng Việt

Python 1,875 558 Updated Jun 10, 2026

FujiwaraChoki / MoneyPrinterV2

Automate the process of making money online.

Python 30,970 3,346 Updated Jun 14, 2026

herimor / voxtream

VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control

Python 241 30 Updated May 30, 2026

FelixChan9527 / LPIPS-AttnWav2Lip

This repository contains the official code for LPIPS-AttnWav2Lip. The paper has been accepted by the journal Speech Communication.

Python 13 4 Updated Jan 30, 2026

mkturkcan / DART

Detect Anything in Real Time: Real-time object detection using frontier object detection models.

Python 294 42 Updated Mar 26, 2026

ibm-granite / granite-speech-models

Jupyter Notebook 45 6 Updated Apr 28, 2026

reka-ai / vllm-reka

vLLM plugin for Reka models

Python 9 Updated Jun 15, 2026

HumeAI / tada

Open Source Speech Language Model

Jupyter Notebook 995 107 Updated May 11, 2026

paperclipai / paperclip

The open-source app everyone uses to manage agents at work

TypeScript 70,874 13,187 Updated Jun 18, 2026

FireRedTeam / FireRedVAD

A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD

Python 426 28 Updated May 6, 2026

myned-ai / avatar-chat-server

Real-time voice-to-avatar interaction server combining OpenAI Realtime API for conversational AI with an Audio to Expression model for synchronized avatar facial animation.

Python 9 Updated May 18, 2026

D4Vinci / Scrapling

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

Python 64,798 6,366 Updated Jun 17, 2026
