-
PREP
- Ha Noi, Viet Nam
-
17:59
(UTC +07:00) - in/dangvansam
- https://fb.com/sam.rngd
-
-
Turn detector plugin for LiveKit Agents with Namo model
conversational-agents conversational-ai voice-activity-detection videosdk livekit turn-detection turn-detectorPython Apache License 2.0 UpdatedDec 15, 2025 -
-
-
livekit-plugins-tenvad Public
LiveKit plugin for TEN VAD: low-latency voice activity detection for real-time streaming, integrated with livekit-agents
-
-
R2R Public
Forked from SciPhi-AI/R2RSoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
Python MIT License UpdatedNov 7, 2025 -
-
Turn detector plugin for LiveKit Agents with external model (Triton, OpenAI)
-
-
dots.ocr Public
Forked from rednote-hilab/dots.ocrMultilingual Document Layout Parsing in a Single Vision-Language Model
-
agents Public
Forked from livekit/agentsA powerful framework for building realtime voice AI agents 🤖🎙️📹
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
C++ Apache License 2.0 UpdatedAug 28, 2025 -
-
wenet Public
Forked from wenet-e2e/wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Python Apache License 2.0 UpdatedJul 10, 2025 -
Deep-Live-Cam Public
Forked from hacksider/Deep-Live-Camreal time face swap and one-click video deepfake with only a single image
-
screenshot-to-code Public
Forked from abi/screenshot-to-codeDrop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
-
whisperY Public
Forked from m-bain/whisperXWhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
-
-
Orpheus-TTS Public
Forked from canopyai/Orpheus-TTSTowards Human-Sounding Speech
Python Apache License 2.0 UpdatedMay 6, 2025 -
-
chatbot Public
Forked from langgenius/difyDify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
TypeScript UpdatedApr 26, 2025 -
Spark-TTS Public
Forked from SparkAudio/Spark-TTSSpark-TTS Inference Code
Python Apache License 2.0 UpdatedApr 9, 2025 -
translate Public
Forked from sign/translateEffortless Real-Time Sign Language Translation
-
RWKVTTS Public
Forked from yynil/RWKVTTSThis project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).
Python Apache License 2.0 UpdatedMar 23, 2025 -
F5-TTS Public
Forked from SWivid/F5-TTSOfficial code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Python MIT License UpdatedFeb 9, 2025 -
fish-speech Public
Forked from fishaudio/fish-speechBrand new TTS solution
Python UpdatedJan 17, 2025 -
ultravox Public
Forked from fixie-ai/ultravoxA fast multimodal LLM for real-time voice
Python MIT License UpdatedJan 17, 2025 -
ichigo Public
Forked from janhq/ichigoLocal realtime voice AI
Python Apache License 2.0 UpdatedJan 17, 2025 -
SenseVoice Public
Forked from FunAudioLLM/SenseVoiceMultilingual Voice Understanding Model
Python Other UpdatedJan 8, 2025