Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 17,257 1,564 Updated Sep 5, 2024

D-X-Y / AutoDL-Projects

Automated deep learning algorithms implemented in PyTorch.

Python 1,582 285 Updated Apr 24, 2022

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python 9,680 1,124 Updated Jul 26, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,764 12,065 Updated Dec 19, 2025

myshell-ai / MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.

Python 7,068 1,004 Updated Dec 24, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 24,355 2,000 Updated Dec 1, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 54,295 6,875 Updated Dec 19, 2025

JiehangXie / PaddleBoBo

A virtual streamer (VTuber) built on PaddlePaddle

Python 1,071 304 Updated Mar 12, 2023

jerryuhoo / VTuberTalk

Python 384 53 Updated Sep 30, 2022

navinfoeurope / anonymizer

Detection and blurring of human faces and license plates in images.

Jupyter Notebook 11 2 Updated Jan 14, 2025

navinfoeurope / segmentation-152-classes

Semantic Segmentation Model 152 classes is an AWS marketplace model package on 152 class segmentation for autonomous driving use-cases

Jupyter Notebook 1 Updated May 20, 2021

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,445 1,946 Updated Oct 20, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,977 1,650 Updated Nov 19, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 16,369 2,235 Updated Dec 15, 2025

nazdridoy / kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.

Python 985 117 Updated Dec 15, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,925 5,855 Updated Aug 16, 2024

mozilla / TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 10,086 1,326 Updated Nov 9, 2023

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 16,835 2,017 Updated Dec 2, 2025

understand-ai / anonymizer

**ARCHIVED** An anonymizer to obfuscate faces and license plates.

Python 273 99 Updated Apr 11, 2022

sotirismos / anonymization-pipeline

Anonymization pipeline (faces, license plates detection & blurring) of video frames utilizing various deep learning models, part of GRUBLES project

Python 8 2 Updated Nov 7, 2022

varungupta31 / dashcam_anonymizer

Code to Blur Human Faces and Vehicle License Plates in Video and Images using a SoTA Object Detection model YOLOv8

Python 73 16 Updated Sep 18, 2025

teeeps / traccar-vps-guide

How to set up a traccar server on an Amazon Lightsail VPS

18 7 Updated Feb 2, 2023

NickMa mayujie

Stars

tfaehse / DashcamCleaner

lhyfst / awesome-autonomous-driving-datasets

CSAILVision / places365

hongleizhang / RSPapers

ultralytics / ultralytics

szad670401 / HyperLPR

ibaiGorordo / ONNX-YOLOv6-Object-Detection

kingjosephm / vehicle_make_model_dataset

IDEA-Research / Grounded-Segment-Anything