- Warsaw
-
11:19
(UTC +01:00) - https://www.linkedin.com/in/yujie-ma-338434134/
Stars
Censor identifiable information in videos, in particular dashcam recordings in Germany.
an awesome list of autonomous driving datasets
The Places365-CNNs for Scene Classification
RSTutorials: A Curated List of Must-read Papers on Recommender System.
High Performance Chinese License Plate Recognition Framework.
Python scripts performing object detection using the YOLOv6 model in ONNX.
Scrapes Google to create a ~700k sample of US passenger vehicle images with 574 distinct make-models
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Automated deep learning algorithms implemented in PyTorch.
Manipulate audio with a simple and easy high level interface
A high-throughput and memory-efficient inference and serving engine for LLMs
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Lightweight coding agent that runs in your terminal
Detection and blurring of human faces and license plates in images.
Semantic Segmentation Model 152 classes is an AWS marketplace model package on 152 class segmentation for autonomous driving use-cases
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
A TTS model capable of generating ultra-realistic dialogue in one pass.
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
**ARCHIVED** An anonymizer to obfuscate faces and license plates.
Anonymization pipeline (faces, license plates detection & blurring) of video frames utilizing various deep learning models, part of GRUBLES project
Code to Blur Human Faces and Vehicle License Plates in Video and Images using a SoTA Object Detection model YOLOv8
How to set up a traccar server on an Amazon Lightsail VPS