Skip to content
View 6gsn's full-sized avatar

Highlights

  • Pro

Block or report 6gsn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.

Python 126 13 Updated Apr 10, 2026

G2P

Python 439 90 Updated Aug 11, 2025

[ICLR 2026] PixNerd: Pixel Neural Field Diffusion

Python 175 7 Updated Dec 10, 2025

JavaScript animation engine

JavaScript 67,075 4,488 Updated Feb 13, 2026

Project page for "MG-Gen: Single Image to Motion Graphics Generation with Layer Decomposition"

Python 14 3 Updated Apr 18, 2025

Mapping Mediapipe's 52 blendshapes to FLAME's expression coefficients and poses.

Jupyter Notebook 57 5 Updated Sep 26, 2025

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 313 15 Updated Mar 12, 2025

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,288 201 Updated Oct 31, 2024

[EMNLP2024 Demo], [ICASSP 2025], [ICASSP 2026] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.

Python 245 19 Updated Mar 26, 2026

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 420 18 Updated May 30, 2025

[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.

Jupyter Notebook 1,140 115 Updated Aug 26, 2024

[WIP] Scripts for fine-tuning Whisper

Python 222 30 Updated May 29, 2023

An integrated Japanese analyzer based on foundation models

Python 142 7 Updated Apr 6, 2026

Library to build speech synthesis systems designed for easy and fast prototyping.

Python 399 71 Updated Jun 29, 2024

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Python 268 57 Updated Jan 13, 2025

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 14,537 1,698 Updated Feb 22, 2026

A Python toolkit for sound source separation.

Python 167 15 Updated May 6, 2025

This is a repository of YACIS corpus and information of how to obtain the whole corpus as well as its annotations.

6 Updated Jan 18, 2022
174 10 Updated Sep 11, 2025

Neural network-based singing voice synthesis library for research

Python 734 83 Updated Oct 9, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 97,633 12,026 Updated Mar 27, 2026

A fork of open_jtalk

C++ 71 40 Updated Mar 31, 2025

HTS-style full-context labels for JSUT v1.1

51 2 Updated Apr 16, 2021

context labels and pronunciation data for JSUT corpus

77 13 Updated Sep 2, 2021

Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.

Dart 69,595 5,099 Updated Apr 8, 2026

Google AI 2018 BERT pytorch implementation

Python 6,527 1,321 Updated Sep 15, 2023

Face recognition using Tensorflow

Python 14,321 4,791 Updated Jul 24, 2023

Siamese and triplet networks with online pair/triplet mining in PyTorch

Python 3,169 633 Updated Apr 29, 2023

Official PyTorch implementation of "Synthesis of Screentone Patterns of Manga Characters"

Python 4 Updated Jul 30, 2023

This is a deep learning project on Manga109 dataset by using Yolov3

Python 4 2 Updated Jul 22, 2020
Next