lifeiteng

Feiteng lifeiteng

Full stack Algorithm Engineer

536 followers · 131 following

Achievements

x3 x2

Achievements

x3 x2

k2 Public
Forked from k2-fsa/k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda Apache License 2.0 Updated Dec 13, 2025
OmniSenseVoice Public

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 880 41 Apache License 2.0 Updated Dec 10, 2025
VibeVoice Public
Forked from microsoft/VibeVoice

Open-Source Frontier Voice AI

Python MIT License Updated Dec 9, 2025
DeepPhonemizer Public
Forked from spring-media/DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Python 1 MIT License Updated Dec 5, 2025
sebbs Public
Forked from merlresearch/sebbs

Prediction of sound event bounding boxes (SEBBs)

Python 1 GNU Affero General Public License v3.0 Updated Dec 4, 2025
NeMo Public
Forked from NVIDIA-NeMo/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 2 Apache License 2.0 Updated Dec 1, 2025
torch-audiomentations Public
Forked from iver56/torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 1 MIT License Updated Nov 24, 2025
pysubs2 Public
Forked from tkarabela/pysubs2

A Python library for editing subtitle files

Python MIT License Updated Nov 16, 2025
am-cf-tunnel Public template
Forked from amclubs/am-cf-tunnel

这是一个基于 Cloudflare Workers 和 Pages平台的脚本,通过EDtunnel修改，使用该脚本可以自动生成VLESS、Trojan免费节点,并配置信息使用在线配置转换到 Clash、 Singbox 、Quantumult X等工具中。

JavaScript Apache License 2.0 Updated Oct 20, 2025
Wan2.2 Public
Forked from Wan-Video/Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 1 Apache License 2.0 Updated Oct 15, 2025
DiffSynth-Studio Public
Forked from modelscope/DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python Apache License 2.0 Updated Sep 19, 2025
vall-e Public

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

text-to-speech tts valle in-context-learning large-language-models chatgpt vall-e

Python 2,194 334 Apache License 2.0 Updated Sep 10, 2025
wenyan-mcp Public
Forked from caol64/wenyan-mcp

文颜 MCP Server 可以让 AI 自动将 Markdown 文章排版后发布至微信公众号。

CSS Updated Aug 7, 2025
DiscoSeqSampler Public

Distributed Coordinated Sequence Sampler

Python 1 Apache License 2.0 Updated Aug 3, 2025
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python BSD 3-Clause "New" or "Revised" License Updated Jul 24, 2025
NotebookTTS Public

Text-To-Speech for NotebookLM

tts gpt notebookllama notebookl

35 Apache License 2.0 Updated Jul 20, 2025
ZipVoice Public
Forked from k2-fsa/ZipVoice

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python Apache License 2.0 Updated Jul 18, 2025
Magic-TryOn Public
Forked from vivoCameraResearch/Magic-TryOn

MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.

Python Other Updated Jun 16, 2025
HunyuanVideo-Avatar Public
Forked from Tencent-Hunyuan/HunyuanVideo-Avatar

Python Other Updated Jun 9, 2025
descript-audio-codec Public
Forked from descriptinc/descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 2 MIT License Updated May 26, 2025
lhotse Public
Forked from lhotse-speech/lhotse

Tools for handling speech data in machine learning projects.

Python 3 2 Apache License 2.0 Updated May 24, 2025
audio2py Public

Python 2 Updated May 24, 2025
sed_scores_eval Public
Forked from fgnt/sed_scores_eval

Python MIT License Updated May 14, 2025
Wan2.1 Public
Forked from Wan-Video/Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python Apache License 2.0 Updated May 9, 2025
HunyuanCustom Public
Forked from Tencent-Hunyuan/HunyuanCustom

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python Other Updated May 8, 2025
Aligner-SUPERB Public

Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark

Python 34 4 Apache License 2.0 Updated May 7, 2025
demoPanel Public
Forked from leegical/demoPanel

Updated Apr 21, 2025
fairseq Public
Forked from facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python MIT License Updated Mar 1, 2025
sed-hsmm Public
Forked from b-sigpro/sed-hsmm

AED

Python MIT License Updated Feb 10, 2025
audiossl Public
Forked from Audio-WestlakeU/audiossl

A library built for easier audio self-supervised training, downstream tasks evaluation

Python 4 Other Updated Feb 10, 2025

Feiteng lifeiteng

Achievements

Achievements

k2 Public

Uh oh!

OmniSenseVoice Public

Uh oh!

VibeVoice Public

Uh oh!

DeepPhonemizer Public

Uh oh!

sebbs Public

Uh oh!

NeMo Public

Uh oh!

torch-audiomentations Public

Uh oh!

pysubs2 Public

Uh oh!

am-cf-tunnel Public template

Uh oh!

Wan2.2 Public

Uh oh!

DiffSynth-Studio Public

Uh oh!

vall-e Public

Uh oh!

wenyan-mcp Public

Uh oh!

DiscoSeqSampler Public

Uh oh!

flash-attention Public

Uh oh!

NotebookTTS Public

Uh oh!

ZipVoice Public

Uh oh!

Magic-TryOn Public

Uh oh!

HunyuanVideo-Avatar Public

Uh oh!

descript-audio-codec Public

Uh oh!

lhotse Public

Uh oh!

audio2py Public

Uh oh!

sed_scores_eval Public

Uh oh!

Wan2.1 Public

Uh oh!

HunyuanCustom Public

Uh oh!

Aligner-SUPERB Public

Uh oh!

demoPanel Public

Uh oh!

fairseq Public

Uh oh!

sed-hsmm Public

Uh oh!

audiossl Public

Uh oh!