Zhikang Niu-SII ZhikangNiu

🎯

focus

Ph.D. Student, SJTU @X-LANCE & SII @sii-research | Intern @MiniMax-AI @InternLM (Shanghai AILab) @microsoft (MSRA)

395 followers · 655 following

Shanghai Jiao Tong University & Shanghai Innovation Institute
Shanghai
UTC +08:00
https://zhikangniu.github.io/

Achievements

x2 x3 x2

arxiv_daily Public

Python 17 2 Apache License 2.0 Updated Apr 30, 2026
vllm-omni Public
Forked from vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

Python Apache License 2.0 Updated Apr 25, 2026
F5-TTS Public
Forked from SWivid/F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 2 MIT License Updated Apr 23, 2026
ZhikangNiu Public

2 1 Updated Apr 18, 2026
terminal-setup Public

Shell 2 Updated Mar 31, 2026
CLIProxyAPI Public
Forked from router-for-me/CLIProxyAPI

Wrap Gemini CLI, Antigravity, ChatGPT Codex, Claude Code, Qwen Code, iFlow as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini 2.5 Pro, GPT 5, Claude, Qwe…

Go MIT License Updated Mar 25, 2026
sub2api Public
Forked from Wei-Shaw/sub2api

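Sub2API-CRS2: a one-stop open-source relay service that provides unified access to Claude, OpenAI, Gemini, and Antigravity subscriptions, supports shared ("carpool") use to split costs more efficiently, and works seamlessly with native tools.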

Go MIT License Updated Mar 24, 2026
codex-manager Public
Forked from cnlimiter/codex-manager

Python MIT License Updated Mar 24, 2026
Curator Public
Forked from NVIDIA-NeMo/Curator

Scalable data preprocessing and curation toolkit for LLMs

Python Apache License 2.0 Updated Mar 19, 2026
Qwen3-TTS Public
Forked from QwenLM/Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 1 Apache License 2.0 Updated Mar 16, 2026
stable-audio-tools Public
Forked from Stability-AI/stable-audio-tools

Generative models for conditional audio generation

Python 1 MIT License Updated Mar 9, 2026
diffusers Public
Forked from huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python Apache License 2.0 Updated Mar 7, 2026
VibeVoice Public
Forked from microsoft/VibeVoice

Open-Source Frontier Voice AI

Python MIT License Updated Mar 6, 2026
StarBench Public
Forked from InternLM/StarBench

Python MIT License Updated Mar 3, 2026
nanochat Public
Forked from karpathy/nanochat

The best ChatGPT that $100 can buy.

Python 1 MIT License Updated Mar 3, 2026
ZhikangNiu.github.io Public
Forked from yyysjz1997/yyysjz1997.github.io

HTML Updated Feb 12, 2026
Idea2Paper Public
Forked from AgentAlphaAGI/Idea2Paper

Idea2Paper Official Demo

Python MIT License Updated Feb 1, 2026
Semantic-VAE Public

Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"

Python 112 8 Updated Dec 20, 2025
SII_Thesis_Template Public
Forked from HYLZ-2019/SII_Thesis_Template

TeX Updated Dec 18, 2025
flux2 Public
Forked from black-forest-labs/flux2

Official inference repo for FLUX.2 models

Python Apache License 2.0 Updated Nov 25, 2025
DC-Speech-VAE Public
Forked from KdaiP/DC-Speech-VAE

5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs

Python Apache License 2.0 Updated Nov 19, 2025
CosyVoice Public
Forked from FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python Apache License 2.0 Updated Nov 18, 2025
calm Public
Forked from shaochenze/calm

Official implementation of "Continuous Autoregressive Language Models"

Python MIT License Updated Nov 10, 2025
SAC Public
Forked from Soul-AILab/SAC

Training, inference, and testing of the SAC speech codec model.

Python 1 Apache License 2.0 Updated Nov 6, 2025
Hybrid-SAC Public

Updated Nov 6, 2025
Ming-UniAudio Public
Forked from inclusionAI/Ming-UniAudio

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python MIT License Updated Oct 28, 2025
metaquery Public
Forked from facebookresearch/metaquery

Official Implementation of Paper Transfer between Modalities with MetaQueries

Python Other Updated Oct 12, 2025
NeMo-speech-data-processor Public
Forked from NVIDIA/NeMo-speech-data-processor

A toolkit for processing speech data and creating speech datasets

Python 4 Apache License 2.0 Updated Sep 29, 2025
flux Public
Forked from black-forest-labs/flux

Official inference repo for FLUX.1 models

Python Apache License 2.0 Updated Jul 31, 2025
SongBloom Public
Forked from tencent-ailab/SongBloom

Python Updated Jun 30, 2025
