smksyj

19 followers · 33 following

Achievements

Lists (30)

agent

15 repositories

alg

architecture

23 repositories

audio

113 repositories

backend

10 repositories

conditioning

diffusion

101 repositories

disentangle

flow

23 repositories

frontend

19 repositories

infra

15 repositories

language

22 repositories

llm

59 repositories

lora

manifold

ml_materials

mlops

MoE

monitoring_and_operation

music

optimization

personalization

quantization

18 repositories

reinforcement_learning

13 repositories

Scala

70 repositories

small_model

style_transfer

video

12 repositories

vision

26 repositories

web

Starred repositories

Tencent-Hunyuan / UniRL

UniRL is a Framework for Unified Multimodal Model Reinforcement Learning

Python 672 42 Updated Jun 22, 2026

ghostdogpr / sage

Modern Scala 3 client for Redis and Valkey, native and multi-backend

Scala 29 1 Updated Jun 21, 2026

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 251 23 Updated Apr 20, 2024

xzf-thu / Audio-Interaction

Python 550 29 Updated Jun 4, 2026

litagin02 / Style-Bert-VITS2

Forked from fishaudio/Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 1,305 208 Updated Dec 7, 2025

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,924 842 Updated Jun 12, 2026

softwaremill / ox

Safe direct-style streaming, concurrency and resiliency for Scala on the JVM

Scala 516 35 Updated Jun 22, 2026

VirtusLab / orca

Deterministic, AI-driven development flows.

Scala 116 9 Updated Jun 16, 2026

krafton-ai / Raon-Speech

Open-source speech AI models from KRAFTON, including Raon-Speech and Raon-SpeechChat for speech understanding, generation, and real-time full-duplex conversation.

Python 63 12 Updated Apr 7, 2026

Lakonik / LakonLab

Official implementation of AsymFlow, pi-Flow, GMFlow

Python 440 24 Updated Jun 13, 2026

stepfun-ai / Step-Audio2

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,464 107 Updated Mar 16, 2026

xzf-thu / Mega-ASR

First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come …

Python 1,033 67 Updated Jun 2, 2026

FunAudioLLM / SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

C 8,634 785 Updated Jun 22, 2026

stepfun-ai / Step-Audio-R1

Python 678 50 Updated Apr 29, 2026

OpenBMB / MiniCPM-V

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python 25,672 2,008 Updated Jun 4, 2026

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 58,922 6,440 Updated Jun 20, 2026

siddharthvaddem / openscreen

Create stunning demos for free. Open-source, no subscriptions, no watermarks, and free for commercial use. An alternative to Screen Studio.

TypeScript 38,734 2,763 Updated Jun 17, 2026

warpdotdev / warp

Warp is an agentic development environment, born out of the terminal.

Rust 62,152 5,072 Updated Jun 22, 2026

k2-fsa / OmniVoice

High-Quality Voice Cloning TTS for 600+ Languages

Python 7,657 1,199 Updated Jun 11, 2026

oyvindberg / jatatui

A Java port of ratatui — build rich terminal UIs from Java

Java 256 13 Updated May 17, 2026

mattlianje / layoutz

Simple, beautiful CLI output

Scala 345 12 Updated Jun 19, 2026

nealchen2003 / LangFlow

The first continuous diffusion language model that rivals discrete counterparts on standard language modeling benchmarks like LM1B and OpenWebText.

Python 78 2 Updated Jun 14, 2026

anomalyco / opencode

The open source coding agent.

TypeScript 177,081 21,619 Updated Jun 22, 2026

kubuszok / kindlings

Hearth fire starter - incubator/dogfooding for Hearth-based macro libraries

Scala 59 6 Updated Jun 21, 2026

JuliusBrussee / caveman

🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

JavaScript 75,531 4,270 Updated Jun 12, 2026

czg1225 / DMax

DMax: Aggressive Parallel Decoding for dLLMs

Python 126 7 Updated May 25, 2026

philwalk / uni

NumPy inspired Linear Algebra Library

Scala 12 1 Updated Jun 20, 2026

ghostdogpr / proteus

Code-first Protobuf and gRPC library for Scala

Scala 84 3 Updated Jun 22, 2026

ghostdogpr / purelogic

Direct-style pure domain logic for Scala

Scala 50 7 Updated Jun 20, 2026

VineeTagarwaL-code / claude-code

TypeScript 276 270 Updated Mar 31, 2026

Starred topics

Scala

speech-synthesis