Skip to content
View leileah's full-sized avatar

Block or report leileah

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

基于阶跃星辰开放平台语音api的android 语音sdk,支持tts 流式与非流式,asr,流式,非流式音频播放器,语音录制能力

Kotlin 4 Updated Dec 15, 2025

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Python 826 55 Updated Sep 8, 2025

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 2,071 88 Updated Dec 15, 2025

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,274 92 Updated Sep 22, 2025
Python 426 28 Updated Nov 27, 2025

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 788 52 Updated Dec 8, 2025

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 214 5 Updated Dec 10, 2025

The Intelligent GUI Agent for Mobile Phones

Python 1,553 187 Updated Dec 21, 2025

Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖

Python 6,999 716 Updated Dec 18, 2025

GELab: GUI Exploration Lab. One of the best GUI agent solutions in the galaxy, built by the StepFun-GELab team and powered by Step’s research capabilities.

Python 1,647 136 Updated Dec 19, 2025