Lists (1)
Sort Name ascending (A-Z)
Stars
基于阶跃星辰开放平台语音api的android 语音sdk,支持tts 流式与非流式,asr,流式,非流式音频播放器,语音录制能力
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
The Intelligent GUI Agent for Mobile Phones
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
GELab: GUI Exploration Lab. One of the best GUI agent solutions in the galaxy, built by the StepFun-GELab team and powered by Step’s research capabilities.