Automate your mobile devices with natural language commands - an LLM-agnostic mobile agent 🤖
A SOTA open-source image editing model that aims to provide performance comparable to closed-source models such as GPT-4o and Gemini 2 Flash.
STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech.
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning