- SkyworkAI
- Beijing, China
- yuqiang-xie.github.io
- @IndexFziQ
Starred repositories
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]
Official Code for GPIC: A Giant Permissive Image Corpus for Visual Generation
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
An agentic skills framework & software development methodology that works.
An open-source AI agent that brings the power of Gemini directly into your terminal.
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds
SkyReels V3: Multimodal Video Generation Model
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"
Development repository for the Triton language and compiler
Structured Video Comprehension of Real-World Shorts
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation…
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Download pictures (or videos) along with their captions and other metadata from Instagram.
A feature-rich command-line audio/video downloader
Command-line program to download image galleries and collections from several image hosting sites
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.