View PeterYoungQaQ's full-sized avatar

Dependency-Aware Structural Retrieval for Massive Agent Skills

Python 50 3 Updated Apr 9, 2026

Let Skills Evolve Collectively with Agentic Evolver

Python 503 61 Updated Apr 10, 2026

The agent that grows with you

Python 76,588 10,238 Updated Apr 13, 2026

Opinionated skills for AI coding agents to create stunning diagrams and visualizations directly in Markdown. These skills extend agent capabilities across diagram generation, data visualization, an…

1,529 94 Updated Apr 13, 2026

MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

Python 328 20 Updated Apr 13, 2026

JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.

Python 1,594 82 Updated Apr 12, 2026

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

Rust 25,518 1,474 Updated Apr 13, 2026

Official skills for the GLM family of models.

Python 308 24 Updated Apr 7, 2026

PixelSmile: Fine-grained facial expression editing with continuous control, reduced semantic entanglement, and strong identity preservation.

Python 270 12 Updated Apr 10, 2026

Multimodal OCR: Parse Anything from Documents

Python 174 13 Updated Mar 20, 2026

A fast, helpful, and open-source document parser

TypeScript 4,202 280 Updated Apr 13, 2026
Python 376 29 Updated Mar 25, 2026

Covo-Audio is a 7B-parameter end-to-end large audio language model that directly processes continuous audio inputs and generates audio outputs within a single unified architecture.

Python 137 14 Updated Mar 17, 2026

Open Source Speech Language Model

Jupyter Notebook 960 100 Updated Mar 24, 2026

QIE-Object-Remover-Bbox is an advanced, AI-powered image editing application specifically designed to perform precise object removal and background inpainting based on user-defined bounding box coo…

Python 8 Updated Mar 16, 2026

FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…

Python 1,146 66 Updated Apr 3, 2026

Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Speech API. Speech recognition API service based on FunASR and Qwen3-ASR, supporting 52 languages, compatible with Open…

Python 232 37 Updated Apr 13, 2026

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

Python 88 11 Updated Jan 14, 2026

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

Python 1,004 94 Updated Feb 25, 2026

[CVPR 2026] Official implementation of the paper "HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images".

79 5 Updated Mar 3, 2026

🚀 AI Fully Automated Short Video Engine

Python 3,771 625 Updated Apr 13, 2026

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,390 103 Updated Mar 16, 2026

[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E2E Retrieval.

Python 31 3 Updated Jul 11, 2025

A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.

Python 249 23 Updated Mar 11, 2026

Helios: Real Real-Time Long Video Generation Model

Python 1,678 129 Updated Apr 11, 2026

SoulX-FlashHead: A unified 1.3B-parameter framework designed for high-fidelity, infinite-length, and real-time streaming portrait video generation.

Python 665 53 Updated Apr 2, 2026

Linux kernel source tree

C 228,560 61,573 Updated Apr 13, 2026

The library for web and native user interfaces.

JavaScript 244,461 50,913 Updated Apr 13, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 356,516 72,267 Updated Apr 13, 2026