Skip to content
View MaxMax2016's full-sized avatar
  • UESTC
  • ChengDu,China

Block or report MaxMax2016

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1,834 144 Updated Dec 20, 2025

SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation

Python 505 35 Updated Dec 13, 2025

🔥 OneThinker: All-in-one Reasoning Model for Image and Video

Python 326 25 Updated Dec 9, 2025

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 11,424 1,201 Updated Dec 21, 2025

[ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis

Python 40 4 Updated Apr 9, 2025

Optimized Whisper models for streaming and on-device use

Python 768 54 Updated Dec 17, 2025
Python 7,532 444 Updated Dec 14, 2025

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 3,080 311 Updated Sep 4, 2025

This is the official implementation for Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1.

HTML 150 12 Updated Oct 27, 2025

Official implementation of YingMusic-SVC.

Python 91 7 Updated Dec 15, 2025

Extracting time features from text using a Finite State Transducer (FST) in Python

Python 50 7 Updated Dec 1, 2025

TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control

Python 219 14 Updated Dec 12, 2025

Minimal reproduction of OneRec

Python 747 107 Updated Dec 17, 2025

Lightning-Fast, On-Device TTS — running natively via ONNX.

JavaScript 1,867 171 Updated Dec 15, 2025

EverMemOS is an open-source, enterprise-grade intelligent memory system. Our mission is to build AI memory that never forgets, making every conversation built on previous understanding.

Python 1,339 123 Updated Dec 15, 2025

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,298 216 Updated Dec 19, 2025

5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs

Python 56 9 Updated Nov 19, 2025

Momentum Human Rig is an anatomically-inspired parametric full-body digital human model developed at Meta. It includes: A parametric body skeletal model; A realistic 3D mesh skinned to the skeleton…

Python 504 31 Updated Dec 16, 2025

A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.

Python 928 126 Updated Dec 11, 2025

MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowserComp and xBench.

Python 1,584 172 Updated Nov 30, 2025

MiroThinker is a series of open-source agentic models trained for deep research and complex tool use scenarios.

Python 1,343 92 Updated Dec 20, 2025

The official repo of BridgeVoC, which explores using the Schrödinger Bridge framework for neural vocoding.

Python 237 37 Updated Nov 20, 2025

Neural Accent Conversion via Disentangled Speech Representations

Jupyter Notebook 6 1 Updated Nov 14, 2025

Thin wrapper for "pandoc" (MIT)

Python 1,087 121 Updated Dec 3, 2025

a Dify plugin to convert markdown text into docx file

Python 32 16 Updated Jun 8, 2025

Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)

49 Updated Nov 18, 2025

一款提示词优化器,助力于编写高质量的提示词

TypeScript 17,951 2,232 Updated Dec 20, 2025

🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,1分钟手机通知,无需…

Python 39,897 20,779 Updated Dec 20, 2025

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 787 52 Updated Dec 8, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,491 213 Updated Dec 16, 2025
Next