Skip to content
View SOLARleisu's full-sized avatar

Block or report SOLARleisu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Teams-first Multi-agent orchestration for Claude Code

TypeScript 10,857 744 Updated Mar 22, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,962 3,381 Updated Mar 22, 2026

A PyTorch-based Speech Toolkit

Python 11,349 1,670 Updated Mar 1, 2026

A python package to analyze and compare voices with deep learning

Python 3,232 478 Updated Oct 12, 2023

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,271 73 Updated Aug 7, 2025

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,126 103 Updated Feb 26, 2026

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,923 218 Updated Jan 30, 2026

Utility for generating 3D Gaussian head avatars directly from monocular 2D video streams

Python 5 Updated Feb 24, 2026

[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

JavaScript 422 29 Updated Sep 19, 2025

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 11,230 1,830 Updated Mar 20, 2026

Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021

Python 109 24 Updated May 27, 2024

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

Python 615 127 Updated Jun 22, 2025

This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolution"

Python 16 7 Updated Aug 26, 2020

This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"

Python 713 154 Updated Jul 6, 2023

This is LipNet network where model learn from Lip movement and predict text without voice.

Jupyter Notebook 3 1 Updated Dec 18, 2024

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 81,935 9,640 Updated Mar 22, 2026

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Jupyter Notebook 396 28 Updated Apr 8, 2025

OmniTransfer implementation for LTX-2 (work in progress)

Python 10 2 Updated Mar 5, 2026
Python 160 29 Updated Dec 23, 2025

Implicit Motion Function - (unofficial) Microsoft recreation

Python 27 1 Updated Nov 19, 2024

wip - running some training with overfitting - https://wandb.ai/snoozie/vasa-overfitting

Python 312 39 Updated Jan 24, 2026

Slimmed, cleaned and fine-tuned oh-my-opencode fork, consumes much less tokens

TypeScript 2,257 162 Updated Mar 21, 2026
Python 17 2 Updated Dec 23, 2025

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 94,973 12,427 Updated Mar 22, 2026

Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM β€” deploy anywhere, swap anything πŸ¦€

Rust 28,313 3,861 Updated Mar 22, 2026

πŸ’« Toolkit to help you get started with Spec-Driven Development

Python 79,543 6,732 Updated Mar 20, 2026

Mandarin Chinese audio datasets aligned with Montreal Forced Aligner

Python 17 4 Updated Aug 13, 2024

Skill Directory for OpenClaw

TypeScript 6,565 1,033 Updated Mar 22, 2026

πŸ“¦ The Extras bucket for Scoop.

PowerShell 2,038 1,602 Updated Mar 22, 2026

omo; the best agent harness - previously oh-my-opencode

TypeScript 42,235 3,144 Updated Mar 21, 2026
Next