Skip to content
View bmwas's full-sized avatar

Highlights

  • Pro

Block or report bmwas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing

Python 37 Updated Dec 22, 2025

A high quality and fast TTS repository

Python 332 27 Updated Dec 22, 2025

Pivotal Token Search

Python 140 9 Updated Dec 20, 2025

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 2,462 185 Updated Dec 23, 2025

We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference speed.

Python 228 12 Updated Dec 24, 2025

LiveKit Client SDK for ESP32 series chips. Easily enable real-time audio, video, and data for embedded projects.

C 69 14 Updated Dec 20, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,977 129 Updated Dec 18, 2025

Can we build an addordable open source health ring ?

C++ 10 2 Updated May 8, 2025

Your CrewAI Powered Video Editing Assistant

Python 706 115 Updated Sep 27, 2024

EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLMs", Official Implementation

Python 10 1 Updated Oct 18, 2025

PersonaLive! : Expressive Portrait Image Animation for Live Streaming

Python 721 76 Updated Dec 23, 2025

Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Python 508 22 Updated Dec 22, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,811 1,086 Updated Dec 24, 2025

A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.

Python 248 35 Updated Dec 15, 2025
HTML 50 1 Updated Dec 8, 2025

Paper Debugger is the best overleaf companion

TypeScript 1,175 53 Updated Dec 21, 2025

One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer

Python 403 42 Updated Dec 21, 2025

Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"

Python 83 13 Updated May 16, 2024

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,171 105 Updated Dec 18, 2025

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

Python 239 21 Updated Dec 23, 2025

Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stringent computational constraints.

Python 279 13 Updated Dec 21, 2025

A list of papers for child ASR

50 6 Updated Oct 8, 2024

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,505 214 Updated Dec 16, 2025

An end-to-end Data Scientist

Python 456 61 Updated Dec 7, 2025

Effortless monitoring and analytics for API frameworks.

Go 598 56 Updated Dec 22, 2025

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 3,504 493 Updated Dec 23, 2025
Next