Starred repositories
AirLLM 70B inference with single 4GB GPU
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
0xSojalSec / airllm
Forked from lyogavin/airllmAirLLM 70B inference with single 4GB GPU
Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
A webui for propainter. Easily pick up objects from the video and eliminate them.
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
[3DV 2026] "SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass"
Examples for using Hyperbrowser
An extensive node suite for ComfyUI with over 210 new nodes