An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,017 331 Updated Aug 14, 2025

vikhyat / moondream

tiny vision language model

Python 9,532 753 Updated Nov 14, 2025

VectorSpaceLab / OmniGen2

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 4,039 23 Updated Mar 20, 2026

KoljaB / RealtimeTTS

Converts text to speech in realtime

Python 3,837 383 Updated Apr 1, 2026

alexanderlerch / Tonmeister-Grundlagen

Grundlagenskript fuer Tonmeisterstudenten (2000)

TeX 6 Updated Jul 13, 2025

skrbnv / javad

Python 66 3 Updated Jan 27, 2025

Fosowl / agenticSeek

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 25,792 2,876 Updated Mar 16, 2026

FurkanGozukara / Stable-Diffusion

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

JavaScript 2,663 364 Updated Mar 30, 2026

ChenHsing / AID

19 2 Updated Jun 13, 2024

SamurAIGPT / AI-Youtube-Shorts-Generator

A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

Python 3,182 546 Updated Feb 5, 2026

csslc / CCSR

[TIP2026] Official codes of CCSRv2 and CCSRv1: Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution

Python 597 45 Updated Jul 17, 2025

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 85,775 9,919 Updated Apr 3, 2026

PacktPublishing / Machine-Learning-for-Imbalanced-Data

Machine Learning for Imbalanced Data, published by Packt

Jupyter Notebook 279 85 Updated Mar 2, 2026

musicinformationretrieval / musicinformationretrieval.com

Instructional notebooks on music information retrieval.

Jupyter Notebook 1,269 414 Updated Mar 25, 2026

udlbook / udlbook

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 9,309 2,196 Updated Feb 24, 2026

XPixelGroup / DiffBIR

[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 4,055 351 Updated Jul 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

thomaschhh

Achievements

Achievements

Organizations

Block or report thomaschhh

Stars

allenai / olmocr

timvancann / timnology-youtube

HKUDS / DeepCode

SakanaAI / AI-Scientist

ydqmkkx / Respiro-en

Yuliang-Liu / MonkeyOCR

bytedance / Dolphin

boson-ai / higgs-audio

ace-step / ACE-Step

guidance-ai / llguidance

LeanModels / DFloat11

modelscope / ClearerVoice-Studio