-
NeMo Public
Forked from NVIDIA-NeMo/NeMoA scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 UpdatedMar 20, 2026 -
OmniTransfer-hack Public
Forked from johndpope/OmniTransfer-hackOmniTransfer implementation for LTX-2 (work in progress)
Python Other UpdatedMar 5, 2026 -
aidlc-workflows Public
Forked from awslabs/aidlc-workflowsAI-Driven Life Cycle (AI-DLC) adaptive workflow steering rules for AI coding agents
MIT No Attribution UpdatedFeb 8, 2026 -
VASA-1-hack Public
Forked from johndpope/VASA-1-hackwip - running some training with overfitting - https://wandb.ai/snoozie/vasa-overfitting
Python MIT License UpdatedJan 24, 2026 -
-
-
video-subtitle-extractor Public
Forked from xlbbb517/video-subtitle-extractor视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Python Apache License 2.0 UpdatedDec 19, 2025 -
zotero-arxiv-daily Public
Forked from TideDra/zotero-arxiv-dailyRecommend new arxiv papers of your interest daily according to your Zotero libarary.
Python GNU Affero General Public License v3.0 UpdatedOct 20, 2025 -
MiniMax-Remover Public
Forked from zibojia/MiniMax-RemoverThis is the official implementation of our paper: "MiniMax-Remover: Taming Bad Noise Helps Video Object Removal"
Python UpdatedJul 27, 2025 -
DiT Public
Forked from facebookresearch/DiTOfficial PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Python Other UpdatedJul 16, 2025 -
LipGAN Public
Forked from Rudrabha/LipGANThis repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".
Python MIT License UpdatedJun 22, 2025 -
Wav2Lip Public
Forked from Rudrabha/Wav2LipThis repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Python UpdatedJun 22, 2025 -
LatentSync Public
Forked from bytedance/LatentSyncTaming Stable Diffusion for Lip Sync!
Python Apache License 2.0 UpdatedJun 20, 2025 -
gpu-burn Public
Forked from wilicc/gpu-burnMulti-GPU CUDA stress test
C++ BSD 2-Clause "Simplified" License UpdatedMay 8, 2025 -
InstantCharacter Public
Forked from Tencent-Hunyuan/InstantCharacterPython Other UpdatedApr 18, 2025 -
EMOPortraits Public
Forked from neeek2303/EMOPortraitsOfficial implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
Jupyter Notebook Apache License 2.0 UpdatedApr 8, 2025 -
SegMAN Public
Forked from yunxiangfu2001/SegMAN[CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Python UpdatedMar 29, 2025 -
label-studio Public
Forked from HumanSignal/label-studioLabel Studio is a multi-type data labeling and annotation tool with standardized output format
JavaScript Apache License 2.0 UpdatedMar 7, 2025 -
label-studio-sdk Public
Forked from HumanSignal/label-studio-sdkLabel Studio SDK
Python Apache License 2.0 UpdatedMar 7, 2025 -
mmcv Public
Forked from open-mmlab/mmcvOpenMMLab Computer Vision Foundation
Python Apache License 2.0 UpdatedFeb 10, 2025 -
mmengine Public
Forked from open-mmlab/mmengineOpenMMLab Foundational Library for Training Deep Learning Models
Python Apache License 2.0 UpdatedJan 24, 2025 -
Retrieval-based-Voice-Conversion-WebUI Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion-WebUIEasily train a good VC model with voice data <= 10 mins!
Python MIT License UpdatedNov 24, 2024 -
LAVIS-projects-blip2 Public
Forked from salesforce/LAVISLAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedNov 18, 2024 -
BiRefNet Public
Forked from ZhengPeng7/BiRefNet[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Python MIT License UpdatedNov 18, 2024 -
opencv Public
Forked from opencv/opencvOpen Source Computer Vision Library
C++ Apache License 2.0 UpdatedOct 24, 2024 -
opencv-mobile Public
Forked from nihui/opencv-mobileThe minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, HarmonyOS, WebAssembly, watchOS, tvOS, visionOS
C++ Apache License 2.0 UpdatedOct 22, 2024 -
mmdeploy Public
Forked from open-mmlab/mmdeployOpenMMLab Model Deployment Framework
Python Apache License 2.0 UpdatedSep 30, 2024 -
facefusion Public
Forked from facefusion/facefusionNext generation face swapper and enhancer
Python Other UpdatedSep 3, 2024 -
CodeFormer Public
Forked from sczhou/CodeFormer[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Python Other UpdatedAug 11, 2024 -
Real-ESRGAN Public
Forked from xinntao/Real-ESRGANReal-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 6, 2024