Skip to content
View LemonTency's full-sized avatar

Block or report LemonTency

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
569 stars written in Python
Clear filter

Res-SAM Framework for GPR Underground Hazard Detection

Python 1,185 62 Updated Sep 23, 2025

Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".

Python 1,172 62 Updated Apr 15, 2025

AI Manus is a general-purpose AI Agent system that supports running various tools and operations in a sandbox environment.

Python 1,160 272 Updated Nov 5, 2025

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,155 74 Updated Oct 21, 2024

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,154 158 Updated Jul 1, 2025

PantoMatrix: Generating Face and Body Animation from Speech

Python 1,133 181 Updated Jan 16, 2025

[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation

Python 1,125 148 Updated Aug 24, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,105 71 Updated Feb 7, 2025

We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose Sou…

Python 1,099 131 Updated Aug 23, 2025

Neural Network Compression Framework for enhanced OpenVINO™ inference

Python 1,098 264 Updated Nov 7, 2025

[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"

Python 1,078 44 Updated Oct 9, 2024

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python 1,070 89 Updated Jun 13, 2024

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,067 60 Updated Sep 19, 2025

Memory-Guided Diffusion for Expressive Talking Video Generation

Python 1,066 102 Updated Aug 6, 2025

A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.

Python 1,037 96 Updated Jun 16, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 1,011 35 Updated Aug 4, 2025

[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Python 1,008 39 Updated Jul 14, 2025

JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 1,007 142 Updated Mar 17, 2025

A codebase and a curated list of awesome deep long-tailed learning (TPAMI 2023).

Python 996 125 Updated Nov 5, 2025

Customized ID Consistent for human

Python 975 72 Updated Feb 19, 2025

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…

Python 968 105 Updated Feb 27, 2023

Code implementation of "Learning Efficient Online 3D Bin Packing on Packing Configuration Trees". We propose to enhance the practical applicability of online 3D Bin Packing Problem (BPP) via learni…

Python 934 57 Updated Aug 3, 2024

Video generation from text&image, 1st-gen

Python 921 53 Updated May 10, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 916 23 Updated Mar 17, 2025

A comprehensive benchmark of deepfake detection

Python 906 145 Updated Aug 20, 2025
Python 901 122 Updated Dec 11, 2024

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 881 51 Updated Jan 3, 2025

Next-Generation Interactive Intelligent Programming Assistant

Python 876 128 Updated Oct 13, 2024

Official code for TimeCraft: A Time Series Generation Framework for Real-World Applications

Python 867 60 Updated Nov 7, 2025

🧠+🎧 Build your music algorithms and AI models with the next-gen DAW 🔥

Python 851 104 Updated Jun 6, 2023