Psjs

Psjs

1 follower · 1 following

Stars

smkim37 / TripleSumm

The official code of "TripleSumm: Adaptive Triple-Modality Fusion for Video Summarization" (ICLR 2026)

Python 8 2 Updated Mar 26, 2026

MRHiSum / MR.HiSum

Python 50 2 Updated Nov 1, 2024

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,692 2,231 Updated Feb 1, 2025

fylimas / nsfc

nsfc - 国家自然科学基金项目LaTeX模版(面地青CBA)

TeX 1,224 313 Updated Mar 5, 2026

BIT-DA / I2V-GAN

ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation"

Python 126 25 Updated Feb 11, 2022

I2V-Adapter / I2V-Adapter-repo

I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models

214 4 Updated Dec 30, 2023

jy0205 / LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 605 31 Updated Oct 6, 2024

ysy31415 / direct_a_video

Python 93 7 Updated May 25, 2024

simofoti / 3DVAE-SwapDisentangled

Code of "3D Shape Variational Autoencoder Latent Disentanglement via Mini-Batch Feature Swapping for Bodies and Faces"

Python 68 12 Updated Jul 2, 2022

yerfor / Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Python 1,092 130 Updated Oct 18, 2024

zhoubolei / bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,899 168 Updated May 9, 2023

LeMei / UniMSE

repository for HapticLLaMA: A Multimodal Sensory Language Model for Haptic Captioning

Python 202 27 Updated Sep 3, 2025

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,255 215 Updated Apr 13, 2026

AndreyGuzhov / AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 863 104 Updated Sep 30, 2021

dondongwon / LPMDataset

Jupyter Notebook 54 13 Updated Oct 17, 2023

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,686 6,187 Updated Feb 9, 2026

DefangChen / SimKD

[CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier".

Python 103 19 Updated Jun 16, 2022

e-apostolidis / AC-SUM-GAN

A PyTorch Implementation of AC-SUM-GAN from "AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization" (IEEE TCSVT 2021)

Python 28 10 Updated May 4, 2022

TIBHannover / UnsupervisedVideoSummarization

Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021

Python 21 10 Updated Apr 5, 2022

jnzs1836 / intent-vizor

Python 16 3 Updated Jul 10, 2024

WujiangXu / MHSCNet

The code for ICASSP23 paper "MHSCNet: A Multimodal Hierarchical Shot-aware Convolutional Network for Video Summarization"

Python 10 2 Updated Aug 12, 2024

mehryar72 / RS-SUM

Python 11 Updated Feb 29, 2024

BerserkerMother / Video-Summarization

video summarization research repo

Python 2 1 Updated Jul 10, 2022

pangzss / pytorch-CTVSUM

Pytorch code for paper Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization

Python 21 3 Updated Jan 7, 2023

boheumd / A2Summ

The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)

Python 86 12 Updated Apr 24, 2023

Exploration-Lab / Shapes-of-Emotion

Jupyter Notebook 15 3 Updated Mar 29, 2023

ppfliu / emotion-recognition

Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition

Python 14 Updated May 10, 2022

skeletonNN / CFN-SR

Python 27 7 Updated Oct 7, 2021

haibao-yu / FFNet-VIC3D

Python 86 8 Updated Mar 27, 2024

Zhaozixiang1228 / MMIF-CDDFuse

[CVPR 2023] Official implementation for "CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion."

Python 616 55 Updated Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly