Starred repositories of youcaiSUN

A curated list of papers, models, datasets, and benchmarks for unified multi-modal embedding models.

38 2 Updated Apr 29, 2026

Official repository for the paper “Rethinking Facial Expression Recognition in the Era of Multimodal Large Language Models”

Python 28 4 Updated Nov 5, 2025

A vision foundation model for affective and facial recognition tasks

Python 5 1 Updated Sep 17, 2025

EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions

21 Updated Jul 29, 2025

Open-source unified multimodal model

Python 6,016 533 Updated May 4, 2026

[CVPR'25] AVF-MAE++: Scaling Affective Video Facial Masked Autoencoders via Efficient Audio-Visual Self-Supervised Learning

Python 21 1 Updated Jun 11, 2026

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,746 2,230 Updated Feb 1, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 12,210 1,251 Updated Nov 21, 2025

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…

Python 4,441 655 Updated Jun 7, 2026

Official implementation of MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control

Python 62 7 Updated Nov 20, 2025

LAFS: Landmark-based Facial Self-supervised Learning for Face Recognition

Python 44 6 Updated Nov 14, 2024

EC-STFL: Expression-Clustered Spatiotemporal Feature Learning, proposed for the video-based facial expression recognition (FER) task.

Python 5 Updated Sep 6, 2024

This repository provides the code for MMA-DFER, a multimodal (audiovisual) emotion recognition method. It is the official implementation of the paper MMA-DFER: MultiModal Adaptation of unimodal mo…

Python 57 8 Updated Sep 16, 2024

[CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning

Python 40 2 Updated Apr 20, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,231 75 Updated Jun 1, 2026

📄 A collection of résumé templates suited to Chinese (LaTeX, HTML/JS, and so on), maintained by @hoochanlon

7,808 506 Updated Mar 2, 2025

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Python 234 15 Updated Nov 30, 2025

[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition

Python 121 9 Updated Aug 29, 2025

Toolkits for Multimodal Emotion Recognition

Python 4 Updated Jan 10, 2024

Python 180 12 Updated Jul 9, 2024

GPT-4V with Emotion

Python 96 7 Updated Dec 8, 2023

Jupyter Notebook 12,968 954 Updated Oct 25, 2025

Official code of "VRA: Variational Rectified Activation for Out-of-distribution Detection"

Python 9 1 Updated Sep 25, 2023

A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)

Python 58 4 Updated Apr 17, 2024

Repository with the code of the paper: A proposal for Multimodal Emotion Recognition using aural transformers and Action Units on RAVDESS dataset

Python 111 31 Updated Mar 29, 2024

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

1,230 59 Updated Jun 28, 2024

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

510 38 Updated Mar 18, 2025

A curated list of prompt-based papers in computer vision and vision-language learning.

927 68 Updated Dec 18, 2023