-
HFUT
- Hefei, China
- https://cyinen.github.io/
- https://orcid.org/0009-0002-4436-6613
Stars
[TCSS 2025] The official implementation code for "PhysioSync: Temporal and Cross-Modal Contrastive Learning Inspired by Physiological Synchronization for EEG-Based Emotion Recognition"
Awesome-Emotion-Reasoning is a collection of Emotion-Reasoning works, including papers, codes and datasets
[ ICCV 2025 ] FaceXFormer: A Unified Transformer for Facial Analysis
CLAIP-Emo: Parameter-Efficient Adaptation of Language-supervised models for In-the-Wild Audiovisual Emotion Recognition
Codebase for our CVPR 2025 paper "SMILE 😊 : Infusing Spatial and Motion Semantics in Masked Video Learning"
[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
[CVPR 2025] DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation & [ICLR 2024] DFormer & [NeuriPS 2025] OmniSegmentor
[CVPRW 2023]The Winner's Solution of CVPR2023-ABAW5 Emotional Reaction Intensity (ERI) Estimation Challenge
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions
Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.
Perplexica is an AI-powered answering engine. It is an Open source alternative to Perplexity AI
FaRL for Facial Representation Learning [Official, CVPR 2022]
Explainable Multimodal Emotion Reasoning (EMER), OV-MER (ICML), and AffectGPT (ICML, Oral)
Psyche-R1 (Chinese Psychological Reasoning LLM)
A Survey of Reinforcement Learning for Large Reasoning Models
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
Production-ready platform for agentic workflow development.
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
The official code for Boosting Multimodal Learning via Disentangled Gradient Learning
A simple and easy-to-use library to enjoy videogames programming
[ECCV 2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
基于 ChatGPT API 的 Raycast 翻译插件 - Raycast extension for translation based on ChatGPT API.