Stars
A RL Framework for multi LLM agent system
A high-throughput and memory-efficient inference and serving engine for LLMs
Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"
A simple yet powerful agent framework that delivers with open-source models
Benchmarking and Bridging Emotion Conflicts for Multimodal Emotion Reasoning (ACM MM 2025 Oral)
Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents
[2025 NeurlPS Spotlight] MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
[ACM MM 2025] Official repository of "EmoSym: A Symbiotic Framework for Unified Emotional Understanding and Generation via Latent Reasoning"
The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions
(NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis
Now: Translation Quality Evaluation of Sign Language Avatar (Before: Quality Evaluation of Sign Language Avatars Translation(QESLAT))
A collection of resources that investigate social agents.
Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23
[ACL 2024] CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling
Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion (ACL 2024 Findings)
Python library for audio and music analysis
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications