Highlights
- Pro
Stars
Vector Quantizer for Sign Language MediaPipe Poses
The Data and Code of Prompt2Sign: A Comprehensive Multilingual Sign Language Dataset.
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
Official project page of the paper "Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges" (Accepted by CVPR 2024)
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025
image description human evaluation tool
Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perform professional responsibilities
EG2025: A multimodal personality prediction framework based on adaptive graph transformer network and multi-task learning
Code for the paper "Low Latency Automotive Vision with Event Cameras", published in Nature
A paper list of spiking neural networks, including papers, codes, and related websites. 本仓库收集脉冲神经网络相关的顶会顶刊以及CNS论文和代码,正在持续更新中。
Event-based Vision Resources. Community effort to collect knowledge on event-based vision technology (papers, workshops, datasets, code, videos, etc)
RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities
[NAACL 2025 Main] AgentMove: A Large Language Model based Agentic Framework for Zero-shot Next Location Prediction.
Implementation of "A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception". CVPRW 2024
Banchmark for personality traits prediction with neural networks
The repository for all Azure OpenAI Samples complementing the OpenAI cookbook.
Deep functional residue identification
Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning"