default search action
6th PRCV 2023: Xiamen, China - Part I
- Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji:
Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part I. Lecture Notes in Computer Science 14425, Springer 2024, ISBN 978-981-99-8428-2
Action Recognition
- Chengguo Yuan, Yu Jin, Zongzhen Wu, Fanting Wei, Yangzirui Wang, Lan Chen, Xiao Wang:
Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion Based Classification. 3-15 - Yang Shu, Wanggen Li, Doudou Li, Kun Gao, Biao Jie:
Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition. 16-28 - Wentian Xin, Yi Liu, Ruyi Liu, Qiguang Miao, Cheng Shi, Chi-Man Pun:
Auto-Learning-GCN: An Ingenious Framework for Skeleton-Based Action Recognition. 29-42 - Xiaowei Zhu, Qian Huang, Chang Li, Jingwen Cui, Yingying Chen:
Skeleton-Based Action Recognition with Combined Part-Wise Topology Graph Convolutional Networks. 43-59 - Mingliang Xue, Siwei Wang, Bing Fu, Zhengyang Zhao, Tao Liu, Lingfeng Lai:
Segmenting Key Clues to Induce Human-Object Interaction Detection. 60-71 - Teng Huang, Weiqing Kong, Jiaming Liang, Ziyu Ding, Hui Li, Xi Zhang:
Lightweight Multispectral Skeleton and Multi-stream Graph Attention Networks for Enhanced Action Prediction with Multiple Modalities. 72-83 - Wanchuan Yu, Hanyu Guo, Yan Yan, Jie Li, Hanzi Wang:
Spatio-Temporal Self-supervision for Few-Shot Action Recognition. 84-96 - Jiulin Li, Mengyu Yang, Yang Liu, Gongli Xi, Lanshan Zhang, Ye Tian:
A Fuzzy Error Based Fine-Tune Method for Spatio-Temporal Recognition Model. 97-108 - Jinzhao Luo, Lu Zhou, Guibo Zhu, Guojing Ge, Beiying Yang, Jinqiao Wang:
Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition. 109-119 - Ying Zhou, Yana Zhang, Aiqiu Wu:
HFGCN-Based Action Recognition System for Figure Skating. 120-130
Multi-modal Information Processing
- Zhengyu Li, Yao Wu, Yanyun Qu:
Image Priors Assisted Pre-training for Point Cloud Shape Analysis. 133-145 - Wei Yue:
AMM-GAN: Attribute-Matching Memory for Person Text-to-Image Generation. 146-158 - Liucun Lu, Jinghui Qin, Zequn Jie, Lin Ma, Liang Lin, Xiaodan Liang:
RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog. 159-171 - Jiancheng Huang, Yifan Liu, Jin Qin, Shifeng Chen:
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing. 172-184 - Jiaer Xia, Haozhe Yang, Yan Zhang, Pingyang Dai:
Enhancing Text-Image Person Retrieval Through Nuances Varied Sample. 185-196 - Yi Zhang, Ce Zhang, Xueting Hu, Zhihai He:
Unsupervised Prototype Adapter for Vision-Language Models. 197-209 - Wenjun Feng, Dazhen Lin, Donglin Cao:
Multimodal Causal Relations Enhanced CLIP for Image-to-Text Retrieval. 210-221 - Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Siqi Wang:
Exploring Cross-Modal Inconsistency in Entities and Emotions for Multimodal Fake News Detection. 222-234 - Mengluan Li, Yanqing Guo, Haiyan Fu, Yi Li, Hong Su:
Deep Consistency Preserving Network for Unsupervised Cross-Modal Hashing. 235-246 - Mintu Yang, Xianxu Hou, Hao Li, Linlin Shen, Lixin Fan:
Learning Adapters for Text-Guided Portrait Stylization with Pretrained Diffusion Models. 247-258 - Zikun Song, Pinle Qin, Jianchao Zeng, Shuangjiao Zhai, Rui Chai, JunYi Yan:
EdgeFusion: Infrared and Visible Image Fusion Algorithm in Low Light. 259-270 - Yuanyuan Qiu, Zhenning Yu, Zhenguo Gao:
An Efficient Momentum Framework for Face-Voice Association Learning. 271-283 - Yuan Qing, Naixing Wu, Shaohua Wan, Lixin Duan:
Multi-modal Instance Refinement for Cross-Domain Action Recognition. 284-296 - Yang Xu, Junyi Wu, Yan Yan, Xinsheng Du, Huiji Zhang, Jianqiang Zhao, Zhipeng Gao:
Modality Interference Decoupling and Representation Alignment for Caricature-Visual Face Recognition. 297-308 - Jie Wang, Yixiao Zheng, Ruoyi Du, Yiming Zhang, Kongming Liang, Zhanyu Ma:
Plugging Stylized Controls in Open-Stylized Image Captioning. 309-320 - Taoying Zhang, Hesong Li, Qiankun Liu, Xiaoyong Wang, Ying Fu:
MGT: Modality-Guided Transformer for Infrared and Visible Image Fusion. 321-332 - Chenyu Zhou, Xiuhong Li, Zhe Li, Fan Chen, Xiaofan Wang, Dan Yang, Bin Chen, Songlin Li:
Multimodal Rumor Detection by Using Additive Angular Margin with Class-Aware Attention for Hard Samples. 333-344 - Lingfeng Hu, Si Liu, Hanzi Wang:
An Effective Dynamic Reweighting Method for Unbiased Scene Graph Generation. 345-356 - Zejun Wang, Xinglong Wu, Hongwei Yang, Hui He, Yu Tai, Weizhe Zhang:
Multi-modal Graph and Sequence Fusion Learning for Recommendation. 357-369 - Guoyong Cai, Shunjie Wang, Guangrui Lv:
Co-attention Guided Local-Global Feature Fusion for Aspect-Level Multimodal Sentiment Analysis. 370-382 - Qing Zhang, Haocheng Lv, Jie Liu, Zhiyun Chen, Jianyong Duan, Mingying Xv, Hao Wang:
Discovering Multimodal Hierarchical Structures with Graph Neural Networks for Multi-modal and Multi-hop Question Answering. 383-394 - Chengjie Sun, Weiwei Chen, Lei Lin, Lili Shan:
Enhancing Recommender System with Multi-modal Knowledge Graph. 395-407 - Guoqing Xu, Min Hu, Xiaohua Wang, Jiaoyun Yang, Nan Li, Qingyu Zhang:
Location Attention Knowledge Embedding Model for Image-Text Matching. 408-421 - Dan Liu, Wei Song, Xiaobing Zhao:
Pedestrian Attribute Recognition Based on Multimodal Transformer. 422-433 - Xinyi Wu, Xia Yuan, YanChao Cui, Chunxia Zhao:
RGB-D Road Segmentation Based on Geometric Prior Information. 434-445 - Tingting Han, Yuanxin Lv, Zhou Yu, Jun Yu, Jianping Fan, Liu Yuan:
Contrastive Perturbation Network for Weakly Supervised Temporal Sentence Grounding. 446-460 - Feng Li, Enguang Zuo, Chen Chen, Cheng Chen, Mingrui Ma, Yunling Wang, Xiaoyi Lv, Min Li:
MLDF-Net: Metadata Based Multi-level Dynamic Fusion Network. 461-473 - Ran Yan, Ruiying Du, Kun He, Jing Chen:
Efficient Adversarial Training with Membership Inference Resistance. 474-486 - Hongyu Wang, Pengpeng Qiang, Hongye Tan, Jingchang Hu:
Enhancing Image Comprehension for Computer Science Visual Question Answering. 487-498 - Wei Bao, Jingjing Hu, Meiyu Huang, Xueshuang Xiang:
Cross-Modal Attentive Recalibration and Dynamic Fusion for Multispectral Pedestrian Detection. 499-510
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.