default search action
26th ACM Multimedia 2018: Seoul, Republic of Korea
- Susanne Boll, Kyoung Mu Lee, Jiebo Luo, Wenwu Zhu, Hyeran Byun, Chang Wen Chen, Rainer Lienhart, Tao Mei:
2018 ACM Multimedia Conference on Multimedia Conference, MM 2018, Seoul, Republic of Korea, October 22-26, 2018. ACM 2018, ISBN 978-1-4503-5665-7
FF-1
- Max Mühlhäuser:
Session details: FF-1. - Chuan-Xiang Li, Zhen-Duo Chen, Peng-Fei Zhang, Xin Luo, Liqiang Nie, Wei Zhang, Xin-Shun Xu:
SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval. 1-9 - Ana Garcia del Molino, Joo-Hwee Lim, Ah-Hwee Tan:
Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams. 10-17 - Xingxing Wei, Jun Zhu, Sitong Feng, Hang Su:
Video-to-Video Translation with Global Temporal Consistency. 18-25 - Jinxing Li, Bob Zhang, Guangming Lu, David Zhang:
Shared Linear Encoder-based Gaussian Process Latent Variable Model for Visual Classification. 26-34 - Jia-Xing Zhong, Nannan Li, Weijie Kong, Tao Zhang, Thomas H. Li, Ge Li:
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector. 35-44 - Jianshu Li, Jian Zhao, Yunpeng Chen, Sujoy Roy, Shuicheng Yan, Jiashi Feng, Terence Sim:
Multi-Human Parsing Machines. 45-53 - Xuanyi Dong, Linchao Zhu, De Zhang, Yi Yang, Fei Wu:
Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering. 54-62 - Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan:
Hierarchical Memory Modelling for Video Captioning. 63-71 - Zheng Wang, Xiang Bai, Mang Ye, Shin'ichi Satoh:
Incremental Deep Hidden Attribute Learning. 72-80 - Huarong Chen, Bin Wang, Tianxiang Pan, Liwang Zhou, Hua Zeng:
CropNet: Real-Time Thumbnailing. 81-89 - Zhi-Qi Cheng, Xiao Wu, Siyu Huang, Jun-Xiu Li, Alexander G. Hauptmann, Qiang Peng:
Learning to Transfer: Generalizable Attribute Learning with Multitask Neural Model Search. 90-98 - Yingying Zhu, Jiong Wang, Lingxi Xie, Liang Zheng:
Attention-based Pyramid Aggregation Network for Visual Place Recognition. 99-107 - Changde Du, Changying Du, Hao Wang, Jinpeng Li, Wei-Long Zheng, Bao-Liang Lu, Huiguang He:
Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data. 108-116 - Yuxiao Chen, Jianbo Yuan, Quanzeng You, Jiebo Luo:
Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM. 117-125 - Feifei Zhang, Tianzhu Zhang, Qirong Mao, Lingyu Duan, Changsheng Xu:
Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach. 126-135 - Runnan Li, Zhiyong Wu, Jia Jia, Jingbei Li, Wei Chen, Helen Meng:
Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs. 136-144 - Zhengzhe Liu, Xiaojuan Qi, Lei Pang:
Self-boosted Gesture Interactive System with ST-Net. 145-153 - Felix Kosmalla, Christian Murlowski, Florian Daiber, Antonio Krüger:
Slackliner - An Interactive Slackline Training Assistant. 154-162 - Yaoyu Li, Tianzhu Zhang, Lingyu Duan, Changsheng Xu:
A Unified Generative Adversarial Framework for Image Generation and Person Re-identification. 163-172 - Anahita Mahzari, Afshin Taghavi Nasrabadi, Aliehsan Samiei, Ravi Prakash:
FoV-Aware Edge Caching for Adaptive 360° Video Streaming. 173-181
Keynote 1
- Susanne Boll:
Session details: Keynote 1. - Marianna Obrist:
Don't just Look - Smell, Taste, and Feel the Interaction. 182
FF-2
- Peng Cui:
Session details: FF-2. - Rui Zhang, Sheng Tang, Yu Li, Junbo Guo, Yongdong Zhang, Jintao Li, Shuicheng Yan:
Style Separation and Synthesis via Generative Adversarial Networks. 183-191 - Hao Xiao, Weiyao Lin, Bin Sheng, Ke Lu, Junchi Yan, Jingdong Wang, Errui Ding, Yihao Zhang, Hongkai Xiong:
Group Re-Identification: Leveraging and Integrating Multi-Grain Information. 192-200 - Xu Gao, Tingting Jiang:
OSMO: Online Specific Models for Occlusion in Multiple Object Tracking under Surveillance Scene. 201-210 - Yuke Li:
Video Forecasting with Forward-Backward-Net: Delving Deeper into Spatiotemporal Consistency. 211-219 - Rui Shao, Xiangyuan Lan, Pong C. Yuen:
Feature Constrained by Pixel: Hierarchical Adversarial Deep Domain Adaptation. 220-228 - Zhixing Chen, Di Huang, Yunhong Wang, Liming Chen:
Fast and Light Manifold CNN based 3D Facial Expression Recognition across Pose Variations. 229-238 - Xiaomeng Song, Yucheng Shi, Xin Chen, Yahong Han:
Explore Multi-Step Reasoning in Video Question Answering. 239-247 - Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Nannan Sun, Jianlong Tan, Yongdong Zhang:
Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling. 248-256 - Zhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei Zhang:
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos. 257-264 - Zhihang Fu, Zhongming Jin, Guo-Jun Qi, Chen Shen, Rongxin Jiang, Yaowu Chen, Xian-Sheng Hua:
Previewer for Multi-Scale Object Detector. 265-273 - Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou:
Learning Discriminative Features with Multiple Granularities for Person Re-Identification. 274-282 - Guoxiang Qu, Wenwei Zhang, Zhe Wang, Xing Dai, Jianping Shi, Junjun He, Fei Li, Xiulan Zhang, Yu Qiao:
StripNet: Towards Topology Consistent Strip Structure Segmentation. 283-291 - Samuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild. 292-301 - Can Wang, Shangfei Wang:
Personalized Multiple Facial Action Unit Recognition through Generative Adversarial Recognition Network. 302-310 - Cigdem Beyan, Muhammad Shahid, Vittorio Murino:
Investigation of Small Group Social Interactions Using Deep Visual Activity-Based Nonverbal Features. 311-319 - Eugene Yujun Fu, Michael Xuelin Huang, Hong Va Leong, Grace Ngai:
Cross-Species Learning: A Low-Cost Approach to Learning Human Fight from Animal Fight. 320-327 - Qianli Xu, Vigneshwaran Subbaraju, Chee How Cheong, Aijing Wang, Kathleen Kang, Munirah Bashir, Yanhong Dong, Liyuan Li, Joo-Hwee Lim:
Personalized Serious Games for Cognitive Intervention with Lifelog Visual Analytics. 328-336 - Wendy Bolier, Wolfgang Hürst, Guido van Bommel, Joost Bosman, Harriët Bosman:
Drawing in a Virtual 3D Space - Introducing VR Drawing in Elementary School Art Education. 337-345 - Luca Lovagnini, Wenxiao Zhang, Farshid Hassani Bijarbooneh, Pan Hui:
CIRCE: Real-Time Caching for Instance Recognition on Cloud Environments and Multi-Core Architectures. 346-354 - Wenxiao Zhang, Bo Han, Pan Hui:
Jaguar: Low Latency Mobile Augmented Reality with Flexible Tracking. 355-363
Keynote 2
- Tao Mei:
Session details: Keynote 2. - Xian-Sheng Hua:
Challenges and Practices of Large Scale Visual Intelligence in the Real-World. 364
Deep-1 (Image Translation)
- Nicu Sebe:
Session details: Deep-1 (Image Translation). - Yuheng Zhi, Huawei Wei, Bingbing Ni:
Structure Guided Photorealistic Style Transfer. 365-373 - Xuewen Yang, Dongliang Xie, Xin Wang:
Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation. 374-382 - Bo Zhao, Xiao Wu, Zhi-Qi Cheng, Hao Liu, Zequn Jie, Jiashi Feng:
Multi-View Image Generation from a Single-View. 383-391 - Jichao Zhang, Yezhi Shu, Songhua Xu, Gongze Cao, Fan Zhong, Meng Liu, Xueying Qin:
Sparsely Grouped Multi-Task Generative Adversarial Networks for Facial Attribute Manipulation. 392-401
Vision-1 (Machine Learning)
- Jingkuan Song:
Session details: Vision-1 (Machine Learning). - Jindong Wang, Wenjie Feng, Yiqiang Chen, Han Yu, Meiyu Huang, Philip S. Yu:
Visual Domain Adaptation with Manifold Embedded Distribution Alignment. 402-410 - Zheyan Shen, Peng Cui, Kun Kuang, Bo Li, Peixuan Chen:
Causally Regularized Learning with Agnostic Data Selection Bias. 411-419 - Yanjie Liang, Qiangqiang Wu, Yi Liu, Yan Yan, Hanzi Wang:
Robust Correlation Filter Tracking with Shepherded Instance-Aware Proposals. 420-428 - Fan Qi, Xiaoshan Yang, Changsheng Xu:
A Unified Framework for Multimodal Domain Adaptation. 429-437
Multimedia-1 (Multimedia Recommendation & Discovery)
- Mark Liao:
Session details: Multimedia-1 (Multimedia Recommendation & Discovery). - Shintami Chusnul Hidayati, Cheng-Chun Hsu, Yu-Ting Chang, Kai-Lung Hua, Jianlong Fu, Wen-Huang Cheng:
What Dress Fits Me Best?: Fashion Recommendation on the Clothing Style for Personal Body Shape. 438-446 - Xiaowen Huang, Shengsheng Qian, Quan Fang, Jitao Sang, Changsheng Xu:
CSAN: Contextual Self-Attention Network for User Sequential Recommendation. 447-455 - Jun Hu, Shengsheng Qian, Quan Fang, Changsheng Xu:
Attentive Interactive Convolutional Matching for Community Question Answering in Social Multimedia. 456-464 - Francesco Gelli, Tiberio Uricchio, Xiangnan He, Alberto Del Bimbo, Tat-Seng Chua:
Beyond the Product: Discovering Image Posts for Brands in Social Media. 465-473
Vision-2 (Object & Scene Understanding)
- Zheng-Jun Zha:
Session details: Vision-2 (Object & Scene Understanding). - Lishi Zhang, Chenghan Fu, Jia Li:
Collaborative Annotation of Semantic Objects in Images with Multi-granularity Supervisions. 474-482 - Mengyang Pu, Yaping Huang, Qingji Guan, Qi Zou:
GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic Segmentation. 483-491 - Hengcan Shi, Hongliang Li, Qingbo Wu, Fanman Meng, King N. Ngan:
Boosting Scene Parsing Performance via Reliable Scale Prediction. 492-500 - Fan Zhu, Li Liu, Jin Xie, Fumin Shen, Ling Shao, Yi Fang:
Learning to Synthesize 3D Indoor Scenes from Monocular Images. 501-509
Multimodal-1 (Multimodal Reasoning)
- Xian-Sheng Hua:
Session details: Multimodal-1 (Multimodal Reasoning). - Chaojun Han, Fumin Shen, Li Liu, Yang Yang, Heng Tao Shen:
Visual Spatial Attention Network for Relationship Detection. 510-518 - Chenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong:
Object-Difference Attention: A Simple Relational Attention for Visual Question Answering. 519-527 - Jinwei Qi, Yuxin Peng, Yunkan Zhuo:
Life-long Cross-media Correlation Learning. 528-536 - Yue Gu, Xinyu Li, Kaixiang Huang, Shiyu Fu, Kangning Yang, Shuhong Chen, Moliang Zhou, Ivan Marsic:
Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder. 537-545
System-1 (Video Analysis & Streaming)
- Xin Yang:
Session details: System-1 (Video Analysis & Streaming). - Wentao Liu, Zhengfang Duanmu, Zhou Wang:
End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks. 546-554 - Ibrahim Ben Mustafa, Tamer Nadeem, Emir Halepovic:
FlexStream: Towards Flexible Adaptive Video Streaming on End Devices using Extreme SDN. 555-563 - Lan Xie, Xinggong Zhang, Zongming Guo:
CLS: A Cross-user Learning based System for Improving QoE in 360-degree Video Adaptive Streaming. 564-572 - Abdelhak Bentaleb, Ali C. Begen, Saad Harous, Roger Zimmermann:
A Distributed Approach for Bitrate Selection in HTTP Adaptive Streaming. 573-581
FF-3
- Zhu Li:
Session details: FF-3. - Qing Zhang, Ganzhao Yuan, Chunxia Xiao, Lei Zhu, Wei-Shi Zheng:
High-Quality Exposure Correction of Underexposed Photos. 582-590 - Qianqian Xu, Jiechao Xiong, Xinwei Sun, Zhiyong Yang, Xiaochun Cao, Qingming Huang, Yuan Yao:
A Margin-based MLE for Crowdsourced Partial Ranking. 591-599 - Ana Garcia del Molino, Michael Gygli:
PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation. 600-608 - Lu Pang, Yaowei Wang, Yi-Zhe Song, Tiejun Huang, Yonghong Tian:
Cross-Domain Adversarial Feature Learning for Sketch Re-identification. 609-617 - Quan Chen, Tiezheng Ge, Yanyu Xu, Zhiqiang Zhang, Xinxin Yang, Kun Gai:
Semantic Human Matting. 618-626 - Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan:
Geometry Guided Adversarial Facial Expression Synthesis. 627-635 - Siqi Wang, Yijie Zeng, Qiang Liu, Chengzhang Zhu, En Zhu, Jianping Yin:
Detecting Abnormality without Knowing Normality: A Two-stage Approach for Unsupervised Video Abnormal Event Detection. 636-644 - Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin:
BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network. 645-653 - Xianghui Luo, Zhuo Su, Jiaming Guo, Gengwei Zhang, Xiangjian He:
Trusted Guidance Pyramid Network for Human Parsing. 654-662 - Jingjing Li, Lei Zhu, Zi Huang, Ke Lu, Jidong Zhao:
I read, I saw, I tell: Texts Assisted Fine-Grained Visual Classification. 663-671 - Ziwei Wang, Yadan Luo, Yang Li, Zi Huang, Hongzhi Yin:
Look Deeper See Richer: Depth-aware Image Paragraph Captioning. 672-680 - Huaiwen Zhang, Quan Fang, Shengsheng Qian, Changsheng Xu:
Learning Multimodal Taxonomy via Variational Deep Graph Embedding and Clustering. 681-689 - Junyu Gao, Tianzhu Zhang, Changsheng Xu:
Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling. 690-699 - Yongcheng Liu, Lu Sheng, Jing Shao, Junjie Yan, Shiming Xiang, Chunhong Pan:
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection. 700-708 - Jiayu Wang, Wengang Zhou, Jinhui Tang, Zhongqian Fu, Qi Tian, Houqiang Li:
Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation. 709-717 - Yangbangyan Jiang, Zhiyong Yang, Qianqian Xu, Xiaochun Cao, Qingming Huang:
When to Learn What: Deep Cognitive Subspace Clustering. 718-726 - Wendong Zhang, Feng Gao, Bingbing Ni, Lingyu Duan, Yichao Yan, Jingwei Xu, Xiaokang Yang:
Depth Structure Preserving Scene Image Generation. 727-736 - Jiawei Liu, Zheng-Jun Zha, Hongtao Xie, Zhiwei Xiong, Yongdong Zhang:
CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification. 737-745 - Gusi Te, Wei Hu, Amin Zheng, Zongming Guo:
RGCNN: Regularized Graph CNN for Point Cloud Segmentation. 746-754 - Bin Liu, Yue Cao, Mingsheng Long, Jianmin Wang, Jingdong Wang:
Deep Triplet Quantization. 755-763
Keynote 3
- Jiebo Luo:
Session details: Keynote 3. - Ernest A. Edmonds:
What has Art Got to do With It? 773
Best Paper Session
- Rainer Lienhart, Tao Mei:
Session details: Best Paper Session. - Hao Tang, Wei Wang, Dan Xu, Yan Yan, Nicu Sebe:
GestureGAN for Hand Gesture-to-Gesture Translation in the Wild. 774-782 - Bei Liu, Jianlong Fu, Makoto P. Kato, Masatoshi Yoshikawa:
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training. 783-791 - Jian Zhao, Jianshu Li, Yu Cheng, Terence Sim, Shuicheng Yan, Jiashi Feng:
Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing. 792-800 - Lizi Liao, Yunshan Ma, Xiangnan He, Richang Hong, Tat-Seng Chua:
Knowledge-aware Multimodal Dialogue Systems. 801-809
Doctoral Symposium
- Meng Wang:
Session details: Doctoral Symposium. - Na Zhao:
End2End Semantic Segmentation for 3D Indoor Scenes. 810-814 - Sabrina Kletz:
On Reducing Effort in Evaluating Laparoscopic Skills. 815-819 - Tianran Hu:
Decode Human Life from Social Media. 820-824
FF-4
- Wen-Huang Cheng:
Session details: FF-4. - Yiling Wu, Shuhui Wang, Qingming Huang:
Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. 825-833 - Zhendong Mao, Quan Wang, Yongdong Zhang, Bin Wang:
Post Tuned Hashing: A New Approach to Indexing High-dimensional Data. 834-842 - Meng Liu, Xiang Wang, Liqiang Nie, Qi Tian, Baoquan Chen, Tat-Seng Chua:
Cross-modal Moment Localization in Videos. 843-851 - Zhaoda Ye, Yuxin Peng:
Multi-Scale Correlation for Sequential Cross-modal Hashing Learning. 852-860 - Litao Yu, Yongsheng Gao, Jun Zhou:
Generative Adversarial Product Quantisation. 861-869 - Yubin Deng, Chen Change Loy, Xiaoou Tang:
Aesthetic-Driven Image Enhancement by Adversarial Learning. 870-878 - Kekai Sheng, Weiming Dong, Chongyang Ma, Xing Mei, Feiyue Huang, Bao-Gang Hu:
Attention-based Multi-Patch Aggregation for Image Aesthetic Assessment. 879-886 - Zheqi He, Yafeng Zhou, Yongtao Wang, Siwei Wang, Xiaoqing Lu, Zhi Tang, Ling Cai:
An End-to-End Quadrilateral Regression Network for Comic Panel Extraction. 887-895 - Xin Yang, Jinyu Chen, Zhiwei Wang, Qiaozhe Zhang, Wenyu Liu, Chunyuan Liao, Kwang-Ting Cheng:
Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network. 896-904 - Xiaojing Ma, Changming Liu, Sixing Cao, Bin Zhu:
JPEG Decompression in the Homomorphic Encryption Domain. 905-913 - Mengbai Xiao, Shuoqian Wang, Chao Zhou, Li Liu, Zhenhua Li, Yao Liu, Songqing Chen:
MiniView Layout for Bandwidth-Efficient 360-Degree Video. 914-922 - Guoxian Song, Jianfei Cai, Tat-Jen Cham, Jianmin Zheng, Juyong Zhang, Henry Fuchs:
Real-time 3D Face-Eye Performance Capture of a Person Wearing VR Headset. 923-931 - Chen Li, Mai Xu, Xinzhe Du, Zulin Wang:
Bridge the Gap Between VQA and Human Behavior on Omnidirectional Video: A Large-Scale Dataset and a Deep Learning Model. 932-940 - Zongpu Zhang, Yang Hua, Tao Song, Zhengui Xue, Ruhui Ma, Neil Martin Robertson, Haibing Guan:
Tracking-assisted Weakly Supervised Online Visual Object Segmentation in Unconstrained Videos. 941-949 - Praveen Tirupattur, Yogesh Singh Rawat, Concetto Spampinato, Mubarak Shah:
ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial Network. 950-958 - Xiaoju Zheng, Zheng-Jun Zha, Liansheng Zhuang:
A Feature-Adaptive Semi-Supervised Framework for Co-saliency Detection. 959-966 - Jogendra Nath Kundu, Aditya Ganeshan, Rahul M. V., Aditya Prakash, Venkatesh Babu R.:
iSPA-Net: Iterative Semantic Pose Alignment Network. 967-975 - Litong Feng, Ziyin Li, Zhanghui Kuang, Wei Zhang:
Extractive Video Summarizer with Memory Augmented Neural Networks. 976-983 - Jing Zhang, Yang Cao, Yang Wang, Chenglin Wen, Chang Wen Chen:
Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images. 984-992 - Jingjia Huang, Nannan Li, Jia-Xing Zhong, Thomas H. Li, Ge Li:
Online Action Tube Detection via Resolving the Spatio-temporal Context Pattern. 993-1001 - Zhiwei Fang, Jing Liu, Yanyuan Qiao, Qu Tang, Yong Li, Hanqing Lu:
Enhancing Visual Question Answering Using Dropout. 1002-1010 - Shota Horiguchi, Naoyuki Kanda, Kenji Nagamatsu:
Face-Voice Matching using Cross-modal Embeddings. 1011-1019 - Jingjing Chen, Chong-Wah Ngo, Fuli Feng, Tat-Seng Chua:
Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval. 1020-1028 - Yu Wu, Linchao Zhu, Lu Jiang, Yi Yang:
Decoupled Novel Object Captioner. 1029-1037 - David Semedo, João Magalhães:
Temporal Cross-Media Retrieval with Soft-Smoothing. 1038-1046 - Yu Song, Fan Tang, Weiming Dong, Xiaopeng Zhang, Oliver Deussen, Tong-Yee Lee:
Photo Squarization by Deep Multi-Operator Retargeting. 1047-1055 - Guanbin Li, Xiang He, Wei Zhang, Huiyou Chang, Le Dong, Liang Lin:
Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining. 1056-1064 - Pu Zhao, Sijia Liu, Yanzhi Wang, Xue Lin:
An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural Networks. 1065-1073 - Jiwei Yang, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, Xian-Sheng Hua:
Local Convolutional Neural Networks for Person Re-Identification. 1074-1082 - Zhihe Lu, Tanhao Hu, Lingxiao Song, Zhaoxiang Zhang, Ran He:
Conditional Expression Synthesis with Face Parsing Transformation. 1083-1091 - Liang Li, Shuhui Wang, Shuqiang Jiang, Qingming Huang:
Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification. 1092-1100 - Jatin Garg, Skand Vishwanath Peri, Himanshu Tolani, Narayanan C. Krishnan:
Deep Cross Modal Learning for Caricature Verification and Identification (CaVINet). 1101-1109 - Nakamasa Inoue, Koichi Shinoda:
Few-Shot Adaptation for Multimedia Semantic Indexing. 1110-1118 - Zhengzhong Zhou, Xiu Di, Wei Zhou, Liqing Zhang:
Fashion Sensitive Clothing Recommendation Using Hierarchical Collocation Model. 1119-1127 - Yihang Lou, Yan Bai, Shiqi Wang, Ling-Yu Duan:
Multi-Scale Context Attention Network for Image Retrieval. 1128-1136 - Yibing Zhan, Jun Yu, Zhou Yu, Rong Zhang, Dacheng Tao, Qi Tian:
Comprehensive Distance-Preserving Autoencoders for Cross-Modal Retrieval. 1137-1145 - Xusong Chen, Dong Liu, Zheng-Jun Zha, Wengang Zhou, Zhiwei Xiong, Yan Li:
Temporal Hierarchical Attention at Category- and Item-Level for Micro-Video Click-Through Prediction. 1146-1153 - Jufeng Yang, Liyi Chen, Le Zhang, Xiaoxiao Sun, Dongyu She, Shao-Ping Lu, Ming-Ming Cheng:
Historical Context-based Style Classification of Painting Images via Label Distribution Learning. 1154-1162 - Hao Wu, Zhengxing Sun, Weihang Yuan:
Direction-aware Neural Style Transfer. 1163-1171 - Bin He, Feng Gao, Daiqian Ma, Boxin Shi, Ling-Yu Duan:
ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style Transfer. 1172-1180 - Teemu Kämäräinen, Matti Siekkinen, Jukka Eerikäinen, Antti Ylä-Jääski:
CloudVR: Cloud Accelerated Interactive Mobile Virtual Reality. 1181-1189 - Anh Nguyen, Zhisheng Yan, Klara Nahrstedt:
Your Attention is Unique: Detecting 360-Degree Video Saliency in Head-Mounted Display for Head Movement Prediction. 1190-1198 - Yiting Shao, Qi Zhang, Ge Li, Zhu Li, Li Li:
Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction. 1199-1207 - Tianchi Huang, Rui-Xiao Zhang, Chao Zhou, Lifeng Sun:
QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning. 1208-1216 - Haitian Pang, Cong Zhang, Fangxin Wang, Han Hu, Zhi Wang, Jiangchuan Liu, Lifeng Sun:
Optimizing Personalized Interaction Experience in Crowd-Interactive Livecast: A Cloud-Edge Approach. 1217-1225
Demo + Video + Makers' Program
- Kwanghoon Sohn, Yong Man Ro:
Session details: Demo + Video + Makers' Program. - Songyou Peng, Le Zhang, Stefan Winkler, Marianne Winslett:
Give Me One Portrait Image, I Will Tell You Your Emotion and Personality. 1226-1227 - Yang Liu, Yang Yang, Weidong Fang, Wuxiong Zhang:
Demo: Phase-based Acoustic Localization and Motion Tracking for Mobile Interaction. 1228-1230 - Cunjun Zhang, Kehua Lei, Jia Jia, Yihui Ma, Zhiyuan Hu:
AI Painting: An Aesthetic Painting Generation System. 1231-1233 - Aleksandr Farseev, Kirill Lepikhin, Hendrik Schwartz, Eu Khoon Ang, Kenny Powar:
SoMin.ai: Social Multimedia Influencer Discovery Marketplace. 1234-1236 - Taoran Tang, Hanyang Mao, Jia Jia:
AniDance: Real-Time Dance Motion Synthesize to the Song. 1237-1239 - Gjorgji Strezoski, Inske Groenen, Jurriaan Besenbruch, Marcel Worring:
ArtSight: An Artistic Data Exploration Engine. 1240-1241 - Yoonjung Park, Yoonsik Yang, Hyocheol Ro, Junghyun Byun, Seougho Chae, Tack-Don Han:
Meet AR-bot: Meeting Anywhere, Anytime with Movable Spatial AR Robot. 1242-1243 - Ryosuke Tanno, Daichi Horita, Wataru Shimoda, Keiji Yanai:
Magical Rice Bowl: A Real-time Food Category Changer. 1244-1246 - Haolin Ren, Benjamin Renoust, Guy Melançon, Marie-Luce Viaud, Shin'ichi Satoh:
Exploring Temporal Communities in Mass Media Archives. 1247-1249 - Matthias Zeppelzauer, Alexis Ringot, Florian Taurer:
SoniControl - A Mobile Ultrasonic Firewall. 1250-1252 - Mohammed Habibullah Baig, Jibin Rajan Varghese, Zhangyang Wang:
MusicMapp: A Deep Learning Based Solution for Music Exploration and Visual Interaction. 1253-1255 - Paula Gómez Duran, Eva Mohedano, Kevin McGuinness, Xavier Giró-i-Nieto, Noel E. O'Connor:
Demonstration of an Open Source Framework for Qualitative Evaluation of CBIR Systems. 1256-1257 - Yun-Gyung Cheong, Woo-Hyun Park, Hye-Yeon Yu:
A Demonstration of an Intelligent Storytelling System. 1258-1259 - Yaohua Bu, Jia Jia, Xiang Li, Suping Zhou, Xiaobo Lu:
IcooBook: When the Picture Book for Children Encounters Aesthetics of Interaction. 1260-1262 - Thomas Forgione, Axel Carlier, Géraldine Morin, Wei Tsang Ooi, Vincent Charvillat, Praveen Kumar Yadav:
An Implementation of a DASH Client for Browsing Networked Virtual Environment. 1263-1264 - Lizi Liao, You Zhou, Yunshan Ma, Richang Hong, Tat-Seng Chua:
Knowledge-aware Multimodal Fashion Chatbot. 1265-1266 - Alex Lee, Chang-Uk Kwak, Jeong-Woo Son, Sun-Joong Kim:
SVIAS: Scene-segmented Video Information Annotation System. 1267-1269 - Chang-Uk Kwak, Minho Han, Sun-Joong Kim, Gyeong-June Hahm:
Interactive Story Maker: Tagged Video Retrieval System for Video Re-creation Service. 1270-1271 - Xingyu Liu, Jingfan Guo, Tongwei Ren, Yahong Han, Lei Huang, Gangshan Wu:
HeterStyle: A Heterogeneous Video Style Transfer Application. 1272-1273 - Hyocheol Ro, Inhwan Kim, Junghyun Byun, Yoonsik Yang, Yoonjung Park, Seungho Chae, Tack-Don Han:
PAMI: Projection Augmented Meeting Interface for Video Conferencing. 1274-1277 - Yoonjung Park, Yoonsik Yang, Hyocheol Ro, Jinwon Cha, Kyuri Kim, Tack-Don Han:
ChildAR-bot: Educational Playing Projection-based AR Robot for Children. 1278-1282
Deep-2 (Recognition)
- Qin Jin:
Session details: Deep-2 (Recognition). - Yansong Tang, Zian Wang, Peiyang Li, Jiwen Lu, Ming Yang, Jie Zhou:
Mining Semantics-Preserving Attention for Group Activity Recognition. 1283-1291 - Rui Yan, Jinhui Tang, Xiangbo Shu, Zechao Li, Qi Tian:
Participation-Contributed Temporal Dynamic Model for Group Activity Recognition. 1292-1300 - Peiqin Zhuang, Yali Wang, Yu Qiao:
WildFish: A Large Benchmark for Fish Recognition in the Wild. 1301-1309 - Haoxuan You, Yifan Feng, Rongrong Ji, Yue Gao:
PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition. 1310-1318
Multimedia-2 (Socical & Emotional Multimedia)
- Rongrong Ji:
Session details: Multimedia-2 (Socical & Emotional Multimedia). - Sicheng Zhao, Xin Zhao, Guiguang Ding, Kurt Keutzer:
EmotionGAN: Unsupervised Domain Adaptation for Learning Discrete Probability Distributions of Image Emotions. 1319-1327 - Pei Lv, Meng Wang, Yongbo Xu, Ze Peng, Junyi Sun, Shi-Mei Su, Bing Zhou, Mingliang Xu:
USAR: An Interactive User-specific Aesthetic Ranking Framework for Images. 1328-1336 - Ekraam Sabir, Wael AbdAlmageed, Yue Wu, Prem Natarajan:
Deep Multimodal Image-Repurposing Detection. 1337-1345 - Bowen Pan, Shangfei Wang:
Facial Expression Recognition Enhanced by Thermal Images through Adversarial Learning. 1346-1353
Panel-1
- Jun Jitao, Yu Sang:
Session details: Panel-1. - Jitao Sang, Jun Yu, Ramesh C. Jain, Rainer Lienhart, Peng Cui, Jiashi Feng:
Deep Learning for Multimedia: Science or Technology? 1354-1355
Open Source Software Competition
- Min-Chun Hu:
Session details: Open Source Software Competition. - Kuan-Ting Lai, Chia-Chih Lin, Chun-Yao Kang, Mei-Enn Liao, Ming-Syan Chen:
VIVID: Virtual Environment for Visual Deep Learning. 1356-1359 - Tsung-Wei Huang, Chun-Xun Lin, Guannan Guo, Martin D. F. Wong:
A General-purpose Distributed Programming System using Data-parallel Streams. 1360-1363 - Konstantinos Zampogiannis, Cornelia Fermüller, Yiannis Aloimonos:
cilantro: A Lean, Versatile, and Efficient Library for Point Cloud Data Processing. 1364-1367 - Matthieu Pizenberg, Axel Carlier, Emmanuel Faure, Vincent Charvillat:
Web-Based Configurable Image Annotations. 1368-1371
Vision-3 (Applications in Multimedia)
- Zheng-Jun Zha:
Session details: Vision-3 (Applications in Multimedia). - Xiangteng He, Yuxin Peng:
Only Learn One Sample: Fine-Grained Visual Categorization with One Sample Training. 1372-1380 - Kecheng Zheng, Zheng-Jun Zha, Yang Cao, Xuejin Chen, Feng Wu:
LA-Net: Layout-Aware Dense Network for Monocular Depth Estimation. 1381-1388 - Ziqing Huang, Shiguang Liu:
Robustness and Discrimination Oriented Hashing Combining Texture and Invariant Vector Distance. 1389-1397 - Shuhui Wang, Yangyu Chen, Junbao Zhuo, Qingming Huang, Qi Tian:
Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval. 1398-1406
Multimodal-2 (Cross-Modal Translation)
- Xian-Sheng Hua:
Session details: Multimodal-2 (Cross-Modal Translation). - Mingkuan Yuan, Yuxin Peng:
Text-to-image Synthesis via Symmetrical Distillation Networks. 1407-1415 - Daqing Liu, Zheng-Jun Zha, Hanwang Zhang, Yongdong Zhang, Feng Wu:
Context-Aware Visual Policy Network for Sequence-Level Image Captioning. 1416-1424 - Sheng Liu, Zhou Ren, Junsong Yuan:
SibNet: Sibling Convolutional Encoder for Video Captioning. 1425-1434 - Wenbin Che, Xiaopeng Fan, Ruiqin Xiong, Debin Zhao:
Paragraph Generation Network with Visual Relationship Detection. 1435-1443
Panel-2
- Jiaying Liu, Wen-Huang Cheng:
Session details: Panel-2. - Wen-Huang Cheng, Jiaying Liu, Mohan S. Kankanhalli, Abdulmotaleb El-Saddik, Benoit Huet:
AI + Multimedia Make Better Life? 1455-1456
FF-5
- Zhu Li:
Session details: FF-5. - Na Jiang, Sichen Bai, Yue Xu, Chang Xing, Zhong Zhou, Wei Wu:
Online Inter-Camera Trajectory Association Exploiting Person Re-Identification and Camera Topology. 1457-1465 - Jing Zhu, Yi Fang:
Learning Local Descriptors with Adversarial Enhancer from Volumetric Geometry Patches. 1466-1474 - Zhen Cui, Chunyan Xu, Wenming Zheng, Jian Yang:
Context-Dependent Diffusion Network for Visual Relationship Detection. 1475-1482 - Shuo Wang, Dan Guo, Wengang Zhou, Zheng-Jun Zha, Meng Wang:
Connectionist Temporal Fusion for Sign Language Translation. 1483-1491 - Kai Li, Zhengming Ding, Kunpeng Li, Yulun Zhang, Yun Fu:
Support Neighbor Loss for Person Re-Identification. 1492-1500 - Bing Li, Chia-Wen Lin, Shan Liu, Tiejun Huang, Wen Gao, C.-C. Jay Kuo:
Perceptual Temporal Incoherence Aware Stereo Video Retargeting. 1501-1509 - Yanli Ji, Feixiang Xu, Yang Yang, Fumin Shen, Heng Tao Shen, Wei-Shi Zheng:
A Large-scale RGB-D Database for Arbitrary-view Human Action Recognition. 1510-1518 - Huiyun Wang, Youjiang Xu, Yahong Han:
Spotting and Aggregating Salient Regions for Video Captioning. 1519-1526 - Qixian Zhou, Xiaodan Liang, Ke Gong, Liang Lin:
Adaptive Temporal Encoding Network for Video Instance-level Human Parsing. 1527-1535 - Yuanzheng Ci, Xinzhu Ma, Zhihui Wang, Haojie Li, Zhongxuan Luo:
User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks. 1536-1544 - Tianli Zhao, Xiangyu He, Jian Cheng, Jing Hu:
BitStream: Efficient Computing Architecture for Real-Time Low-Power Inference of Binary Neural Networks on CPUs. 1545-1552 - Lingbo Liu, Ruimao Zhang, Jiefeng Peng, Guanbin Li, Bowen Du, Liang Lin:
Attentive Crowd Flow Machines. 1553-1561 - Deqiang Ouyang, Jie Shao, Yonghui Zhang, Yang Yang, Heng Tao Shen:
Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning Framework. 1562-1570 - Lizi Liao, Xiangnan He, Bo Zhao, Chong-Wah Ngo, Tat-Seng Chua:
Interpretable Multimodal Retrieval for Fashion Products. 1571-1579 - Chieh-Yu Chen, Wenze Lai, Hsin-Ying Hsieh, Wen-Hao Zheng, Yu-Shuen Wang, Jung-Hong Chuang:
Generating Defensive Plays in Basketball Games. 1580-1588 - Hong Liu, Mingbao Lin, Shengchuan Zhang, Yongjian Wu, Feiyue Huang, Rongrong Ji:
Dense Auto-Encoder Hashing for Robust Cross-Modality Retrieval. 1589-1597 - Taoran Tang, Jia Jia, Hanyang Mao:
Dance with Melody: An LSTM-autoencoder Approach to Music-oriented Dance Synthesis. 1598-1606 - Gong Chen, Yan Liu, Sheng-hua Zhong, Xiang Zhang:
Musicality-Novelty Generative Adversarial Nets for Algorithmic Composition. 1607-1615 - Divyashri Bhat, Rajvardhan Somraj Deshmukh, Michael Zink:
Improving QoE of ABR Streaming Sessions through QUIC Retransmissions. 1616-1624 - Ziqian Chen, Shiqi Wang, Dapeng Oliver Wu, Tiejun Huang, Ling-Yu Duan:
From Data to Knowledge: Deep Learning Model Compression, Transmission and Communication. 1625-1633
Keynote 4
- Kyoung Mu Lee:
Session details: Keynote 4. - Gary Geunbae Lee:
Living with AI in Connected Devices for valuable Experience. 1634
Multimedia -3 (Multimedia Search)
- Jitao Sang:
Session details: Multimedia -3 (Multimedia Search). - Mingbao Lin, Rongrong Ji, Hong Liu, Yongjian Wu:
Supervised Online Hashing via Hadamard Codebook Learning. 1635-1643 - Yuanqiang Fang, Wengang Zhou, Yijuan Lu, Jinhui Tang, Qi Tian, Houqiang Li:
Cascaded Feature Augmentation with Diffusion for Image Retrieval. 1644-1652 - Zhangjie Cao, Ziping Sun, Mingsheng Long, Jianmin Wang, Philip S. Yu:
Deep Priority Hashing. 1653-1661 - Xingbo Liu, Xiushan Nie, Wenjun Zeng, Chaoran Cui, Lei Zhu, Yilong Yin:
Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels. 1662-1669
Experience-1 (Multimedia Entertainment and Experience)
- Zhisheng Yan:
Session details: Experience-1 (Multimedia Entertainment and Experience). - Shuai Zheng, Fan Yang, M. Hadi Kiapour, Robinson Piramuthu:
ModaNet: A Large-scale Street Fashion Dataset with Polygon Annotations. 1670-1678 - Dania Murad, Riwu Wang, Douglas Turnbull, Ye Wang:
SLIONS: A Karaoke Application to Enhance Foreign Language Learning. 1679-1687 - Shuai Yang, Jiaying Liu, Wenhan Yang, Zongming Guo:
Context-Aware Unsupervised Text Stylization. 1688-1696 - Jun Kato, Masa Ogata, Takahiro Inoue, Masataka Goto:
Songle Sync: A Large-Scale Web-based Platform for Controlling Various Devices in Synchronization with Music. 1697-1705
System-2 (Smart Multimedia Systems)
- Yijuan Lu:
Session details: System-2 (Smart Multimedia Systems). - Weidong Geng, Feilin Han, Jiangke Lin, Liuyi Zhu, Jieming Bai, Suzhen Wang, Lin He, Qiang Xiao, Zhangjiong Lai:
Fine-Grained Grocery Product Recognition by One-Shot Learning. 1706-1714 - Yusuke Matsui, Ryota Hinami, Shin'ichi Satoh:
Reconfigurable Inverted Index. 1715-1723 - Hiroshi Sankoh, Sei Naito, Keisuke Nonaka, Houari Sabirin, Jun Chen:
Robust Billboard-based, Free-viewpoint Video Synthesis Algorithm to Overcome Occlusions under Challenging Outdoor Sport Scenes. 1724-1732 - Wei Cheng, Lan Xu, Lei Han, Yuanfang Guo, Lu Fang:
iHuman3D: Intelligent Human Body 3D Reconstruction using a Single Flying Camera. 1733-1741
FF-6
- Benoit Huet:
Session details: FF-6. - Lianli Gao, Pengpeng Zeng, Jingkuan Song, Xianglong Liu, Heng Tao Shen:
Examine before You Answer: Multi-task Learning with Adaptive-attentions for Multiple-choice VQA. 1742-1750 - Zhiwen Fan, Huafeng Wu, Xueyang Fu, Yue Huang, Xinghao Ding:
Residual-Guide Network for Single Image Deraining. 1751-1759 - Zhengyu Zhao, Martha A. Larson:
From Volcano to Toyshop: Adaptive Discriminative Region Discovery for Scene Recognition. 1760-1768 - Joshua Sowerby, Yang Zhang, Dimitris Agrafiotis:
The Effect of Foveation on High Dynamic Range Video Perception. 1769-1776 - Wenxue Cui, Feng Jiang, Xinwei Gao, Shengping Zhang, Debin Zhao:
An Efficient Deep Quantized Compressed Sensing Coding Framework of Natural Images. 1777-1785 - Diep Thi Ngoc Nguyen, Hideki Nakayama, Naoaki Okazaki, Tatsuya Sakaeda:
PoB: Toward Reasoning Patterns of Beauty in Image Data. 1786-1793 - Nan Xu, Yanqing Guo, Xin Zheng, Qianyu Wang, Xiangyang Luo:
Partial Multi-view Subspace Clustering. 1794-1801 - Teng Long, Xing Xu, Youyou Li, Fumin Shen, Jingkuan Song, Heng Tao Shen:
Pseudo Transfer with Marginalized Corrupted Attribute for Zero-shot Learning. 1802-1810 - Guangxing Han, Xuan Zhang, Chongrong Li:
Semi-Supervised DFF: Decoupling Detection and Feature Flow for Video Object Detectors. 1811-1819 - Lingjing Wang, Cheng Qian, Jifei Wang, Yi Fang:
Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn Sketches. 1820-1828 - Sibo Song, Ngai-Man Cheung, Vijay Chandrasekhar, Bappaditya Mandal:
Deep Adaptive Temporal Pooling for Activity Recognition. 1829-1837 - Mingyong Zeng, Chang Tian, Zemin Wu:
Person Re-identification with Hierarchical Deep Learning Feature and efficient XQDA Metric. 1838-1846 - Jingkuan Song, Zhilong Zhou, Lianli Gao, Xing Xu, Heng Tao Shen:
Cumulative Nets for Edge Detection. 1847-1855 - Niluthpol Chowdhury Mithun, Rameswar Panda, Evangelos E. Papalexakis, Amit K. Roy-Chowdhury:
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval. 1856-1864 - Yangyang Guo, Zhiyong Cheng, Liqiang Nie, Xin-Shun Xu, Mohan S. Kankanhalli:
Multi-modal Preference Modeling for Product Search. 1865-1873 - Feiran Huang, Xiaoming Zhang, Zhoujun Li:
Learning Joint Multimodal Representation with Adversarial Attention Networks. 1874-1882 - Binbing Liao, Jingqing Zhang, Ming Cai, Siliang Tang, Yifan Gao, Chao Wu, Shengwen Yang, Wenwu Zhu, Yike Guo, Fei Wu:
Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction. 1883-1891 - Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann:
Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization. 1892-1900 - Zhenyu Tang, Nicolás Morales, Dinesh Manocha:
Dynamic Sound Field Synthesis for Speech and Music Optimization. 1901-1909 - Thomas Forgione, Axel Carlier, Géraldine Morin, Wei Tsang Ooi, Vincent Charvillat, Praveen Kumar Yadav:
DASH for 3D Networked Virtual Environment. 1910-1918
Keynote 5
- Wenwu Zhu:
Session details: Keynote 5. - Bowen Zhou:
Transforming Retailing Experiences with Artificial Intelligence. 1919-1920
Deep-3 (Image Processing-Inpainting, Super-Resolution, Deblurring)
- Shuqiang Jiang:
Session details: Deep-3 (Image Processing-Inpainting, Super-Resolution, Deblurring). - Risheng Liu, Yi He, Shichao Cheng, Xin Fan, Zhongxuan Luo:
Learning Collaborative Generation Correction Modules for Blind Image Deblurring and Beyond. 1921-1929 - Minghao Yin, Yongbing Zhang, Xiu Li, Shiqi Wang:
When Deep Fool Meets Deep Prior: Adversarial Attack on Super-Resolution Network. 1930-1938 - Haoran Zhang, Zhenzhen Hu, Changzhi Luo, Wangmeng Zuo, Meng Wang:
Semantic Image Inpainting with Progressive Generative Networks. 1939-1947 - Huy V. Vo, Ngoc Q. K. Duong, Patrick Pérez:
Structural inpainting. 1948-1956
Brand New Ideas
- Kiyoharu Aizawa:
Session details: Brand New Ideas. - Mykhaylo Andriluka, Jasper R. R. Uijlings, Vittorio Ferrari:
Fluid Annotation: A Human-Machine Collaboration Interface for Full Image Annotation. 1957-1966 - Lixin Liu, Xiaojun Wan, Zongming Guo:
Images2Poem: Generating Chinese Poetry from Image Streams. 1967-1975 - Yaman Kumar, Mayank Aggarwal, Pratham Nawal, Shin'ichi Satoh, Rajiv Ratn Shah, Roger Zimmermann:
Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed. 1976-1983 - Kanchan Bahirat, Umang Shah, Alvaro A. Cárdenas, Balakrishnan Prabhakaran:
ALERT: Adding a Secure Layer in Decision Support for Advanced Driver Assistance System (ADAS). 1984-1992 - Nitish Nag, Vaibhav Pandey, Preston J. Putzel, Hari Bhimaraju, Srikanth Krishnan, Ramesh C. Jain:
Cross-Modal Health State Estimation. 1993-2002
Grand Challenge-1
- Shuqiang Jiang:
Session details: Grand Challenge-1. - Liuwu Li, Sihong Huang, Ziliang He, Wenyin Liu:
An Effective Text-based Characterization Combined with Numerical Features for Social Media Headline Prediction. 2003-2007 - Chih-Chung Hsu, Chia-Yen Lee, Ting-Xuan Liao, Jun-Yi Lee, Tsai-Yne Hou, Ying-Chu Kuo, Jing-Wen Lin, Ching-Yi Hsueh, Zhong-Xuan Zhang, Hsiang-Chin Chien:
An Iterative Refinement Approach for Social Media Headline Prediction. 2008-2012 - Feitao Huang, Junhong Chen, Zehang Lin, Peipei Kang, Zhenguo Yang:
Random Forest Exploiting Post-related and User-related Features for Social Media Popularity Prediction. 2013-2017 - Xusong Chen, Rui Zhao, Shengjie Ma, Dong Liu, Zheng-Jun Zha:
Content-Based Video Relevance Prediction with Second-Order Relevance and Attention Modeling. 2018-2022
Vision-4 (Representation Learning)
- Marcel Worring:
Session details: Vision-4 (Representation Learning). - Tianshui Chen, Wenxi Wu, Yuefang Gao, Le Dong, Xiaonan Luo, Liang Lin:
Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic Embedding. 2023-2031 - Gang Yang, Jinlu Liu, Jieping Xu, Xirong Li:
Dissimilarity Representation Learning for Generalized Zero-Shot Recognition. 2032-2039 - Kai Han, Jianyuan Guo, Chao Zhang, Mingjian Zhu:
Attribute-Aware Attention Model for Fine-grained Representation Learning. 2040-2048 - Siyu Huang, Xi Li, Zhiqi Cheng, Zhongfei Zhang, Alexander G. Hauptmann:
GNAS: A Greedy Neural Architecture Search Method for Multi-Attribute Learning. 2049-2057
Grand Challenge-2
- Shuqiang Jiang:
Session details: Grand Challenge-2. - Jianfeng Dong, Xirong Li, Chaoxi Xu, Gang Yang, Xun Wang:
Feature Re-Learning with Data Augmentation for Content-based Video Recommendation. 2058-2062 - Qi Wang, Jingxiang Lai, Kai Xu, Wenyin Liu, Liang Lei:
Beauty Product Image Retrieval Based on Multi-Feature Fusion and Feature Aggregation. 2063-2067 - Jian Han Lim, Nurul Japar, Chun Chet Ng, Chee Seng Chan:
Unprecedented Usage of Pre-trained CNNs on Beauty Product. 2068-2072 - Zehang Lin, Zhenguo Yang, Feitao Huang, Junhong Chen:
Regional Maximum Activations of Convolutions with Attention for Cross-domain Beauty and Personal Care Product Retrieval. 2073-2077
Interactive Art
- Hyunjung Shim:
Session details: Interactive Art. - Lyn Chao-ling Chen, He-Lin Luo:
Shadow Calligraphy of Dance: An Image-Based Interactive Installation for Capturing Flowing Human Figures. 2078-2080 - Anis Haron, Soon Xuan Yong, Chee-Onn Wong:
Cellular Music: An Interactive Game of Life Sequencer. 2081-2083 - Soon Xuan Yong, Chee-Onn Wong, Kong Cheng Tan, Anis Haron:
TAGapp Visualization: An Application Based Visual Art Installation. 2084-2086
Tutorials
- Jan Sedmidubský, Pavel Zezula:
Similarity-Based Processing of Motion Capture Data. 2087-2089 - Yunchao Wei, Xiaodan Liang, Si Liu, Liang Lin:
Structured Deep Learning for Pixel-level Understanding. 2090-2092 - Jungseock Joo, Zachary C. Steinert-Threlkeld, Jiebo Luo:
Social and Political Event Analysis based on Rich Media. 2093-2095 - Joseph P. Robinson, Ming Shao, Yun Fu:
To Recognize Families In the Wild: A Machine Vision Tutorial. 2096-2097 - Jitao Sang:
Deep Learning Interpretation. 2098-2100 - Klaus Schoeffmann, Werner Bailer, Cathal Gurrin, George Awad, Jakub Lokoc:
Interactive Video Search: Where is the User in the Age of Deep Learning? 2101-2103 - Ting Yao, Jingen Liu:
Human Behavior Understanding: From Action Recognition to Complex Event Detection. 2104-2105 - Michael Riegler, Pål Halvorsen, Bernd Münzer, Klaus Schoeffmann:
The Importance of Medical Multimedia. 2106-2108
Workshop Summaries
- Teresa Chambel, Francesca De Simone, Rene Kaiser, Nimesha Ranasinghe, Wendy Van den Broeck:
AltMM 2018 - 3rd International Workshop on Multimedia Alternate Realities. 2109-2110 - Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Roddy Cowie, Maja Pantic:
Summary for AVEC 2018: Bipolar Disorder and Cross-Cultural Affect Recognition. 2111-2112 - Kwanghoon Sohn, Ming-Hsuan Yang, Hyeran Byun, Jongwoo Lim, Jison Hsu, Stephen Lin, Euntai Kim, Seungryong Kim:
CoVieW'18: The 1st Workshop and Challenge on Comprehensive Video Understanding in the Wild. 2113-2115 - Jochen Meyer, Susanne Boll, Noel E. O'Connor, Ramesh C. Jain, Troy McDaniel:
HealthMedia 2018: Third International Workshop on Multimedia for Personal Health and Health Care. 2116-2117 - Xueliang Liu, Rui Min, Benoit Huet, Jia Jia:
MAHCI 2018: The 1st Workshop on Multimedia for Accessible Human Computer Interface. 2118-2119 - Dong-Yan Huang, Sicheng Zhao, Björn W. Schuller, Hongxun Yao, Jianhua Tao, Min Xu, Lei Xie, Qingming Huang, Jie Yang:
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop. 2120-2121 - Adrian Hilton, Hong-Goo Kang, Hansung Kim, Kwanghoon Sohn:
AVSU: Workshop on Audio-Visual Scene Understanding for Immersive Multimedia. 2122-2124 - Rainer Lienhart, Thomas B. Moeslund, Hideo Saito:
1st ACM International Workshop on Multimedia Content Analysis in Sports. 2125-2126 - Xavier Alameda-Pineda, Miriam Redi, Nicu Sebe, Shih-Fu Chang, Jiebo Luo:
EE-USAD: ACM MM 2018Workshop on UnderstandingSubjective Attributes of Data focus on Evoked Emotions. 2127-2128
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.