ICME 2019: Shanghai, China
- IEEE International Conference on Multimedia and Expo, ICME 2019, Shanghai, China, July 8-12, 2019. IEEE 2019, ISBN 978-1-5386-9552-4
Oral Sessions
Best Paper Session
- Yu Hao, Yanwei Fu, Yu-Gang Jiang, Qi Tian:
An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation. 1-6
- Zunjie Zhu, Feng Xu, Chenggang Yan, Xinhong Hao, Xiangyang Ji, Yongdong Zhang, Qionghai Dai:
Real-time Indoor Scene Reconstruction with RGBD and Inertial Input. 7-12
- Changde Du, Changying Du, Huiguang He:
Doubly Semi-Supervised Multimodal Adversarial Learning for Classification, Generation and Retrieval. 13-18
- Yihang Lou, Ling-Yu Duan, Yong Luo, Ziqian Chen, Tongliang Liu, Shiqi Wang, Wen Gao:
Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm. 19-24
O-01: Content Recommendation and Cross-modal Hashing
- Zhenhua Tan, Danke Wu, Liangliang He, Qiuyun Chang, Bin Zhang:
SDP: An Improved Baseline Estimation Model Based On Standard Deviation Proportion. 25-30
- Jie Chen, Yang Liu, Shu Zhao, Yanping Zhang:
Citation Recommendation Based on Weighted Heterogeneous Information Network Containing Semantic Linking. 31-36
- Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang:
Fusion-Supervised Deep Cross-Modal Hashing. 37-42
- Wei Chen, Nan Pu, Yu Liu, Erwin M. Bakker, Michael S. Lew:
Domain Uncertainty Based On Information Theory for Cross-Modal Hash Retrieval. 43-48
O-02: Development of Multimedia Standards and Related Research
- Eurico Lopes, João Ascenso, Catarina Brites, Fernando Pereira:
Adaptive Plane Projection for Video-Based Point Cloud Coding. 49-54
- Ting Fu, Hao Zhang, Fan Mu, Huanbang Chen:
Fast CU Partitioning Algorithm for H.266/VVC Intra-Frame Coding. 55-60
- Ting Fu, Hao Zhang, Fan Mu, Huanbang Chen:
Two-Stage Fast Multiple Transform Selection Algorithm for VVC Intra Coding. 61-66
- Junru Li, Meng Wang, Li Zhang, Kai Zhang, Hongbin Liu, Shiqi Wang, Siwei Ma, Wen Gao:
History-Based Motion Vector Prediction for Future Video Coding. 67-72
O-03: Classification and Low Shot Learning
- Jingcai Guo, Song Guo:
AMS-SFE: Towards an Alignment of Manifold Structures via Semantic Feature Expansion for Zero-shot Learning. 73-78
- Xuefeng Du, Dexing Zhong, Pengna Li:
Low-Shot Palmprint Recognition Based on Meta-Siamese Network. 79-84
- Zihan Ye, Fan Lyu, Linyan Li, Qiming Fu, Jinchang Ren, Fuyuan Hu:
SR-GAN: Semantic Rectifying Generative Adversarial Network for Zero-shot Learning. 85-90
- Huaxi Huang, Junjie Zhang, Jian Zhang, Qiang Wu, Jingsong Xu:
Compare More Nuanced: Pairwise Alignment Bilinear Network for Few-Shot Fine-Grained Learning. 91-96
O-04: 3D Media Computing
- Gerasimos Arvanitis, Aris S. Lalos, Konstantinos Moustakas:
Feature-Aware and Content-wise Denoising of 3D Static and Dynamic Meshes using Deep Autoencoders. 97-102
- Xinyu Wei, Jun Huang, Xiaoyuan Ma:
Real-Time Monocular Visual SLAM by Combining Points and Lines. 103-108
- Chuanpu Li, Xin Jin, Junke Li, Qionghai Dai:
F-Number Adaptation for Maximizing the Sensor Usage of Light Field Cameras. 109-114
- Xufu Sun, Xin Jin, Pei Wang, Yanqin Chen, Qionghai Dai:
Blind Calibration for Focused Plenoptic Cameras. 115-120
O-05: Special Session "Pedestrian Detection, Tracking and Reidentification in Videos"
- Peizhen Zhang, Feng Zheng, Junlong Du, Jun Zhang, Xiaowei Guo, Wei-Shi Zheng:
Particle Swarm Loss for Lightweight Object Detection. 121-126
- Qiang Fu, Linsen Dong, Ziyuan Liu, Yong Luo, Yonggang Wen, Ying Li, Ling-Yu Duan:
Incorporating Category Taxonomy in Deep Reinforcement Learning Based Image Hashing. 127-132
- Ji Hu, Chenggang Yan, Xin Liu, Jiyong Zhang, Dongliang Peng, Yi Yang:
Truncated Gradient Confidence-Weighted Based Online Learning for Imbalance Streaming Data. 133-138
- Mohamed A. Kassab, Ali Maher, Fathy Elkazzaz, Baochang Zhang:
UAV Target Tracking By Detection via Deep Neural Networks. 139-144
O-06: Special Session "Multimedia Technologies Empowering Retail Experiences"
- Shan An, Zhibiao Huang, Guangfu Che, Xianglong Liu, Xin Ma, Yu Chen:
Quarter-Point Codeword Expansion for Product Quantization. 145-150
- Minghui Zhang, Yumeng Liang, Huadong Ma:
Context-Aware Affective Graph Reasoning for Emotion Recognition. 151-156
- Weibo Zhang, Fuqing Zhu, Jiao Dai, Songlin Hu, Jizhong Han, Tao Guo:
SPL: Exploiting Unlabeled Data for Multi-label Image Classification. 157-162
- Yu Zhou, Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
MLTS: A Multi-Language Scene Text Spotter. 163-168
O-07: 3D and Low Level Vision
- Xinchen Ye, Mingliang Zhang, Rui Xu, Wei Zhong, Xin Fan, Zhu Liu, Jiaao Zhang:
Unsupervised Monocular Depth Estimation Based on Dual Attention Mechanism and Depth-Aware Loss. 169-174
- Gang Fu, Qing Zhang, Chunxia Xiao:
Towards High-Quality Intrinsic Images in the Wild. 175-180
- Shuosen Guan, Haoxin Li, Wei-Shi Zheng:
Unsupervised Learning for Optical Flow Estimation Using Pyramid Convolution LSTM. 181-186
- Yuan Gao, Robert Bregovic, Atanas P. Gotchev, Reinhard Koch:
MAST: Mask-Accelerated Shearlet Transform for Densely-Sampled Light Field Reconstruction. 187-192
O-08: Object Detection I
- Li Wang, Yongbo Li, Xiangyang Xue:
CODA: Counting Objects via Scale-Aware Adversarial Density Adaption. 193-198
- Chunbiao Zhu, Xing Cai, Kan Huang, Thomas H. Li, Ge Li:
PDNet: Prior-Model Guided Depth-Enhanced Network for Salient Object Detection. 199-204
- Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo, Wei Zhong:
Continuous Scale Adaption for Efficient Box-Based Scene Text Detection. 205
- Xiaobao Guo, Jinxing Li, Bingzhi Chen, Guangming Lu:
Mask-Most Net: Mask Approximation Based Multi-oriented Scene Text Detection Network. 206-211
O-09: Emerging Applications of Deep Learning
- Junhao Huang, Lin Zhang, Ying Shen, Huijuan Zhang, Shengjie Zhao, Yukai Yang:
DMPR-PS: A Novel Approach for Parking-Slot Detection Using Directional Marking-Point Regression. 212-217
- Yong-Xiang Lin, Daniel Stanley Tan, Wen-Huang Cheng, Kai-Lung Hua:
Adapting Semantic Segmentation of Urban Scenes via Mask-Aware Gated Discriminator. 218-223
- Maomao Li, Chun Yuan, Zhihui Lin, Zhuobin Zheng, Yangyang Cheng:
Stochastic Video Generation with Disentangled Representations. 224-229
- Jianjin Zhang, Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu:
Z-Order Recurrent Neural Networks for Video Prediction. 230-235
O-10: Multimedia Quality Assessment and Enhancement
- Yingru Liu, Dongliang Xie, Xin Wang:
Energy-Based Recurrent Model for Stochastic Modeling of Music. 236-241
- Huaixuan Zhang, Yuhai Lan, Tao Dai, Ruizhi Qiao, Ying Xu, Yao Yao, Shu-Tao Xia:
Residual Frame for Noisy Video Classification According to Perceptual Quality in Convolutional Neural Networks. 242-247
- Guanqun Hou, Yujiu Yang, Jing-Hao Xue:
Residual Dilated Network with Attention for Image Blind Denoising. 248-253
- Zhuopeng Li, Xiaoyan Zhang:
Collaborative Deep Reinforcement Learning for Image Cropping. 254-259
O-11: Multimedia for Society and Health
- Penghui Sun, Hao Liu, Xin Wang, Zhenhua Yu, Suping Wu:
Similarity-Aware Deep Adversarial Learning for Facial Age Estimation. 260-265
- Yinghong Liao, Bin Qiu, Zhuo Su, Ruomei Wang, Xiangjian He:
Learning Transmission Filtering Network for Image-Based Pm2.5 Estimation. 266-271
- Yuan Tian, Xiongkuo Min, Guangtao Zhai, Zhiyong Gao:
Video-Based Early ASD Detection via Temporal Pyramid Networks. 272-277
- Ying Zhang, Yinjia Zhang, Qinpei Zhao, Weixiong Rao:
Automatic User Categorization Through Large Transaction Data. 278-283
O-12: Immersive Media
- Junkun Qi, Wei Hu, Zongming Guo:
Feature Preserving and Uniformity-Controllable Point Cloud Simplification on Graph. 284-289
- Jun Fu, Xiaoming Chen, Zhizheng Zhang, Shilin Wu, Zhibo Chen:
360SRL: A Sequential Reinforcement Learning Approach for ABR Tile-Based 360 Video Streaming. 290-295
- Falah Jabar, João Ascenso, Maria Paula Queluz:
Content-Aware Perspective Projection Optimization for Viewport Rendering of 360° Images. 296-301
- Ziming Wu, Jiabin Guo, Shuangli Zhang, Chen Zhao, Xiaojuan Ma:
An AR Benchmark System for Indoor Planar Object Tracking. 302-307
O-13: 3D and Stereo Computing
- Zhenchao Wu, Kun Li, Yu-Kun Lai, Jingyu Yang:
Global as-Conformal-as-Possible Non-Rigid Registration of Multi-view Scans. 308-313
- Zhengning Wang, Longfei Feng, Fanwei Zeng, Guang Hu, Xiang Zhang, Xia Lv, Fengjun Zhang:
A Light-Weighted Network for Facial Landmark Detection via Combined Heatmap and Coordinate Regression. 314-319
- Xianzhe Xu, Yonghong Hou, Pichao Wang, Zhongyu Jiang, Wanqing Li:
Light Weight Stereo Matching via Deep Extraction and Integration of Low and High Level Information. 320-325
- Hongxin Lin, Zelin Xiao, Yang Tan, Hongyang Chao, Shengyong Ding:
Justlookup: One Millisecond Deep Feature Extraction for Point Clouds By Lookup Tables. 326-331
O-14: Machine Learning Applications in Image and Video Coding I
- Bo Jiang, Xingyue Jiang, Jin Tang, Bin Luo, Shilei Huang:
Multiple Graph Convolutional Networks for Co-Saliency Detection. 332-337
- Lahiru D. Chamain, Sen-ching Samson Cheung, Zhi Ding:
Quannet: Joint Image Compression and Classification Over Channels with Limited Bandwidth. 338-343
- Jiawen Gu, Bichuan Guo, Jiangtao Wen:
High Efficiency Light Field Compression via Virtual Reference and Hierarchical MV-HEVC. 344-349
- Youfa Liu, Bo Du, Lefei Zhang:
Self-Paced Subspace Clustering. 350-355
O-15: Vision, Language and Text Processing
- Xuri Ge, Fuhai Chen, Chen Shen, Rongrong Ji:
Colloquial Image Captioning. 356-361
- Yike Wu, Shiwan Zhao, Jia Chen, Ying Zhang, Xiaojie Yuan, Zhong Su:
Improving Captioning for Low-Resource Languages by Cycle Consistency. 362-367
- Zhuo Lei, Chao Zhang, Qian Zhang, Guoping Qiu:
FrameRank: A Text Processing Approach to Video Summarization. 368-373
- Anna Zhu, Qiyang Zhang, Xiongbo Lu, Shengwu Xiong:
Character Image Synthesis Based on Selected Content and Referenced Style Embedding. 374-379
O-16: Media Classification and Segmentation II
- Yujia Liu, Weiming Zhang, Nenghai Yu:
Query-Free Embedding Attack Against Deep Learning. 380-386
- Zongmin Li, Jun Zhang, Guanlin Li, Yujie Liu, Siyuan Li:
Graph Attention Neural Networks for Point Cloud Recognition. 387-392
- Lu Li, Yang Li, Xiangxiang Xu, Shao-Lun Huang, Lin Zhang:
Maximal Correlation Embedding Network for Multilabel Learning with Missing Labels. 393-398
- Zengyuan Guo, Xinzhu Ma, Haojie Li, Zhihui Wang, Pengbo Zhang:
Self-Adaption Multi-classifier Fusion Networks for Image Recognition. 399-405
O-17: AI for Human Understanding
- Baohan Xu, Yingbin Zheng, Hao Ye, Caili Wu, Heng Wang, Gufei Sun:
Video Emotion Recognition with Concept Selection. 406-411
- Han Zhang, Yonghong Song, Yuanlin Zhang:
Graph Convolutional LSTM Model for Skeleton-Based Action Recognition. 412-417
- Zhongwei Qiu, Kai Qiu, Jianlong Fu, Dongmei Fu:
Learning Recurrent Structure-Guided Attention Network for Multi-person Pose Estimation. 418-423
- Zhenying Fang, Suguo Zhu, Jun Yu, Qi Tian:
PCPCAD: Proposal Complementary Action Detector. 424-429
O-18: Image Quality Metrics
- Leida Li, Hancheng Zhu, Sicheng Zhao, Guiguang Ding, Hongyan Jiang, Allen Tan:
Personality Driven Multi-task Learning for Image Aesthetic Assessment. 430-435
- Chen Bai, Amy R. Reibman:
Video Quality Temporal Pooling using a Visibility Measure. 436-441
- Yuming Fang, Yan Zeng, Hanwei Zhu, Guangtao Zhai:
Image Quality Assessment of Multi-exposure Image Fusion for Both Static and Dynamic Scenes. 442-447
- Sumei Li, Jianwei Xue, Yongtian Han:
No-Reference Stereoscopic Image Quality Assessment Based on Local to Global Feature Regression. 448-453
O-19: Multimedia Recommendations
- Wenmian Yang, Wenyuan Gao, Xiaojie Zhou, Weijia Jia, Shaohua Zhang, Yutao Luo:
Herding Effect Based Attention for Personalized Time-Sync Video Recommendation. 454-459
- Shang Liu, Zhenzhong Chen:
Sequential Behavior Modeling for Next Micro-Video Recommendation with Collaborative Transformer. 460-465
- Dawei Liu, Ying Cao, Rynson W. H. Lau, Antoni B. Chan:
ButtonTips: Design Web Buttons with Suggestions. 466-471
- Shengjie Ma, Zheng-Jun Zha, Feng Wu:
Knowing User Better: Jointly Predicting Click-Through and Playtime for Micro-Video. 472-477
O-20: Search and Retrieval
- Xin Wen, Zhizhong Han, Xinyu Yin, Yu-Shen Liu:
Adversarial Cross-Modal Retrieval via Learning and Transferring Single-Modal Similarities. 478-483
- Zekun Li, Zeyu Cui, Shu Wu, Xiaoyu Zhang, Liang Wang:
Semi-Supervised Compatibility Learning Across Categories for Clothing Matching. 484-489
- Kevin Lin, Fan Yang, Qiaosong Wang, Robinson Piramuthu:
Adversarial Learning for Fine-Grained Image Search. 490-495
- Lei Qi, Jing Huo, Lei Wang, Yinghuan Shi, Yang Gao:
A Mask Based Deep Ranking Neural Network for Person Retrieval. 496-501
O-21: Media Understanding
- Kunal Swami, Kaushik Raghavan, Nikhilanj Pelluri, Rituparna Sarkar, Pankaj Bajpai:
DISCO: Depth Inference from Stereo using Context. 502-507
- Yunian Chen, Yanjie Wang, Yang Zhang, Yanwen Guo:
PANet: A Context Based Predicate Association Network for Scene Graph Generation. 508-513
- Aming Wu, Yahong Han, Quanxin Zhang, Xiaohui Kuang:
Untargeted Adversarial Attack via Expanding the Semantic Gap. 514-519
- Yen-Wei Chang, Wen-Hsiao Peng:
Learning Goal-Oriented Visual Dialog Agents: Imitating and Surpassing Analytic Experts. 520-525
O-22: Super-resolution and Enhancement
- Kui Jiang, Zhongyuan Wang, Peng Yi, Junjun Jiang, Guangcheng Wang, Zhen Han, Tao Lu:
GAN-Based Multi-level Mapping Network for Satellite Imagery Super-Resolution. 526-531
- Ren Yang, Xiaoyan Sun, Mai Xu, Wenjun Zeng:
Quality-Gated Convolutional Lstm for Enhancing Compressed Video. 532-537
- Risheng Liu, Minjun Hou, Jinyuan Liu, Xin Fan, Zhongxuan Luo:
Compounded Layer-Prior Unrolling: A Unified Transmission-Based Image Enhancement Framework. 538-543
- Qiang Fu, Wenhan Yang, Ying Li, Jiaying Liu:
Deep Pyramid Variation Learning for Image Interpolation. 544-549
O-23: Pose and Action Recognition II
- Zhangxuan Gu, Jianfu Zhang, Ziqi Pan, Haohua Zhao, Liqing Zhang:
Clothes Keypoints Localization and Attribute Recognition via Prior Knowledge. 550-555
- Yong Su, Zhiyong Feng:
Spatio-Temporal Multi-Factor Discriminant Analysis for Individual Identification. 556-561
- Jianjun Lei, Yalong Jia, Bo Peng, Qingming Huang:
Channel-wise Temporal Attention Network for Video Action Recognition. 562-567
- Qichao Xu, John See, Weiyao Lin:
Localization Guided Fight Action Detection in Surveillance Videos. 568-573
O-24: Image and Video Enhancements I
- Yue Lu, Zhuqing Jiang, Guodong Ju, Liangheng Shen, Aidong Men:
Recursive Multi-Stage Upscaling Network with Discriminative Fusion for Super-Resolution. 574-579
- Yuanfei Huang, Jie Li, Xinbo Gao, Wen Lu, Yanting Hu:
Improving Image Super-Resolution via Feature Re-Balancing Fusion. 580-585
- Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen:
Difficulty-Aware Image Super Resolution via Deep Adaptive Dual-Network. 586-591
- Xiaoting Du, Yuan Zhou, Yanfang Chen, Yeda Zhang, Jianxing Yang, Dou Jin:
Dense-Connected Residual Network for Video Super-Resolution. 592-597
O-25: Face and Person Analysis
- Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, Houqiang Li:
Dynamic Cascaded Regression Network with Reinforcement Learning for Robust Face Alignment. 598-603
- Mengyan Li, Yuechuan Sun, Zhaoyu Zhang, Haonian Xie, Jun Yu:
Deep Learning Face Hallucination via Attributes Transfer and Enhancement. 604-609
- Junjie Zhu, Xibin Zhao, Han Hu, Yue Gao:
Emotion Recognition from Physiological Signals using Multi-Hypergraph Neural Networks. 610-615
- Yue Liao, Si Liu, Tianrui Hui, Chen Gao, Yao Sun, Hefei Ling, Bo Li:
GPS: Group People Segmentation with Detailed Part Inference. 616-621
O-26: Media Classification and Segmentation III
- Zhao-Min Chen, Xiu-Shen Wei, Xin Jin, Yanwen Guo:
Multi-Label Image Recognition with Joint Class-Aware Map Disentangling and Label Correlation Embedding. 622-627
- Zhengtao Tan, Bin Liu, Weihai Li, Nenghai Yu:
Real Time Compressed Video Object Segmentation. 628-633
- Zhihui Wang, Shijie Wang, Pengbo Zhang, Haojie Li, Bo Liu:
Accurate And Fast Fine-Grained Image Classification via Discriminative Learning. 634-639
- Zhong Li, Xin Chen, Wangyiteng Zhou, Yingliang Zhang, Jingyi Yu:
Pose2Body: Pose-Guided Human Parts Segmentation. 640-645
O-27: Image and Video Enhancements II
- Zhan Shu, Mengcheng Cheng, Biao Yang, Zhuo Su, Xiangjian He:
Residual Magnifier: A Dense Information Flow Network for Super Resolution. 646-651
- Xinyu Li, Wei Zhang, Tong Shen, Tao Mei:
Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks. 652-657
- Jichun Li, Ke Li, Bo Yan:
Scale-Aware Deep Network with Hole Convolution for Blind Motion Deblurring. 658-663
- Tie Liu, Mai Xu, Zulin Wang:
Removing Rain in Videos: A Large-Scale Database and a Two-Stream ConvLSTM Approach. 664-669
O-28: Multimedia Learning and Adaptation
- Zhengyuan Pang, Lifeng Sun, Tianchi Huang, Zhi Wang, Shiqiang Yang:
Towards QoS-Aware Cloud Live Transcoding: A Deep Reinforcement Learning Approach. 670-675
- Ding Ma, Xiangqian Wu:
High Speed Recurrent Regression Network for Visual Tracking. 676-681
- Yanmin Shang, Zhezhou Kang, Yanan Cao, Dongjie Zhang, Yang Li, Yangxi Li, Yanbing Liu:
PAAE: A Unified Framework for Predicting Anchor Links with Adversarial Embedding. 682-687
- Ying Li, Lin Cheng, Yaxin Peng, Zhijie Wen, Shihui Ying:
Manifold Alignment and Distribution Adaptation for Unsupervised Domain Adaptation. 688-693
O-29: Person (Re-)Identification and People Detection
- Hui Li, Meng Yang, Zhihui Lai, Weishi Zheng, Zitong Yu:
Pedestrian re-Identification Based on Tree Branch Network with Local and Global Learning. 694-699
- Zheng Liu, Jie Qin, Annan Li, Yunhong Wang, Luc Van Gool:
Adversarial Binary Coding for Efficient Person Re-Identification. 700-705
- Yingzhi Tang, Xi Yang, Nannan Wang, Xinrui Jiang, Bin Song, Xinbo Gao:
Person re-Identification with Gradual Background Suppression. 706-711
- Yingxin Zhu, Xiaoqiang Guo, Jianlei Liu, Zhuqing Jiang:
Multi-Branch Context-Aware Network for Person Re-Identification. 712-717
O-30: Multimedia and Language II
- Fenxiao Chen, Angela Wang, C.-C. Jay Kuo:
Post-Processing of Word Representations via Variance Normalization and Dynamic Embedding. 718-723
- Dong Zhang, Liangqing Wu, Shoushan Li, Qiaoming Zhu, Guodong Zhou:
Multi-Modal Language Analysis with Hierarchical Interaction-Level and Selection-Level Attentions. 724-729
- Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou:
Modeling the Clause-Level Structure to Multimodal Sentiment Analysis via Reinforcement Learning. 730-735
- Jianming Wang, Wei Deng, Yukuan Sun, Yuanyuan Li, Kai Wang, Guanghao Jin:
Twice Opportunity Knocks Syntactic Ambiguity: A Visual Question Answering Model with yes/no Feedback. 736-741
O-31: Multimedia Communications and Localization
- Bin Sun, Chen Chen, Yingying Zhu, Jianmin Jiang:
GEOCAPSNET: Ground to Aerial View Image Geo-Localization using Capsule Network. 742-747
- Bo Wang, Fengyuan Ren:
Improving Robustness of DASH Against Network Uncertainty. 748-753
- Bo Wang, Fengyuan Ren, Chao Zhou:
Hybrid Control-Based ABR: Towards Low-Delay Live Streaming. 754-759
- Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin:
Taxi Origin-Destination Demand Prediction with Contextualized Spatial-Temporal Network. 760-765
O-32: Multimedia Security, Privacy and Forensics II
- Sahib Khan, Tiziano Bianchi:
Fast Image Clustering Based on Camera Fingerprint Ordering. 766-771
- Xin Xu, Quanwei Cai, Jingqiang Lin, Shiran Pan, Liangqin Ren:
Enforcing Access Control in Distributed Version Control Systems. 772-777
- Peixuan He, Kaiping Xue, Jie Xu, Qiudong Xia, Jianqing Liu, Hao Yue:
Attribute-Based Accountable Access Control for Multimedia Content with In-Network Caching. 778-783
- Liyue Fan:
Practical Image Obfuscation with Provable Privacy. 784-789
O-33: Multimedia Sensing and Signal Processing
- Zhenwen Liang, Dongyang Zhang, Jie Shao:
Jointly Solving Deblurring and Super-Resolution Problems with Dual Supervised Network. 790-795
- Michael Gref, Christoph Schmidt, Sven Behnke, Joachim Köhler:
Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews. 796-801
- Yang Zhang, Huiming Zhang, Yanwen Guo, Kai Lin, Jingwu He:
An Adaptive Affinity Graph with Subspace Pursuit for Natural Image Segmentation. 802-807
- Li He, Yi Zhou, Hongqing Liu:
Phase Time-Frequency Masking Based Speech Enhancement Algorithm Using Circular Microphone Array. 808-813
O-34: Detection and Recognition
- Yanyan Fang, Biyun Zhan, Wandi Cai, Shenghua Gao, Bo Hu:
Locality-Constrained Spatial Transformer Network for Video Crowd Counting. 814-819
- Yixin Li, Shengqin Tang, Yun Ye, Jinwen Ma:
Spatial-Aware Non-Local Attention for Fashion Landmark Detection. 820-825
- Wu Zheng, Lin Li, Zhaoxiang Zhang, Yan Huang, Liang Wang:
Relational Network for Skeleton-Based Action Recognition. 826-831
- Weipeng Lin, Yidong Li, Xiaoliang Yang, Peixi Peng, Junliang Xing:
Multi-View Learning for Vehicle Re-Identification. 832-837
O-35: Multi-modal Media Computing and Human-machine Interaction
- Yi Zhang, Cheng Zeng, Hao Cheng, Chongjun Wang, Lei Zhang:
Many Could be Better Than All: A Novel Instance-Oriented Algorithm for Multi-modal Multi-label Problem. 838-843
- Benchao Li, Zhenzhong Chen, Shan Li, Wei-Shi Zheng:
Affective Video Content Analyses by Using Cross-Modal Embedding Learning Features. 844-849
- Xiaolong Zhou, Jianing Lin, Jiaqi Jiang, Shengyong Chen:
Learning A 3D Gaze Estimator with Improved Itracker Combined with Bidirectional LSTM. 850-855
- Jingda Guo, Xianwei Cheng, Qi Chen, Qing Yang:
Detection of Occluded Road Signs on Autonomous Driving Vehicles. 856-861
Poster Sessions
Poster Session 1 & TMM Poster
- Yingyi Zhang, Lin Zhang, Xiao Liu, Shengjie Zhao, Ying Shen, Yukai Yang:
Pay By Showing Your Palm: A Study of Palmprint Verification on Mobile Platforms. 862-867
- Yuze Guo, Wenjing Huang, Yajing Chen, Shikui Tu:
Regularize Network Skip Connections by Gating Mechanisms for Electron Microscopy Image Segmentation. 868-873
- Xiaohui Lin, Yi Xu, Mingda Wang, Bingbing Ni, Xiaokang Yang, Guangyu Tao, Xiaodan Ye:
Cross Modality Alignment of Medical Volumes using Spatio-Semantic Attentive Cycle-GAN. 874-879
- Bin Yuan, Zongqing Lu, Jing-Hao Xue, Qingmin Liao:
A New Approach to Automatic Clothing Matting from Mannequins. 880-885
- Jinlin Wu, Shengcai Liao, Zhen Lei, Xiaobo Wang, Yang Yang, Stan Z. Li:
Clustering and Dynamic Sampling Based Unsupervised Domain Adaptation for Person Re-Identification. 886-891
- Fanchao Lin, Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
Semantic-Embedding and Shape-Aware U-Net for Ultrasound Eyeball Segmentation. 892-897
- Chengpei Xu, Ruomei Wang, Shujin Lin, Xiaonan Luo, Baoquan Zhao, Lijie Shao, Mengqiu Hu:
Lecture2Note: Automatic Generation of Lecture Notes from Slide-Based Educational Videos. 898-903
- Jianqiang Liu, Jian Yao, Jingmin Tu, Junhao Cheng:
Data-Adaptive Packing Method for Compression of Dynamic Point Cloud Sequences. 904-909
- Pengfei Li, Meng Yang:
Semantic GAN: Application for Cross-Domain Image Style Transfer. 910-915
- Paras Maharjan, Li Li, Zhu Li, Ning Xu, Chongyang Ma, Yue Li:
Improving Extreme Low-Light Image Denoising via Residual Learning. 916-921
- Yang Gao, Jun Tao, Li Zeng, Xiaoming Fang, Qian Fang, Xiaoyan Li:
User Profiling with Campus Wi-Fi Access Trace and Network Traffic. 922-927
- Baoquan Zhao, Songhua Xu, Shujin Lin, Ruomei Wang, Xiaonan Luo:
A New Visual Interface for Searching and Navigating Slide-Based Lecture Videos. 928-933
- Pin Fang, Yisen Wang, Yuan Luo:
Self-Attentive Networks for one-shot Image Recognition. 934-939
- Tianyi Wu, Sheng Tang, Rui Zhang, Juan Cao, Jintao Li:
Tree-Structured Kronecker Convolutional Network for Semantic Segmentation. 940-945
- Yixin Zhu, Jun-Yong Zhu, Wei-Shi Zheng:
Part-Based Convolutional Network for Imbalanced Age Estimation. 946-951
- Qiuzheng Chen, Ruoyu Yang:
Learning to Distinguish: A General Method to Improve Compare-Based one-shot Learning Frameworks for Similar Classes. 952-957
- Xiaokai Chen, Ke Gao, Juan Cao:
Predictability Analyzing: Deep Reinforcement Learning for Early Action Recognition. 958-963
- Junming Chen, Jie Shao, Dongyang Zhang, Xuehui Wu:
A Fast End-to-End Method with Style Transfer for Room Layout Estimation. 964-969
- Lijyun Huang, Kate Ching-Ju Lin, Yu-Chee Tseng:
Resolving Intra-Class Imbalance for GAN-Based Image Augmentation. 970-975
- Weitong Zhang, Qieshi Zhang, Jun Cheng, Cong Bai, Pengyi Hao:
End-to-End Panoptic Segmentation with Pixel-Level Non-Overlapping Embedding. 976-981
- Kaixiang Wang:
Robust Embedding Framework with Dynamic Hypergraph Fusion for Multi-label Classification. 982-987
Poster Session 2
- Peirui Cheng, Weiqiang Wang, Yuanqiang Cai:
Multi-scale Scene Text Detection via Resolution Transform. 988-993
- Haiyan Wang, Xuejian Rong, Yingli Tian:
Towards Accurate Instance-Level Text Spotting with Guided Attention. 994-999
- Youming Deng, Xianming Lin, Run Li, Rongrong Ji:
Multi-scale Gem Pooling with N-Pair Center Loss for Fine-Grained Image Search. 1000-1005
- Xingzhi Wang, Xin Liu, Zhikai Hu, Nannan Wang, Wentao Fan, Ji-Xiang Du:
Semi-Supervised Semantic-Preserving Hashing for Efficient Cross-Modal Retrieval. 1006-1011
- Haitao Wang, Hui Chen, Min Meng, Jigang Wu:
Robust Multi-View Hashing for Cross-Modal Retrieval. 1012-1017
- Siwei Wang, Yongtao Wang, Xiaoran Qin, Qijie Zhao, Zhi Tang:
Scene Text Recognition via Gated Cascade Attention. 1018-1023
- Yuyang Wang, Feng Su, Ye Qian:
Text-Attentional Conditional Generative Adversarial Network for Super-Resolution of Text Images. 1024-1029
- Fan Ma, Haoyun Yang, Haibing Yin, Xiaofeng Huang, Chenggang Yan, Xiang Meng:
Online Learning to Rank in a Listwise Approach for Information Retrieval. 1030-1035
- Yang Mi, Song Wang:
Recognizing Micro Actions in Videos: Learning Motion Details via Segment-Level Temporal Pyramid. 1036-1041
- Miao Xin, Shuhang Wang, Jian Cheng:
Entanglement Loss for Context-Based Still Image Action Recognition. 1042-1047
- Lu Zhou, Yingying Chen, Jinqiao Wang, Ming Tang, Hanqing Lu:
Bi-Directional Message Passing Based Scanet for Human Pose Estimation. 1048-1053
- Jingjun Chen, Yonghong Song, Yuanlin Zhang:
Spatial Mask ConvLSTM Network and Intra-Class Joint Training Method for Human Action Recognition in Video. 1054-1059
- Renyi Xiao, Yonghong Hou, Zihui Guo, Chuankun Li, Pichao Wang, Wanqing Li:
Self-Attention Guided Deep Features for Action Recognition. 1060-1065
- Yanshan Li, Rongjie Xia, Xing Liu, Qinghua Huang:
Learning Shape-Motion Representations from Geometric Algebra Spatio-Temporal Model for Skeleton-Based Action Recognition. 1066-1071
- Yang Bai, Weiqiang Wang:
ACPNet: Anchor-Center Based Person Network for Human Pose Estimation and Instance Segmentation. 1072-1077
- Jianyu Yang, Chen Zhu, Junsong Yuan:
Spatio-Temporal Multi-scale Soft Quantization Learning for Skeleton-Based Human Action Recognition. 1078-1083
- Wei Sun, Yezhao Fan, Xiongkuo Min, Shihao Peng, Siwei Ma, Guangtao Zhai:
LPHD: A Large-Scale Head Pose Dataset for RGB Images. 1084-1089
- Zhengyuan Yang, Yixuan Zhang, Jiebo Luo:
Human-Centered Emotion Recognition in Animated GIFs. 1090-1095
- Qize Yang, Ancong Wu, Wei-Shi Zheng:
Deep Semi-Supervised Person Re-Identification with External Memory. 1096-1101
- Tanzila Rahman, Mrigank Rochan, Yang Wang:
Convolutional Temporal Attention Model for Video-Based Person Re-Identification. 1102-1107
- Zhiyuan Li, Shizhong Han, Ahmed-Shehab Khan, Jie Cai, Zibo Meng, James O'Reilly, Yan Tong:
Pooling Map Adaptation in Convolutional Neural Network for Facial Expression Recognition. 1108-1113
- Jianheng Li, Fuhang Liang, Yuanxun Li, Wei-Shi Zheng:
Fast Person Search Pipeline. 1114-1119
- Gaoqi He, Zhenwei Ma, Binhao Huang, Bin Sheng, Yubo Yuan:
Dynamic Region Division for Adaptive Learning Pedestrian Counting. 1120-1125
- Jing Zhang, Han Sun, Zhe Wang, Tong Ruan:
Another Dimension: Towards Multi-subnet Neural Network for Image Sentiment Analysis. 1126-1131
- Pilin Dai, Jinna Lv, Bin Wu:
Two-Stage Model for Social Relationship Understanding from Videos. 1132-1137
- Junhao Hu, Lei Jin, Shenghuo Gao:
FPN++: A Simple Baseline for Pedestrian Detection. 1138-1143
- Fei Ma, Wei Zhang, Yang Li, Shao-Lun Huang, Lin Zhang:
An End-to-End Learning Approach for Multimodal Emotion Recognition: Extracting Common and Private Information. 1144-1149
Poster Session 3 & Demo Session 1
- Haoyu Ma, Juncheng Zhang, Shaojun Liu, Qingmin Liao:
Boundary Aware Multi-focus Image Fusion Using Deep Neural Network. 1150-1155
- Chenxi Ma, Weimin Tan, Bahetiyaer Bare, Bo Yan:
A Multi-level Aggregated Network for Image Restoration. 1156-1161
- Xuehui Wu, Jie Shao, Dongyang Zhang, Junming Chen:
Unsupervised Facial Image Synthesis Using Two-Discriminator Adversarial Autoencoder Network. 1162-1167
- Jie Liu, Cheolkon Jung:
Facial Image Inpainting Using Multi-level Generative Network. 1168-1173
- Jianyu Wang, Shaohui Liu, Feng Jiang, Xiaoshuai Sun, Yongliang Liu:
A Video Post-Filter Deblocking Method Based on Temporal Boosting Residual Networks. 1174-1179
- Xiaopeng Sun, Wen Lu, Rui Wang, Furui Bai:
Distilling with Residual Network for Single Image Super Resolution. 1180-1185
- Junyi Wang, Weimin Tan, Xuejing Niu, Bo Yan:
RDGAN: Retinex Decomposition Based Adversarial Learning for Low-Light Enhancement. 1186-1191
- Shichao Li, Yonghong Hou, Huanjing Yue, Zihui Guo:
Single Image De-Raining via Generative Adversarial Nets. 1192-1197
- Yuanlue Zhu, Mengchao Bai, Linlin Shen, Zhiwei Wen:
SwitchGAN for Multi-domain Facial Image Translation. 1198-1203
- Michele Brizzi, Federica Battisti, Alessandro Neri:
A Feature-Based Approach for Light Field Video Enhancement. 1204-1209
- Jindong Wang, Yiqiang Chen, Han Yu, Meiyu Huang, Qiang Yang:
Easy Transfer Learning By Exploiting Intra-Domain Structures. 1210-1215
- Guyue Hu, Bo Cui, Shan Yu:
Skeleton-Based Action Recognition with Synchronous Local and Non-Local Spatio-Temporal Learning and Frequency Attention. 1216-1221
- Meilu Zhu, Daming Shi:
Deep Geometry Embedding Networks for Robust Facial Landmark Detection. 1222-1227
- Guangzhen Liu, Jiechao Guan, Manli Zhang, Jianhong Zhang, Zihao Wang, Zhiwu Lu:
Joint Projection and Subspace Learning for Zero-Shot Recognition. 1228-1233
- Haoye Dong, Xiaodan Liang, Chenxing Zhou, Hanjiang Lai, Jia Zhu, Jian Yin:
Part-Preserving Pose Manipulation for Person Image Synthesis. 1234-1239
- He Chen, Faming Fang:
Bregman-Tanimoto Based Method for Contrast Preserving Decolorization. 1240-1245
- Xiaoqiang Li, Yaqin Zhu, Jiayue Han, Jide Li, Weiqin Tong:
TDCC: Top-Down Semantic Aggregation for Color Constancy. 1246-1251
- Lin Zhang, Jianbo Zhao, Si Li, Boxin Shi, Ling-Yu Duan:
From Market to Dish: Multi-ingredient Image Recognition for Personalized Recipe Recommendation. 1252-1257
- Hongjie Zhang, Ang Li, Xu Han, Zhaoming Chen, Yang Zhang, Yanwen Guo:
Improving Open Set Domain Adaptation Using Image-to-Image Translation. 1258-1263
- Chaoqun Wang, Xuejin Chen, Shaobo Min, Feng Wu:
Structure Generation and Guidance Network for Unsupervised Monocular Depth Estimation. 1264-1269
- Xinyao Chen, Bichuan Guo, Minhao Tang, Yuxing Han, Jiangtao Wen:
A Conditional Bayesian Block Structure Inference Model for Optimized AV1 Encoding. 1270-1275
- Ce Wang, Renjie Wan, Feng Gao, Boxin Shi, Ling-Yu Duan:
Learning to Remove Reflections for Text Images. 1276-1281
Poster Session 4 & Demo Session 2
- Hao Zhou, Wengang Zhou, Houqiang Li:
Dynamic Pseudo Label Decoding for Continuous Sign Language Recognition. 1282-1287 - Yupan Huang, Qi Dai, Yutong Lu:
Decoupling Localization and Classification in Single Shot Temporal Action Detection. 1288-1293 - Zhiming Ma, Chun Yuan, Yangyang Cheng, Xinrui Zhu:
Image-to-Tree: A Tree-Structured Decoder for Image Captioning. 1294-1299 - Liang Sun, Bing Li, Chunfeng Yuan, Zhengjun Zha, Weiming Hu:
Multimodal Semantic Attention Network for Video Captioning. 1300-1305 - Jie Wu, Tianshui Chen, Hefeng Wu, Zhi Yang, Qing Wang, Liang Lin:
Concrete Image Captioning by Integrating Content Sensitive and Global Discriminative Objective. 1306-1311 - Huidong Li, Dandan Song, Lejian Liao, Cuimei Peng:
REVnet: Bring Reviewing Into Video Captioning for a Better Description. 1312-1317 - Xi Meng, Hao Kong, Dongqi Tang, Tong Lu:
Multimodal Image Captioning Through Combining Reinforced Cross Entropy Loss and Stochastic Deprecation. 1318-1323 - Qi Wei, Kai Fan, Wenlin Wang, Tianhang Zheng, Amit Chakraborty, Katherine A. Heller, Changyou Chen, Kui Ren:
InverseNet: Solving Inverse Problems of Multimedia Data with Splitting Networks. 1324-1329 - Shaobo Lin, Long Chen, Qin Zou, Wei Tian:
High-Resolution Driving Scene Synthesis Using Stacked Conditional GANs and Spectral Normalization. 1330-1335 - Yuqi Huo, Jiechao Guan, Jianhong Zhang, Manli Zhang, Ji-Rong Wen, Zhiwu Lu:
Zero-Shot Learning with Few Seen Class Samples. 1336-1341 - Zhihao Ouyang, Yan Feng, Zihao He, Tianbo Hao, Tao Dai, Shu-Tao Xia:
Attentiondrop for Convolutional Neural Networks. 1342-1347 - Yongyong Chen, Xiaolin Xiao, Yicong Zhou:
Multi-view Clustering via Simultaneously Learning Graph Regularized Low-Rank Tensor Representation and Affinity Matrix. 1348-1353 - Boxin He, Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki:
Data Augmentation for Monaural Singing Voice Separation Based on Variational Autoencoder-Generative Adversarial Network. 1354-1359 - Tao He, Xiaoming Jin, Guiguang Ding, Lan Yi, Chenggang Yan:
Towards Better Uncertainty Sampling: Active Learning with Multiple Views for Deep Convolutional Neural Network. 1360-1365 - Chunbin Gu, Jiajun Bu, Keyue Shi, Zhi Yu, Beidou Wang, Liangcheng Li:
Local Metric Learning Based on Anchor Points for Multimedia Analysis. 1366-1371 - Tung Doan, Atsuhiro Takasu:
Sparse Regression-Based Multiple Sequence Alignment. 1372-1377 - Youzhao Yang, Hong Lu:
Single Image Deraining using a Recurrent Multi-scale Aggregation and Enhancement Network. 1378-1383 - Chunpeng Wang, Jie Zhu:
Neural Network Based Phase Compensation Methods on Monaural Speech Separation. 1384-1389 - Huikai Shao, Dexing Zhong, Yuhan Li:
PalmGAN for Cross-Domain Palmprint Recognition. 1390-1395 - Jianing Li, Siwei Dong, Zhaofei Yu, Yonghong Tian, Tiejun Huang:
Event-Based Vision Enhanced: A Joint Detection Framework in Autonomous Driving. 1396-1401 - Liming Zhai, Lina Wang, Yanzhen Ren:
Multi-domain Embedding Strategies for Video Steganography by Combining Partition Modes and Motion Vectors. 1402-1407 - Hangqing Guo, Nan Zhang, Wenjun Shi, Saeed Ali-AlQarni, Shaoen Wu, Honggang Wang:
Real-Time Indoor 3D Human Imaging Based on MIMO Radar Sensing. 1408-1413 - Shanfa Ke, Ruimin Hu, Gang Li, Tingzhao Wu, Xiaochen Wang, Zhongyuan Wang:
Multi-speakers Speech Separation Based on Modified Attractor Points Estimation and GMM Clustering. 1414-1419 - Jing Zhao, Ruiqin Xiong, Jizheng Xu, Feng Wu, Tiejun Huang:
Learning a Deep Convolutional Network for Subband Image Denoising. 1420-1425 - Zhijie Lin, Sen Jia, Bin Deng:
Multi-Task Embedded Convolutional Neural Network for Hyperspectral Image Classification. 1426-1431 - Lin Zhu, Siwei Dong, Tiejun Huang, Yonghong Tian:
A Retina-Inspired Sampling Method for Visual Texture Reconstruction. 1432-1437 - Zhizheng Zhang, Zhibo Chen, Jianxin Lin, Weiping Li:
Learned Scalable Image Compression with Bidirectional Context Disentanglement Network. 1438-1443 - Jiabao Yao, Li Wang, Fangdong Chen, Chaoyi Lin, Shiliang Pu:
An Attention Residual Neural Network with Recurrent Greedy Approach as Loop Filter for Inter Frames. 1444-1449 - Yuhang Liu, Wenyong Dong, Wanjuan Song, Lei Zhang:
Bayesian Nonnegative Matrix Factorization with a Truncated Spike-and-Slab Prior. 1450-1455 - Chao Huang, Zongju Peng, Fen Chen, Qiuping Jiang, Xin Cui, Gangyi Jiang:
Encoding Complexity Control for Live Video Applications: An Interpretable Machine Learning Approach. 1456-1461 - Risheng Liu, Cheng Yang, Long Ma, Miao Zhang, Xin Fan, Zhongxuan Luo:
Enhanced Residual Dense Intrinsic Network for Intrinsic Image Decomposition. 1462-1467 - Zhipeng Lin, Zhenyu Zhao, Tingjin Luo, Wenjing Yang, Yongjun Zhang, Yuhua Tang:
Non-Convex Transfer Subspace Learning for Unsupervised Domain Adaptation. 1468-1473 - Jianping Gou, Lei Wang, Zhang Yi, Yun-Hao Yuan, Weihua Ou, Qirong Mao:
Discriminative Group Collaborative Competitive Representation for Visual Classification. 1474-1479 - Jiahong Wu, He Zheng, Bo Zhao, Yixin Li, Baoming Yan, Rui Liang, Wenjia Wang, Shipei Zhou, Guosen Lin, Yanwei Fu, Yizhou Wang, Yonggang Wang:
Large-Scale Datasets for Going Deeper in Image Understanding. 1480-1485 - Xuan Shao, Xiao Liu, Lin Zhang, Shengjie Zhao, Ying Shen, Yukai Yang:
Revisit Surround-view Camera System Calibration. 1486-1491 - Vishal Keshav, Tej Pratap G. V. S. L.:
Decoupling Semantic Context and Color Correlation with Multi-class Cross Branch Regularization. 1492-1497 - Zhilin Qiu, Lingbo Liu, Guanbin Li, Qing Wang, Nong Xiao, Liang Lin:
Crowd Counting via Multi-view Scale Aggregation Networks. 1498-1503 - Jia Shao, Bo Du, Chen Wu, Pingkun Yan:
PASiam: Predicting Attention Inspired Siamese Network, for Space-Borne Satellite Video Tracking. 1504-1509 - Wenbo Zheng, Lan Yan, Chao Gou, Wenwen Zhang, Fei-Yue Wang:
A Relation Network Embedded with Prior Features for Few-Shot Caricature Recognition. 1510-1515 - Fenfen Sheng, Zhineng Chen, Tao Mei, Bo Xu:
A Single-Shot Oriented Scene Text Detector with Learnable Anchors. 1516-1521 - Rui Lu, Menghan Zhou, Anlong Ming, Yu Zhou:
Context-Constrained Accurate Contour Extraction for Occlusion Edge Detection. 1522-1527 - Yun-Hao Yuan, Jin Li, Jianping Gou, Yun Li, Jipeng Qiang, Bin Li:
Learning Simultaneous Face Super-Resolution Using Multiset Partial Least Squares. 1528-1533 - Qifeng Lin, Jianhui Zhao, Qianqian Tong, Guian Zhang, Zhiyong Yuan, Gang Fu:
Cropping Region Proposal Network Based Framework for Efficient Object Detection on Large Scale Remote Sensing Images. 1534-1539 - Jinghua Wang, Adrian Hilton, Jianmin Jiang:
Spectral Analysis Network for Deep Representation Learning and Image Clustering. 1540-1545
Poster Session 5 & Grand Challenge
- Chang Tang, Xinzhong Zhu, Xinwang Liu, Pichao Wang:
Salient Object Detection via Recurrently Aggregating Spatial Attention Weighted Cross-Level Deep Features. 1546-1551
- Xiaoshui Huang, Lixin Fan, Qiang Wu, Jian Zhang, Chun Yuan:
Fast Registration for Cross-Source Point Clouds by using Weak Regional Affinity and Pixel-Wise Refinement. 1552-1557 - Cunkuan Yuan, Kun Li, Yu-Kun Lai, Yebin Liu, Jingyu Yang:
3D Face Reprentation and Reconstruction with Multi-scale Graph Convolutional Autoencoders. 1558-1563 - Qiang Wang, Yahong Han:
Visual Dialog with Targeted Objects. 1564-1569 - Zhengyang Sun, Zongqing Lu, Jing-Hao Xue, Qingmin Liao:
A New Object Scene Flow Algorithm Based on Support Points Selection and Robust Moving Object Proposal. 1570-1575 - Dashan Guo, Wei Li, Ning Xu, Jianhui Sun, Xiangzhong Fang:
Refining Proposals with Neighboring Contexts for Temporal Action Detection. 1576-1581 - Yanjun Chen, Jie Guo, Bingyang Hu, Yanwen Guo, Jingui Pan:
A Data-Driven Framework for Appearance Editing of Measured Materials. 1582-1587 - Yang Zhou, Shuhan Shen, Zhanyi Hu:
Active Semantic Labeling of Street View Point Clouds. 1588-1593 - Qian Wu, Wenmin Wang, Xiongtao Chen, Weimian Li:
Video Prediction with Temporal-Spatial Attention Mechanism and Deep Perceptual Similarity Branch. 1594-1599 - Chongyang Bai, Maksim Bolonkin, Judee K. Burgoon, Chao Chen, Norah E. Dunbar, Bharat Singh, V. S. Subrahmanian, Zhe Wu:
Automatic Long-Term Deception Detection in Group Interaction Videos. 1600-1605 - Yachi Zhang, Zongqing Lu, Jing-Hao Xue, Qingmin Liao:
A New Rotation-Invariant Deep Network for 3D Object Recognition. 1606-1611 - Andreas Kah, Matthias Narroschke:
Local Optical Flow Considering Object Boundaries by Adaptive Window Positioning. 1612-1617 - Meng Zhang, Xinchen Liu, Wu Liu, Anfu Zhou, Huadong Ma, Tao Mei:
Multi-Granularity Reasoning for Social Relation Recognition From Images. 1618-1623 - Xin Chen, Yahong Han:
Multi-Timescale Context Encoding for Scene Parsing Prediction. 1624-1629 - Lingyu Zhu, Tinghuai Wang, Emre Aksu, Joni-Kristian Kamarainen:
Portrait Instance Segmentation for Mobile Devices. 1630-1635 - Pengbo Zhang, Zhihui Wang, Xinzhu Ma, Haojie Li, Jianjun Li:
Learning to Segment Unseen Category Objects using Gradient Gaussian Attention. 1636-1641 - Fei Pan, Yanwen Guo, Zhicheng Yan, Jie Guo:
Temporal Segment Convolutional Kernel Networks for Sequence Modeling of Videos. 1642-1647 - Shaoshuai Li, Fuyan Liu:
SVNet: A Single View Network for 3D Shape Recognition. 1648-1653 - Fei Wang, Shujin Lin, Hefeng Wu, Hanhui Li, Ruomei Wang, Xiaonan Luo, Xiangjian He:
SPFusionNet: Sketch Segmentation Using Multi-modal Data Fusion. 1654-1659 - Mengmeng Jing, Jingjing Li, Ke Lu, Jieyan Liu, Zi Huang:
Adaptive Component Embedding for Unsupervised Domain Adaptation. 1660-1665 - Truc Nguyen, Franz Pernkopf:
Acoustic Scene Classification with Mismatched Recording Devices Using Mixture of Experts Layer. 1666-1671 - Hongchao Gao, Xi Wang, Yujia Li, Jizhong Han, Songlin Hu, Ruixuan Li:
Self-Representation Convolutional Neural Networks. 1672-1677
Poster Session 6
- Tianchi Huang, Xin Yao, Chenglei Wu, Rui-Xiao Zhang, Zhengyuan Pang, Lifeng Sun:
Tiyuntsong: A Self-Play Reinforcement Learning Approach for ABR Video Streaming. 1678-1683 - Venkatraman Balasubramanian, Mu Wang, Martin Reisslein, Changqiao Xu:
Edge-Boost: Enhancing Multimedia Delivery with Mobile Edge Caching in 5G-D2D Networks. 1684-1689 - Hao Wu, Xiaoyan Sun, Jingyu Yang, Feng Wu:
3D Mesh Based Inter-Image Prediction for Image Set Compression. 1690-1695 - Dayong Wang, Yu Sun, Weisheng Li, Ce Zhu, Frédéric Dufaux:
Fast Inter Mode Predictions for SHVC. 1696-1701 - Xing Chen, Lijun He, Shang Xu, Shibo Hu, Qingzhou Li, Guizhong Liu:
Hit Ratio Driven Mobile Edge Caching Scheme for Video on Demand Services. 1702-1707 - Fang Liu, Wei Zhang, Yonggang Wen:
QoE-Driven Mobile Streaming: A Location-Aware Approach. 1708-1713 - Aris S. Lalos, Gerasimos Arvanitis, Evangelos Vlachos, Konstantinos Moustakas:
Energy Efficient Transmission of 3D Meshes Over MMWave-Based Massive MIMO Systems. 1714-1719 - Hao Fan, Xu Tong, Qing Zhang, Tianxiang Zhang, Chenyang Wang, Xiaofei Wang:
Identifying Influential Users in Mobile Device-to-Device Social Networks to Promote Offline Multimedia Content Propagation. 1720-1725 - Yuhao Chen, Min Zhao, Xin Tan, Hong Tang, Dihua Sun:
Accurate and Efficient Object Detection with Context Enhancement Block. 1726-1731 - Yousong Zhu, Chaoyang Zhao, Chenxia Han, Jinqiao Wang, Hanqing Lu:
Mask Guided Knowledge Distillation for Single Shot Detector. 1732-1737 - Yang Wang, Lan Wang, Feng Su, Jiahao Shi:
Video Text Detection with Fully Convolutional Network and Tracking. 1738-1743 - Dongming Yang, Yuexian Zou:
Cascade Region Proposal Networks for Object Detection in the Wild. 1744-1749 - Wenfei Yang, Bin Liu, Weihai Li, Nenghai Yu:
Tracking Assisted Faster Video Object Detection. 1750-1755 - Pengyuan Xie, Jing Xiao, Yang Cao, Jia Zhu, Asad Khan:
RefineText: Refining Multi-oriented Scene Text Detection with a Feature Refinement Module. 1756-1761 - Qi Qi, Sanyuan Zhao, Jianbing Shen, Kin-Man Lam:
Multi-scale Capsule Attention-Based Salient Object Detection with Multi-crossed Layer Connections. 1762-1767 - Donghao Gu, Zhaojing Wen, Wenxue Cui, Rui Wang, Feng Jiang, Shaohui Liu:
Continuous Bidirectional Optical Flow for Video Frame Sequence Interpolation. 1768-1773 - Chunhui Zhang, Shiming Ge, Yingying Hua, Dan Zeng:
Robust Deep Tracking with Two-step Augmentation Discriminative Correlation Filters. 1774-1779 - Yiwu Yao, Bin Dong, Yuke Li, Weiqiang Yang, Haoqi Zhu:
Efficient Implementation of Convolutional Neural Networks with End to End Integer-Only Dataflow. 1780-1785 - Qianqian Wang, Liansheng Zhuang, Ning Wang, Wengang Zhou, Houqiang Li:
Learning Motion-Aware Policies for Robust Visual Tracking. 1786-1791 - Lei Jiang, Wengang Zhou, Houqiang Li:
Knowledge Distillation with Category-Aware Attention and Discriminant Logit Losses. 1792-1797 - Anjie Wang, Yongbin Gao, Zhijun Fang, Xiaoyan Jiang, Shanshe Wang, Siwei Ma, Jenq-Neng Hwang:
Unsupervised Learning of Depth and Ego-Motion with Spatial-Temporal Geometric Constraints. 1798-1803 - Ming-Ya Ko, Jeng-Lin Li, Chi-Chun Lee:
Learning Minimal Intra-Genre Multimodal Embedding from Trailer Content and Reactor Expressions for Box Office Prediction. 1804-1809 - Yangwo Jian, Jing Xiao, Yang Cao, Asad Khan, Jia Zhu:
Deep Pairwise Ranking with Multi-label Information for Cross-Modal Retrieval. 1810-1815 - Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang:
Correlation Filter Tracking with Adaptive Proposal Selection for Accurate Scale Estimation. 1816-1821 - Haitao Wang, Min Meng, Hui Chen, Jigang Wu:
Supervised Consistent and Specific Hashing. 1822-1827 - Shengdong Li, Xueqiang Lv:
Momentum Based on Adaptive Bold Driver. 1828-1833 - Meiyu Huang, Xueshuang Xiang, Yao Xu, Yiqiang Chen:
A Lightweight Neural Network Based Human Depth Recovery Method. 1834-1839 - Shiyu Zhao, Lin Zhang, Shuaiyi Huang, Ying Shen, Shengjie Zhao, Yukai Yang:
Evaluation of Defogging: A Real-World Benchmark Dataset, A New Criterion and Baselines. 1840-1845 - Yanan Wang, Haili Wang, Jiaoyang Shang, Hu Tuo:
RESA: A Real-Time Evaluation System for ABR. 1846-1851 - Qingbo Wu, Rui Ma, King Ngi Ngan, Hongliang Li, Fanman Meng:
Blind Image Sharpness Assessment And Enhancement via Deep Auxiliary Learning. 1852-1857 - Jinjian Wu, Jupo Ma, Fuhu Liang, Weisheng Dong, Guangming Shi:
End-to-End Blind Image Quality Assessment with Cascaded Deep Features. 1858-1863 - Chen Huang, Tingting Jiang, Ming Jiang:
Encoding Distortions for Multi-task Full-Reference Image Quality Assessment. 1864-1869 - Yuan Meng, Shenglin Zhang, Zijie Ye, Benliang Wang, Zhi Wang, Yongqian Sun, Qitong Liu, Shuai Yang, Dan Pei:
Causal Analysis of the Unsatisfying Experience in Realtime Mobile Multiplayer Games in the Wild. 1870-1875