User profiles for Yadong Mu

Yadong Mu

Peking University
Verified email at pku.edu.cn
Cited by 9956

Deep high-resolution representation learning for visual recognition

…, B Jiang, C Deng, Y Zhao, D Liu, Y Mu… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
High-resolution representations are essential for position-sensitive vision problems, such as
human pose estimation, semantic segmentation, and object detection. Existing state-of-the-…

High-resolution representations for labeling pixels and regions

…, Y Zhao, B Jiang, T Cheng, B Xiao, D Liu, Y Mu… - arXiv preprint arXiv …, 2019 - arxiv.org
High-resolution representation learning plays an essential role in many vision problems, e.g.,
pose estimation and semantic segmentation. The high-resolution network (HRNet) …

Fast Fourier convolution

L Chi, B Jiang, Y Mu - Advances in Neural Information …, 2020 - proceedings.neurips.cc
Vanilla convolutions in modern deep networks are known to operate locally and at fixed scale
(e.g., the widely-adopted 3×3 kernels in image-oriented tasks). This causes low efficacy in …

Discriminative local binary patterns for human detection in personal album

Y Mu, S Yan, Y Liu, T Huang… - 2008 IEEE conference on …, 2008 - ieeexplore.ieee.org
In recent years, local pattern based object detection and recognition have attracted increasing
interest in computer vision research community. However, to our best knowledge no …

Recurrent attentive zooming for joint crowd counting and precise localization

C Liu, X Weng, Y Mu - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
Crowd counting is a new frontier in computer vision with far-reaching applications particularly
in social safety management. A majority of existing works adopt a methodology that first …

Weakly-supervised action localization by generative attention modeling

B Shi, Q Dai, Y Mu, J Wang - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
Weakly-supervised temporal action localization is a problem of learning an action localization
model with only video-level action labeling available. The general framework largely relies …

Weakly-supervised hashing in kernel space

Y Mu, J Shen, S Yan - 2010 IEEE Computer Society Conference …, 2010 - ieeexplore.ieee.org
The explosive growth of the vision data motivates the recent studies on efficient data indexing
methods such as locality-sensitive hashing (LSH). Most existing approaches perform …

Informative dropout for robust representation learning: A shape-bias perspective

B Shi, D Zhang, Q Dai, Z Zhu, Y Mu… - … on Machine Learning, 2020 - proceedings.mlr.press
Convolutional Neural Networks (CNNs) are known to rely more on local texture rather than
global shape when making decisions. Recent work also indicates a close relationship …

Attention-based multi-context guiding for few-shot semantic segmentation

T Hu, P Yang, C Zhang, G Yu, Y Mu… - Proceedings of the AAAI …, 2019 - ojs.aaai.org
Few-shot learning is a nascent research topic, motivated by the fact that traditional deep
learning methods require tremendous amounts of data. The scarcity of annotated data becomes …

Deep steering: Learning end-to-end driving model from spatial and temporal visual cues

L Chi, Y Mu - arXiv preprint arXiv:1708.03798, 2017 - arxiv.org
In recent years, autonomous driving algorithms using low-cost vehicle-mounted cameras
have attracted increasing endeavors from both academia and industry. There are multiple …