GrAMME: Semi-Supervised Learning using Multi-layered Graph Attention Models

US Shanthamallu, JJ Thiagarajan, H Song… - arXiv preprint arXiv …, 2018 - arxiv.org
arXiv preprint arXiv:1810.01405, 2018arxiv.org
Modern data analysis pipelines are becoming increasingly complex due to the presence of
multi-view information sources. While graphs are effective in modeling complex
relationships, in many scenarios a single graph is rarely sufficient to succinctly represent all
interactions, and hence multi-layered graphs have become popular. Though this leads to
richer representations, extending solutions from the single-graph case is not straightforward.
Consequently, there is a strong need for novel solutions to solve classical problems, such as …
Modern data analysis pipelines are becoming increasingly complex due to the presence of multi-view information sources. While graphs are effective in modeling complex relationships, in many scenarios a single graph is rarely sufficient to succinctly represent all interactions, and hence multi-layered graphs have become popular. Though this leads to richer representations, extending solutions from the single-graph case is not straightforward. Consequently, there is a strong need for novel solutions to solve classical problems, such as node classification, in the multi-layered case. In this paper, we consider the problem of semi-supervised learning with multi-layered graphs. Though deep network embeddings, e.g. DeepWalk, are widely adopted for community discovery, we argue that feature learning with random node attributes, using graph neural networks, can be more effective. To this end, we propose to use attention models for effective feature learning, and develop two novel architectures, GrAMME-SG and GrAMME-Fusion, that exploit the inter-layer dependencies for building multi-layered graph embeddings. Using empirical studies on several benchmark datasets, we evaluate the proposed approaches and demonstrate significant performance improvements in comparison to state-of-the-art network embedding strategies. The results also show that using simple random features is an effective choice, even in cases where explicit node attributes are not available.
arxiv.org