Statistics and Its Interface

Volume 17 (2024)

Number 2

Special issue on statistical learning of tensor data

Community detection in temporal citation network via a tensor-based approach

Pages: 145 – 158

DOI: https://dx.doi.org/10.4310/22-SII751

Authors

Tianchen Gao (School of Economics, Xiamen University, Xiamen, Fujian, China)

Rui Pan (School of Statistics and Mathematics, Central University of Finance and Economics, Beijing, China)

Junfei Zhang (School of Statistics and Mathematics, Central University of Finance and Economics, Beijing, China)

Hansheng Wang (Guanghua School of Management, Peking University, Beijing, China)

Abstract

In the era of big data, network analysis has attracted widespread attention. Detecting and tracking community evolution in temporal networks can uncover important and interesting behaviors. In this paper, we analyze a temporal citation network constructed by publications collected from 44 statistical journals between 2001 and 2018. We propose an approach named Tensor-based Directed Spectral Clustering On Ratios of Eigenvectors (TD-SCORE) which can correct for degree heterogeneity to detect the community structure of the temporal citation network. We first explore the characteristics of the temporal network via in-degree distribution and visualization of different snapshots, and we find that both the community structure and the key nodes change over time. Then, we apply the TD-SCORE method to the core network of our temporal citation network. Seven communities are identified, including variable selection, Bayesian analysis, functional data analysis, and many others. Finally, we track the evolution of the above communities and reach some conclusions.

Keywords

community detection, temporal citation network, TD-SCORE, community evolution, CP decomposition

The research of Rui Pan is supported by National Natural Science Foundation of China (No. 11971504). Hansheng Wang’s research is partially supported by National Natural Science Foundation of China (No. 11831008) and also partially supported by the Open Research Fund of Key Laboratory of Advanced Theory and Application in Statistics and Data Science (KLATASDS-MOE-ECNU-KLATASDS2101). Rui Pan and Junfei Zhang are both supported by the disciplinary funding and the Emerging Interdisciplinary Project of Central University of Finance and Economics.

Received 23 November 2021

Accepted 5 August 2022

Published 1 February 2024