Statistics and Its Interface

Volume 13 (2020)

Number 1

GPU Accelerated Liquid Association GALA

Pages: 119 – 125

DOI: https://dx.doi.org/10.4310/SII.2020.v13.n1.a10

Authors

Shinsheng Yuan (Institute of Statistical Science, Academia Sinica, Taipei City, Taiwan)

Guani Wu (Department of Statistics University of California at Los Angeles)

Yu-Cheng Li (Department of Electrical Engineering, National Taiwan University, Taipei City, Taiwan)

Yi-Chang Lu (Department of Electrical Engineering, National Taiwan University, Taipei City, Taiwan)

Ker-Chau Li (Institute of Statistical Science, Academia Sinica, Taipei City, Taiwan; and Department of Statistics, University of California at Los Angeles)

Abstract

High throughput biological assays have provided numerous data sources for studying complex interactions between multiple variables in a biological system. Many computational tools for exploring the voluminous biological data are based on pair-wise correlation between variables. Liquid Association (LA) is a novel statistical concept for inferring higher order of association between variables in a system. While LA was originally introduced to study gene-gene interaction involving three genes at a time, it can be applied for correlating biological measurements with clinical variables such as drug sensitivity profiling and patient survival time. It is computationally expensive to compute LA scores for all possible triplets in very large datasets. Here we show how to take advantage of Graphic Processing Units (GPUs) for speeding up the LA computing. Our GPU-accelerated version of LA computation (GALA) achieved nearly 200-fold improvement over the traditional CPU-alone version. A companion package in R was developed for facilitating follow-up analysis and improving user experience. An example on Global Health Observatory data is provided to showcase how LA analysis can be applied in other data intensive fields.

Keywords

Liquid Association, Correlation Coefficient, GPU, Gene Expression

This research was supported by Academia Sinica, Taiwan (Mathematics in Biology & AS-104-TP-A07), Ministry of Science and Technology, Taiwan, MOST106-2314-B-001-005 and National Science Foundation, USA, NSF, DMS-1513622. The content is solely the responsibility of the authors and does not necessarily represent the official views of NSF.

Received 15 January 2019

Accepted 10 September 2019

Published 7 November 2019