Statistics and Its Interface

Volume 15 (2022)

Number 1

Sparsity-restricted estimation for the accelerated failure time model

Pages: 1 – 18

DOI: https://dx.doi.org/10.4310/21-SII669

Authors

Xiaoyu Zhang (Department of Statistics and Actuarial Science, University of Hong Kong)

Yunpeng Zhou (Department of Statistics and Actuarial Science, University of Hong Kong)

Jinfeng Xu (Zhejiang Institute of Research and Innovation, University of Hong Kong, Hangzhou, Zhejiang, China; and Department of Statistics and Actuarial Science, University of Hong Kong)

Kam Chuen Yuen (Department of Statistics and Actuarial Science, University of Hong Kong)

Abstract

In many biomedical studies, such as high-throughput microarray or RNA-sequencing (RNA-seq) gene expression analyses, it is of practical interest to link gene expression profiles to censored survival phenotypes, for example, time to cancer recurrence or time to death. With the number of genes greatly exceeding the sample size and the nuances of survival data such as right censoring, regularized methods that combine the rank-based loss function and the penalty are often used to identify relevant prognostic biomarkers and yield parsimonious prediction models for event times. Existing penalization methods for survival data often use $\ell_1$ penalty to approximate the sparsity, yielding numerical convenience for its convexity. In practice, however, the $\ell_1$ approximation also leads to an inflated model size to achieve a desired cross-validated prediction error when compared to the ideal sparsity-restricted method. In this paper, we consider sparsity-restricted estimation in the accelerated failure time (AFT) model for censored survival data. An efficient and fast two-stage procedure that uses a convex regularized Gehan rank regression and a simple hard-thresholding estimation is proposed for its numerical implementation. The effectiveness of the proposed method is demonstrated by extensive simulation studies and real-data applications.

Keywords

AFT model, LASSO, penalty, prediction, sparsity, survival data

The full text of this article is unavailable through your IP address: 172.17.0.1

The work of Jinfeng Xu was supported by the University of Hong Kong, Zhejiang Institute of Research and Innovation Seed Fund, and by the Hong Kong General Research Fund (17308820).

Received 21 August 2019

Accepted 13 March 2021

Published 11 August 2021