Statistics and Its Interface

Volume 13 (2020)

Number 3

Multi-dimensional classification with semiparametric mixture model

Pages: 347 – 359

DOI: https://dx.doi.org/10.4310/SII.2020.v13.n3.a5

Authors

Anqi Yin (Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington, D.C., U.S.A.)

Ao Yuan (Department of Biostatistics, Bioinformatics and Biomathematics, Georgetown University, Washington, D.C., U.S.A.)

Abstract

Compared to non-model based classification methods, the model based classification has the advantage of classification together with regression analysis, and is the interest of our investigation. For robustness, we propose and study a semiparametric mixture model, in which each sub-density is only assumed unimodal. The semiparametric maximum likelihood estimate is used to estimate the parametric and nonparametric components. Then the Bayesian classification rule is used to classify the subjects according to the model. Large sample properties of the estimates are investigated, simulation studies are conducted to evaluate the finite sample performance of the proposed model, and then the method is applied to analyze a real data.

Keywords

Classification, mixture model, maximum likelihood estimate, semiparametric model

2010 Mathematics Subject Classification

Primary 62H30. Secondary 62J99.

Received 31 May 2019

Accepted 4 February 2020

Published 22 April 2020