CMStatistics 2020
Title: Clustering data with nonignorable missingness using semi-parametric mixture models
Authors: Matthieu Marbac - CREST-ENSAI (France) [presenting]
Marie du Roy de Chaumaray - CREST-ENSAI (France)
Abstract: We consider the clustering of continuous data sets subject to nonignorable missingness. Clustering is performed with a semi-parametric mixture model that requires specifying neither the component distributions nor the missingness process, under the assumption of conditional independence given the component. Estimation maximizes an extension of the smoothed likelihood that accommodates missing values, and the optimization is carried out by a Majorization-Minorization (MM) algorithm. We illustrate the relevance of the approach through numerical experiments. Under mild assumptions, we show the identifiability of the model, the monotonicity of the MM algorithm, and the consistency of the estimator. We also propose an extension of the method to mixed-type data, which we illustrate on a real data set.
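To make the idea of clustering without a parametric form for the components or the missingness process more concrete, here is a minimal illustrative sketch in Python. It is not the authors' algorithm. It is a simplified npEM-style iteration that omits the nonlinear smoothing step of a smoothed likelihood, so it does not carry the monotonicity guarantee stated in the abstract. The function name `npem_missing`, the Gaussian kernel, the fixed bandwidth, and the modeling choice that each variable is observed with a component-specific probability `tau` are all assumptions introduced for illustration; the abstract does not specify them.

```python
import numpy as np


def npem_missing(X, n_components, bandwidth=0.5, n_iter=100, seed=0):
    """Soft-cluster the rows of X, where np.nan marks a missing entry.

    Hypothetical sketch: given the component, variables are treated as
    independent, each entry is observed with probability tau[k, j], and the
    density of observed values is a weighted kernel estimate.
    Returns (W, pi, tau): posterior weights (n, K), proportions (K,),
    and observation probabilities (K, d).
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    R = ~np.isnan(X)                  # True where the entry is observed
    Rf = R.astype(float)
    Xf = np.where(R, X, 0.0)          # placeholder value for missing entries

    # kern[i, l, j] = K_h(x_ij - x_lj), forced to 0 if either entry is missing.
    diff = (Xf[:, None, :] - Xf[None, :, :]) / bandwidth
    kern = np.exp(-0.5 * diff**2) / (bandwidth * np.sqrt(2.0 * np.pi))
    kern *= Rf[:, None, :] * Rf[None, :, :]

    W = rng.dirichlet(np.ones(n_components), size=n)   # random soft start
    for _ in range(n_iter):
        mass = np.maximum(W.sum(axis=0), 1e-300)       # (K,)
        pi = mass / n
        obs_mass = W.T @ Rf                            # (K, d) weighted observed counts
        tau = np.clip(obs_mass / mass[:, None], 1e-10, 1.0 - 1e-10)

        # Weighted kernel density of the observed values: dens[i, k, j].
        dens = np.einsum("ilj,lk->ikj", kern, W) / np.maximum(obs_mass, 1e-300)[None]
        dens = np.maximum(dens, 1e-300)

        # Entry-wise log term: log(tau * density) if observed, log(1 - tau) if missing.
        log_obs = np.log(tau)[None] + np.log(dens)
        log_mis = np.broadcast_to(np.log1p(-tau)[None], log_obs.shape)
        log_term = np.where(R[:, None, :], log_obs, log_mis)

        # Posterior component weights, computed stably in log space.
        log_post = np.log(pi)[None, :] + log_term.sum(axis=2)
        log_post -= log_post.max(axis=1, keepdims=True)
        W = np.exp(log_post)
        W /= W.sum(axis=1, keepdims=True)
    return W, pi, tau
```

A call such as `W, pi, tau = npem_missing(X, n_components=2)` returns soft assignments; `W.argmax(axis=1)` gives a hard partition. Because the missingness pattern enters the posterior weights through `tau`, rows with many missing entries still contribute information about their cluster, which is the intuition behind modeling nonignorable missingness rather than discarding or imputing those entries. The pairwise kernel array uses memory of order n × n × d, so the sketch is only suited to small data sets.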