COMPSTAT 2022: Start Registration
View Submission - COMPSTAT2022
Title: Combined-information criterion for clusterwise elastic-net regression: Application to omic data Authors:  Stephanie Bougeard - ANSES (France) [presenting]
Xavier Bry - Universite Montpellier (France)
Thomas Verron - SEITA - ITG (France)
Ndeye Niang - CNAM (France)
Abstract: Many research questions pertain to a regression problem assuming that the population under study is not homogeneous with respect to the underlying model. In this setting, we propose an original method called Combined Information criterion CLUSterwise elastic-net regression (CICLUS). This method handles several methodological and application-related challenges. It is derived from both the information theory and the microeconomic utility theory and maximizes a well-defined criterion combining three weighted sub-criteria, each being related to a specific aim: getting a parsimonious partition, compact clusters for a better prediction of cluster-membership and a good within-cluster regression fit. The solving algorithm is monotonously convergent under mild assumptions. The CICLUS method provides an innovative solution to two key issues: the automatic optimization of the number of clusters and the issue of a prediction model. We applied it to elastic-net regression in order to be able to manage high-dimensional data involving redundant explanatory variables. CICLUS is illustrated through a real example in the field of omic data, showing how it improves the quality of the prediction and facilitates the interpretation. It should therefore prove useful whenever the data involve a population mixture such as, for example, in biology, social sciences, economics or marketing.