
Clusterwise elastic-net regression based on a combined information criterion

Abstract: Many research questions involve a regression problem in which the population under study is not homogeneous with respect to the underlying model. In this setting, we propose an original method called Combined Information criterion CLUSterwise elastic-net regression (Ciclus). This method handles several methodological and application-related challenges. It is derived from both information theory and microeconomic utility theory and maximizes a well-defined criterion combining three weighted sub-criteria, each related to a specific aim: obtaining a parsimonious partition, compact clusters for better prediction of cluster membership, and a good within-cluster regression fit. The solving algorithm is monotonically convergent under mild assumptions. The Ciclus principle provides an innovative solution to two key issues: (i) the automatic optimization of the number of clusters, and (ii) the proposal of a prediction model. We applied it to elastic-net regression in order to manage high-dimensional data involving redundant explanatory variables. Ciclus is illustrated through both a simulation study and a real example in the field of omics data, showing how it improves the quality of prediction and facilitates interpretation. It should therefore prove useful whenever the data involve a population mixture, for example in biology, social sciences, economics or marketing.
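To make the idea of clusterwise elastic-net regression more concrete, the sketch below alternates between fitting one elastic-net model per cluster and reassigning each observation to the cluster whose model predicts it best. It is only an illustration of the general clusterwise principle and is not the Ciclus algorithm: it does not implement the combined information criterion, it takes the number of clusters as fixed rather than optimizing it, and it includes no cluster-membership prediction. The function names (`fit_elastic_net`, `clusterwise_elastic_net`), the penalty parametrization (`lam`, `alpha`) and the hard-assignment rule are assumptions chosen for the example.

```python
import random


def soft_threshold(z, t):
    """Soft-thresholding operator arising from the L1 part of the penalty."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def fit_elastic_net(X, y, lam=0.01, alpha=0.5, n_iter=200):
    """Coordinate descent (assumed parametrization) minimizing
    (1/2n) * sum (y_i - b0 - x_i.b)^2
        + lam * (alpha * ||b||_1 + (1 - alpha) / 2 * ||b||_2^2).
    Requires lam > 0 and alpha < 1 so every update is well defined."""
    n, p = len(X), len(X[0])
    b0, b = 0.0, [0.0] * p
    for _ in range(n_iter):
        # Unpenalized intercept: mean of the current residuals.
        b0 = sum(y[i] - sum(X[i][j] * b[j] for j in range(p))
                 for i in range(n)) / n
        for j in range(p):
            rho, zj = 0.0, 0.0
            for i in range(n):
                # Partial residual that leaves feature j out.
                r = y[i] - b0 - sum(X[i][k] * b[k]
                                    for k in range(p) if k != j)
                rho += X[i][j] * r
                zj += X[i][j] ** 2
            rho /= n
            zj /= n
            b[j] = soft_threshold(rho, lam * alpha) / (zj + lam * (1 - alpha))
    return b0, b


def predict(model, x):
    """Prediction of one (intercept, coefficients) model at point x."""
    b0, b = model
    return b0 + sum(bj * xj for bj, xj in zip(b, x))


def clusterwise_elastic_net(X, y, n_clusters=2, n_rounds=20, seed=0, **enet):
    """Alternate per-cluster elastic-net fits and hard reassignment by
    squared residual, with a fixed (not optimized) number of clusters."""
    rng = random.Random(seed)
    labels = [rng.randrange(n_clusters) for _ in X]
    models = [(0.0, [0.0] * len(X[0]))] * n_clusters
    for _ in range(n_rounds):
        for k in range(n_clusters):
            idx = [i for i, lab in enumerate(labels) if lab == k]
            if len(idx) >= 2:  # keep the previous model for tiny clusters
                models[k] = fit_elastic_net([X[i] for i in idx],
                                            [y[i] for i in idx], **enet)
        new_labels = [
            min(range(n_clusters),
                key=lambda k: (y[i] - predict(models[k], X[i])) ** 2)
            for i in range(len(X))
        ]
        if new_labels == labels:  # assignments are stable: stop
            break
        labels = new_labels
    return labels, models


if __name__ == "__main__":
    # Toy mixture: two groups with opposite slopes on one predictor.
    xs = [[float(i)] for i in range(1, 11)]
    X = xs + xs
    y = [2.0 * x[0] for x in xs] + [-2.0 * x[0] for x in xs]
    labels, models = clusterwise_elastic_net(X, y, n_clusters=2)
    print(labels)
    print(models)
```

Because the assignment step is greedy and the start is random, a sketch like this can stop in a poor local optimum and gives no guidance on how many clusters to use, which are among the issues the combined criterion described in the abstract is designed to address.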
Document type: Journal articles

https://hal-cnam.archives-ouvertes.fr/hal-03541125
Contributor: Ndeye Niang
Submitted on: Monday, January 24, 2022 - 2:09:49 PM
Last modification on: Wednesday, September 28, 2022 - 5:59:41 AM

Citation

Xavier Bry, Ndèye Niang, Thomas Verron, Stéphanie Bougeard. Clusterwise elastic-net regression based on a combined information criterion. Advances in Data Analysis and Classification, Springer Verlag, 2022, ⟨10.1007/s11634-021-00489-w⟩. ⟨hal-03541125⟩