Skip to Main content Skip to Navigation
Conference papers

Regularized clusterwise multiblock regression

Abstract : Regression coefficients are usually estimated under the assumption that observations come from a single and homogeneous population. However in many applications, this assumption is not true and the overall model is not efficient to recover the specificities of potential cluster models. When variables are in addition structured into a dependent block of variables and several blocks of explanatory ones, we propose a new method called regularized clusterwise multiblock regression. The aims of this method are to find out simultaneously thanks to a single criterion: a data reduction of explanatory variables through components that can be intermediate between the ones from multiblock PLS and multiblock Redundancy Analysis, a partition of the observations into several clusters and the corresponding cluster multiblock regression coefficients. The three unknown parameters of this criterion, namely the regularization parameter which aims at stabilize the inversion of the block variance-covariance matrices, the number of components and the number of clusters, are all defined such as to minimize the prediction error on the basis of a ten-fold cross-validation. A simulation study is carried out to assess the performance of the method and an empirical application in the field of consumer satisfaction is provided which illustrates the usefulness of the method.
Document type :
Conference papers
Complete list of metadatas

https://hal-cnam.archives-ouvertes.fr/hal-02500591
Contributor : Gilbert Saporta <>
Submitted on : Friday, March 6, 2020 - 10:43:41 AM
Last modification on : Thursday, March 12, 2020 - 4:22:15 PM

Identifiers

  • HAL Id : hal-02500591, version 1

Collections

Citation

Stéphanie Bougeard, Ndèye Niang, Gilbert Saporta. Regularized clusterwise multiblock regression. Compstat 2016, Aug 2016, Oviedo, Spain. ⟨hal-02500591⟩

Share

Metrics

Record views

23