Regularized clusterwise multiblock regression - Cnam - Conservatoire national des arts et métiers
Conference paper, Year: 2016

Regularized clusterwise multiblock regression

Abstract

Regression coefficients are usually estimated under the assumption that observations come from a single homogeneous population. However, in many applications this assumption does not hold, and the overall model fails to recover the specificities of the potential cluster models. For the case where the variables are, in addition, structured into one block of dependent variables and several blocks of explanatory variables, we propose a new method called regularized clusterwise multiblock regression. This method aims to obtain simultaneously, through a single criterion: a reduction of the explanatory variables to components that can be intermediate between those of multiblock PLS and multiblock Redundancy Analysis; a partition of the observations into several clusters; and the corresponding cluster-specific multiblock regression coefficients. The three unknown parameters of this criterion, namely the regularization parameter, which aims to stabilize the inversion of the block variance-covariance matrices, the number of components, and the number of clusters, are all chosen so as to minimize the prediction error estimated by ten-fold cross-validation. A simulation study is carried out to assess the performance of the method, and an empirical application in the field of consumer satisfaction illustrates its usefulness.
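To make the clusterwise idea in the abstract concrete, here is a minimal sketch in Python. It is not the authors' algorithm: it leaves out the multiblock components entirely and simply alternates between fitting one ridge regression per cluster and reassigning each observation to the cluster whose model predicts it best. The function names (`ridge_fit`, `clusterwise_ridge`), the ridge penalty `lam` standing in for the regularization parameter, and the random initialization are all assumptions made for illustration.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Ridge coefficients (intercept first); the intercept is not penalized."""
    Z = np.column_stack([np.ones(len(X)), X])
    penalty = lam * np.eye(Z.shape[1])
    penalty[0, 0] = 0.0  # leave the intercept unshrunk
    return np.linalg.solve(Z.T @ Z + penalty, Z.T @ y)

def clusterwise_ridge(X, y, n_clusters, lam=1e-3, n_iter=50, seed=0):
    """Alternate between per-cluster ridge fits and reassignment of observations.

    Illustrative stand-in for a clusterwise regression loop, not the
    multiblock criterion described in the abstract.
    """
    rng = np.random.default_rng(seed)
    n = len(X)
    labels = rng.integers(n_clusters, size=n)  # random initial partition
    Z = np.column_stack([np.ones(n), X])
    coefs = np.zeros((n_clusters, Z.shape[1]))
    for _ in range(n_iter):
        # Step 1: refit one regression per non-empty cluster.
        for k in range(n_clusters):
            members = labels == k
            if members.any():  # an emptied cluster keeps its previous coefficients
                coefs[k] = ridge_fit(X[members], y[members], lam)
        # Step 2: assign each observation to its best-predicting cluster model.
        sq_res = (y[:, None] - Z @ coefs.T) ** 2
        new_labels = sq_res.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break  # partition is stable
        labels = new_labels
    # Sum of squared residuals under the final assignment and current models.
    sse = sq_res[np.arange(n), labels].sum()
    return labels, coefs, sse
```

In a setting like the one the abstract describes, `lam` and `n_clusters` would be tuned by cross-validated prediction error rather than fixed by hand. The result of this loop also depends on the random initial partition, so several restarts are usually advisable.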
No file deposited

Dates and versions

hal-02500591, version 1 (06-03-2020)

Identifiers

  • HAL Id: hal-02500591, version 1

Cite

Stéphanie Bougeard, Ndèye Niang, Gilbert Saporta. Regularized clusterwise multiblock regression. Compstat 2016, Aug 2016, Oviedo, Spain. ⟨hal-02500591⟩
