Variable selection for longitudinal biomarkers constrained by a detection limit

Julia Geronimi; Gilbert Saporta

Communication Dans Un Congrès Année : 2016

Variable selection for longitudinal biomarkers constrained by a detection limit

(1) , (2)

1
2

Julia Geronimi

Fonction : Auteur
PersonId : 997361

Centre d'études et de recherche en informatique et communications

Gilbert Saporta

Fonction : Auteur
PersonId : 180161
IdHAL : gilbert-saporta
ORCID : 0000-0002-3406-5887
IdRef : 027122565

CEDRIC. Méthodes statistiques de data-mining et apprentissage

Résumé

Repeated measures over time are common in the biomedical field, and widely used to analyze the link between covariates and a clinical criterion. In a longitudinal context, a high number of variables associated with the presence of missing data, are complex issues to be resolved. We deal with several types of covariates, some suffer from haphazard missingness, and others are subject to detection thresholds. For the latter, Tobit regression combined with bootstrap is an unbiased approach, but it needs complete predictors for the mean model. An adaptation of the wellknown multivariate imputation by chained equation is proposed. We use the Tobit model as the imputation method for covariates below the detection limit, predictive mean matching and logistic regression for others. Variable selection is done by using MI-PGEE which consists in the following ingredients: a) a group LASSO penalty is imposed on the group of estimated regression coefficients of the same variable across multiplyimputed datasets leading to a consistent selection. The optimal shrinkage parameter is chosen by minimizing a BIC-like criterion. b) GEE allows integrating correlations due to the longitudinal context. The usefulness of the new method is illustrated by an application on the FNIH project of the Osteoarthritis Initiative.

Mots clés

detection limit missing values GEE

Domaines

Statistiques [stat]

Gilbert Saporta : Connectez-vous pour contacter le contributeur

https://cnam.hal.science/hal-02500582

Soumis le : vendredi 6 mars 2020-10:37:13

Dernière modification le : mercredi 28 septembre 2022-05:54:07

Dates et versions

hal-02500582 , version 1 (06-03-2020)

Identifiants

HAL Id : hal-02500582 , version 1

Citer

Julia Geronimi, Gilbert Saporta. Variable selection for longitudinal biomarkers constrained by a detection limit. Compstat 2016, Aug 2016, Oviedo, Spain. ⟨hal-02500582⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNAM CEDRIC-CNAM HESAM

48 Consultations

0 Téléchargements

Variable selection for longitudinal biomarkers constrained by a detection limit

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager