Abstract
We encounter missing data in many longitudinal studies. When the missing data are nonignorable, it is important to analyze the data by incorporating the missing data mechanism into the observed data likelihood function. The classical maximum likelihood (ML) method for analyzing longitudinal missing data has been extensively studied in the literature. However, it is well-known that the ordinary ML estimators are sensitive to extreme observations or outliers in the data. In this paper, we propose and explore a robust method, which is developed in the framework of the ML method, and is useful for downweighting any influential observations in the data when estimating the model parameters. We study the empirical properties of the robust estimators in small simulations. We also illustrate the robust method using incomplete longitudinal data on CD4 counts from clinical trials of HIV-infected patients.
Similar content being viewed by others
References
Baker SG, Laird NM (1988) Regression analysis for categorical variables with outcome subject to nonresponse. J Am Stat Assoc 83: 62–69
Beaumont JF (1999) A robust estimation method in the presence of nonignorable nonresponse. In: Proceedings of the section on survey research methods. American Statistical Association, pp 819–824
Brown CH (1990) Protecting against nonrandomly missing data in longitudinal studies. Biometrics 46: 143–157
Cantoni E, Ronchetti E (2001) Robust inference for generalized linear models. J Am Stat Assoc 96: 1022–1030
Dantan E, Proust-Lima C, Letenneur L, Jacqmin-Gadda H (2008) Pattern mixture models and latent class models for the analysis of multivariate longitudinal data with informative dropouts. Int J Biostat 4 (article 14)
Diggle P, Kenward MG (1994) Informative dropout in longitudinal data analysi (with discussion). Appl Stat 43: 49–94
Gallant JE, Moore RD, Richman DD, Keruly J, Chaisson RE (1992) Incidence and natural history of cytomegalovirus disease in patients with advanced human immunodeficiency virus disease treated with Zidovudine. J Inf Dis 166: 1223–1227
Ibrahim JG, Lipsitz SR, Chen MH (1999) Missing covariates in generalized linear models when the missing data mechanism is non-ignorable. J R Stat Soc Ser B 61: 173–190
Ibrahim JG, Chen MH, Lipsitz SR (2001) Missing responses in generalized linear mixed models when the missing data mechanism is nonignorable. Biometrika 88: 551–564
Kahn JO, Lagakos SW, Richman DD (1992) A controlled trial comparing continued zidovudine with didanosine in human immunodeficiency virus infection. New Eng J Med 327: 581–587
Little RJA (1988) Robust estimation of the mean and covariance matrix from data with missing values. Appl Stat 37: 23–38
Little RJA (1995) Modeling the drop-Out mechanism in repeated-measures studies. J Am Stat Assoc 90: 1112–1121
Little RJA, Rubin DB (2002) Statistical Analysis with missing data, 2nd edn. Wiley, New Jersey
McCulloch CE (1997) Maximum likelihood algorithms for generalized linear mixed models. J Am Stat Assoc 92: 162–170
Molenberghs G, Verbeke G (2001) A review on linear mixed models for longitudinal data, possibly subject to dropout. Stat Modell 1: 235–269
Preisser JS, Galecki AT, Lohman KK, Wagenknecht LE (2000) Analysis of smoking trends with incomplete longitudinal binary responses. J Am Stat Assoc 95: 1021–1031
Preisser JS, Qaqish BF (1999) Robust regression to clustered data with application to binary responses. Biometrics 55: 574–579
Rousseeuw PJ, van Zomeren BC (1990) Unmasking multivariate outliers and leverage points. J Am Stat Assoc 85: 633–639
Rubin DB (1976) Inference and missing data. Biometrika 63: 581–592
Sinha SK (2004) Robust analysis of generalized linear mixed models. J Am Stat Assoc 99: 451–460
Sinha SK (2008) Robust methods for generalized linear models with nonignorable missing covariates. Can J Stat 36(2): 277–299
Verbeke G, Molenberghs G (2005) Longitudinal and incomplete clinical studies. Metron 63: 143–170
Wu L, Liu W, Liu J (2009) A longitudinal study of children’s aggressive behaviours based on multivariate mixed models with incomplete data. Can J Stat 37: 435–452
Xie H (2008) A local sensitivity analysis approach to longitudinal non-gaussian data with non-ignorable dropout. Stat Med 27: 3155–3177
Yi GY, Cook RJ (2002) Marginal methods for incomplete longitudinal data arising in clusters. J Am Stat Assoc 97: 1071–1080
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sinha, S.K. Robust analysis of longitudinal data with nonignorable missing responses. Metrika 75, 913–938 (2012). https://doi.org/10.1007/s00184-011-0359-3
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-011-0359-3