What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns

Inka Rösel; Lina María Serna-Higuita; Fatima Al Sayah; Maresa Buchholz; Ines Buchholz; Thomas Kohlmann; Peter Martus; You-Shan Feng

doi:10.1007/s11136-021-03037-3

What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns

Standard

What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns. / Rösel, Inka; Serna-Higuita, Lina María; Al Sayah, Fatima; Buchholz, Maresa; Buchholz, Ines; Kohlmann, Thomas; Martus, Peter; Feng, You-Shan.

In: QUAL LIFE RES, Vol. 31, No. 5, 05.2022, p. 1521-1532.

Research output: SCORING: Contribution to journal › SCORING: Journal article › Research › peer-review

Harvard

Rösel, I, Serna-Higuita, LM, Al Sayah, F, Buchholz, M, Buchholz, I, Kohlmann, T, Martus, P & Feng, Y-S 2022, 'What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns', QUAL LIFE RES, vol. 31, no. 5, pp. 1521-1532. https://doi.org/10.1007/s11136-021-03037-3

APA

Rösel, I., Serna-Higuita, L. M., Al Sayah, F., Buchholz, M., Buchholz, I., Kohlmann, T., Martus, P., & Feng, Y-S. (2022). What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns. QUAL LIFE RES, 31(5), 1521-1532. https://doi.org/10.1007/s11136-021-03037-3

Vancouver

Rösel I, Serna-Higuita LM, Al Sayah F, Buchholz M, Buchholz I, Kohlmann T et al. What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns. QUAL LIFE RES. 2022 May;31(5):1521-1532. https://doi.org/10.1007/s11136-021-03037-3

Bibtex

@article{5620d2ecfa1a44fdb8f1238e9fa503cd,

title = "What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns",

abstract = "PURPOSE: Although multiple imputation is the state-of-the-art method for managing missing data, mixed models without multiple imputation may be equally valid for longitudinal data. Additionally, it is not clear whether missing values in multi-item instruments should be imputed at item or score-level. We therefore explored the differences in analyzing the scores of a health-related quality of life questionnaire (EQ-5D-5L) using four approaches in two empirical datasets.METHODS: We used simulated (GR dataset) and observed missingness patterns (ABCD dataset) in EQ-5D-5L scores to investigate the following approaches: approach-1) mixed models using respondents with complete cases, approach-2) mixed models using all available data, approach-3) mixed models after multiple imputation of the EQ-5D-5L scores, and approach-4) mixed models after multiple imputation of EQ-5D 5L items.RESULTS: Approach-1 yielded the highest estimates of all approaches (ABCD, GR), increasingly overestimating the EQ-5D-5L score with higher percentages of missing data (GR). Approach-4 produced the lowest scores at follow-up evaluations (ABCD, GR). Standard errors (0.006-0.008) and mean squared errors (0.032-0.035) increased with increasing percentages of simulated missing GR data. Approaches 2 and 3 showed similar results (both datasets).CONCLUSION: Complete cases analyses overestimated the scores and mixed models after multiple imputation by items yielded the lowest scores. As there was no loss of accuracy, mixed models without multiple imputation, when baseline covariates are complete, might be the most parsimonious choice to deal with missing data. However, multiple imputation may be needed when baseline covariates are missing and/or more than two timepoints are considered.",

author = "Inka R{\"o}sel and Serna-Higuita, {Lina Mar{\'i}a} and {Al Sayah}, Fatima and Maresa Buchholz and Ines Buchholz and Thomas Kohlmann and Peter Martus and You-Shan Feng",

note = "{\textcopyright} 2021. The Author(s).",

year = "2022",

month = may,

doi = "10.1007/s11136-021-03037-3",

language = "English",

volume = "31",

pages = "1521--1532",

journal = "QUAL LIFE RES",

issn = "0962-9343",

publisher = "Springer Netherlands",

number = "5",

}

RIS

TY - JOUR

T1 - What difference does multiple imputation make in longitudinal modeling of EQ-5D-5L data? Empirical analyses of simulated and observed missing data patterns

AU - Rösel, Inka

AU - Serna-Higuita, Lina María

AU - Al Sayah, Fatima

AU - Buchholz, Maresa

AU - Buchholz, Ines

AU - Kohlmann, Thomas

AU - Martus, Peter

AU - Feng, You-Shan

PY - 2022/5

Y1 - 2022/5

N2 - PURPOSE: Although multiple imputation is the state-of-the-art method for managing missing data, mixed models without multiple imputation may be equally valid for longitudinal data. Additionally, it is not clear whether missing values in multi-item instruments should be imputed at item or score-level. We therefore explored the differences in analyzing the scores of a health-related quality of life questionnaire (EQ-5D-5L) using four approaches in two empirical datasets.METHODS: We used simulated (GR dataset) and observed missingness patterns (ABCD dataset) in EQ-5D-5L scores to investigate the following approaches: approach-1) mixed models using respondents with complete cases, approach-2) mixed models using all available data, approach-3) mixed models after multiple imputation of the EQ-5D-5L scores, and approach-4) mixed models after multiple imputation of EQ-5D 5L items.RESULTS: Approach-1 yielded the highest estimates of all approaches (ABCD, GR), increasingly overestimating the EQ-5D-5L score with higher percentages of missing data (GR). Approach-4 produced the lowest scores at follow-up evaluations (ABCD, GR). Standard errors (0.006-0.008) and mean squared errors (0.032-0.035) increased with increasing percentages of simulated missing GR data. Approaches 2 and 3 showed similar results (both datasets).CONCLUSION: Complete cases analyses overestimated the scores and mixed models after multiple imputation by items yielded the lowest scores. As there was no loss of accuracy, mixed models without multiple imputation, when baseline covariates are complete, might be the most parsimonious choice to deal with missing data. However, multiple imputation may be needed when baseline covariates are missing and/or more than two timepoints are considered.

AB - PURPOSE: Although multiple imputation is the state-of-the-art method for managing missing data, mixed models without multiple imputation may be equally valid for longitudinal data. Additionally, it is not clear whether missing values in multi-item instruments should be imputed at item or score-level. We therefore explored the differences in analyzing the scores of a health-related quality of life questionnaire (EQ-5D-5L) using four approaches in two empirical datasets.METHODS: We used simulated (GR dataset) and observed missingness patterns (ABCD dataset) in EQ-5D-5L scores to investigate the following approaches: approach-1) mixed models using respondents with complete cases, approach-2) mixed models using all available data, approach-3) mixed models after multiple imputation of the EQ-5D-5L scores, and approach-4) mixed models after multiple imputation of EQ-5D 5L items.RESULTS: Approach-1 yielded the highest estimates of all approaches (ABCD, GR), increasingly overestimating the EQ-5D-5L score with higher percentages of missing data (GR). Approach-4 produced the lowest scores at follow-up evaluations (ABCD, GR). Standard errors (0.006-0.008) and mean squared errors (0.032-0.035) increased with increasing percentages of simulated missing GR data. Approaches 2 and 3 showed similar results (both datasets).CONCLUSION: Complete cases analyses overestimated the scores and mixed models after multiple imputation by items yielded the lowest scores. As there was no loss of accuracy, mixed models without multiple imputation, when baseline covariates are complete, might be the most parsimonious choice to deal with missing data. However, multiple imputation may be needed when baseline covariates are missing and/or more than two timepoints are considered.

U2 - 10.1007/s11136-021-03037-3

DO - 10.1007/s11136-021-03037-3

M3 - SCORING: Journal article

C2 - 34797507

VL - 31

SP - 1521

EP - 1532

JO - QUAL LIFE RES

JF - QUAL LIFE RES

SN - 0962-9343

IS - 5

ER -