Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)

Background: Missing covariates are common in observational research and can lead to bias and loss of statistical power. Limited data regarding prognostic factors of survival outcomes of sarcomas in irradiated fields (SIF) are available. Because of the long lag time between irradiation of first cance...

Full description

Bibliographic Details
Main Authors: Noémie Huchet, Nicolas Penel, Sylvie Bonvalot, Juliette Thariat, Françoise Ducimetière, Antoine Giraud, Maud Toulmonde, Axel Le Cesne, Jean-Yves Blay, Carine Bellera
Format: Article
Language:English
Published: SAGE Publishing 2024-01-01
Series:Therapeutic Advances in Medical Oncology
Online Access:https://doi.org/10.1177/17588359231220999
_version_ 1797351044575920128
author Noémie Huchet
Nicolas Penel
Sylvie Bonvalot
Juliette Thariat
Françoise Ducimetière
Antoine Giraud
Maud Toulmonde
Axel Le Cesne
Jean-Yves Blay
Carine Bellera
author_facet Noémie Huchet
Nicolas Penel
Sylvie Bonvalot
Juliette Thariat
Françoise Ducimetière
Antoine Giraud
Maud Toulmonde
Axel Le Cesne
Jean-Yves Blay
Carine Bellera
author_sort Noémie Huchet
collection DOAJ
description Background: Missing covariates are common in observational research and can lead to bias and loss of statistical power. Limited data regarding prognostic factors of survival outcomes of sarcomas in irradiated fields (SIF) are available. Because of the long lag time between irradiation of first cancer and scarcity of SIF, missing data are a critical issue when analyzing long-term outcomes. We assessed prognostic factors of overall (OS), progression-free (PFS), and metastatic-progression-free (MPFS) survivals in SIF using three methods to account for missing covariates. Methods: We relied on the NETSARC French Sarcoma Group database, Cox (OS/PFS), and competitive hazards (MPFS) survival models. Covariates investigated were age, sex, histological subtype, tumor size, depth and grade, metastasis, surgery, surgical resection, surgeon’s expertise, imaging, and neo-adjuvant treatment. We first applied multiple imputation (MI): observed data were used to estimate the missing covariate. With the missing-data modality approach, a category missing was created for qualitative variables. With the complete-case (CC) approach, analysis was restricted to patients without missing covariates. Results: CC subjects ( N = 167; 33%) presented more often with soft-tissue sarcoma ( versus visceral sarcoma) and grade I–II tumors as compared to the 504 eligible cases. With MI ( N = 504), factors associated with the worst outcome included metastasis ( p = 0.04) and R1/R2 resection ( p < 0.001) for OS; higher grade/non-gradable tumors ( p = 0.002) and R1/R2 resection ( p < 0.001) for PFS; and metastasis ( p = 0.01) for M-PFS. The ‘missing-data modality’ approach ( N = 504) led to different associations, including significance reached due to variables with the modality ‘missing’. The CC analysis led to different results and reduced precision. Conclusion: The CC population was not representative of the eligible population, introducing bias, in addition to worst precision. The ‘missing-data modality method’ results in biased estimates in non-randomized studies, as outcomes may be related to variables with missing values. Appropriate statistical methods for missing covariates, for example, MI, should therefore be considered.
first_indexed 2024-03-08T12:54:51Z
format Article
id doaj.art-f62d8bca81c14c98a54fbe0ca2d36164
institution Directory Open Access Journal
issn 1758-8359
language English
last_indexed 2024-03-08T12:54:51Z
publishDate 2024-01-01
publisher SAGE Publishing
record_format Article
series Therapeutic Advances in Medical Oncology
spelling doaj.art-f62d8bca81c14c98a54fbe0ca2d361642024-01-19T19:04:15ZengSAGE PublishingTherapeutic Advances in Medical Oncology1758-83592024-01-011610.1177/17588359231220999Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)Noémie HuchetNicolas PenelSylvie BonvalotJuliette ThariatFrançoise DucimetièreAntoine GiraudMaud ToulmondeAxel Le CesneJean-Yves BlayCarine BelleraBackground: Missing covariates are common in observational research and can lead to bias and loss of statistical power. Limited data regarding prognostic factors of survival outcomes of sarcomas in irradiated fields (SIF) are available. Because of the long lag time between irradiation of first cancer and scarcity of SIF, missing data are a critical issue when analyzing long-term outcomes. We assessed prognostic factors of overall (OS), progression-free (PFS), and metastatic-progression-free (MPFS) survivals in SIF using three methods to account for missing covariates. Methods: We relied on the NETSARC French Sarcoma Group database, Cox (OS/PFS), and competitive hazards (MPFS) survival models. Covariates investigated were age, sex, histological subtype, tumor size, depth and grade, metastasis, surgery, surgical resection, surgeon’s expertise, imaging, and neo-adjuvant treatment. We first applied multiple imputation (MI): observed data were used to estimate the missing covariate. With the missing-data modality approach, a category missing was created for qualitative variables. With the complete-case (CC) approach, analysis was restricted to patients without missing covariates. Results: CC subjects ( N = 167; 33%) presented more often with soft-tissue sarcoma ( versus visceral sarcoma) and grade I–II tumors as compared to the 504 eligible cases. With MI ( N = 504), factors associated with the worst outcome included metastasis ( p = 0.04) and R1/R2 resection ( p < 0.001) for OS; higher grade/non-gradable tumors ( p = 0.002) and R1/R2 resection ( p < 0.001) for PFS; and metastasis ( p = 0.01) for M-PFS. The ‘missing-data modality’ approach ( N = 504) led to different associations, including significance reached due to variables with the modality ‘missing’. The CC analysis led to different results and reduced precision. Conclusion: The CC population was not representative of the eligible population, introducing bias, in addition to worst precision. The ‘missing-data modality method’ results in biased estimates in non-randomized studies, as outcomes may be related to variables with missing values. Appropriate statistical methods for missing covariates, for example, MI, should therefore be considered.https://doi.org/10.1177/17588359231220999
spellingShingle Noémie Huchet
Nicolas Penel
Sylvie Bonvalot
Juliette Thariat
Françoise Ducimetière
Antoine Giraud
Maud Toulmonde
Axel Le Cesne
Jean-Yves Blay
Carine Bellera
Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)
Therapeutic Advances in Medical Oncology
title Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)
title_full Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)
title_fullStr Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)
title_full_unstemmed Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)
title_short Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)
title_sort handling missing covariates in observational studies an illustration with the assessment of prognostic factors of survival outcomes in soft tissue or visceral sarcomas in irradiated fields sif
url https://doi.org/10.1177/17588359231220999
work_keys_str_mv AT noemiehuchet handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT nicolaspenel handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT sylviebonvalot handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT juliettethariat handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT francoiseducimetiere handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT antoinegiraud handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT maudtoulmonde handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT axellecesne handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT jeanyvesblay handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif
AT carinebellera handlingmissingcovariatesinobservationalstudiesanillustrationwiththeassessmentofprognosticfactorsofsurvivaloutcomesinsofttissueorvisceralsarcomasinirradiatedfieldssif