Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion

ABSTRACT: To embrace big-data neuroimaging, harmonizing the site effect in resting-state functional magnetic resonance imaging (R-fMRI) data fusion is a fundamental challenge. A comprehensive evaluation of potentially effective harmonization strategies, particularly with specifically collected data,...

Full description

Bibliographic Details
Main Authors: Yu-Wei Wang, Xiao Chen, Chao-Gan Yan
Format: Article
Language:English
Published: Elsevier 2023-07-01
Series:NeuroImage
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1053811923002355
_version_ 1797826942438735872
author Yu-Wei Wang
Xiao Chen
Chao-Gan Yan
author_facet Yu-Wei Wang
Xiao Chen
Chao-Gan Yan
author_sort Yu-Wei Wang
collection DOAJ
description ABSTRACT: To embrace big-data neuroimaging, harmonizing the site effect in resting-state functional magnetic resonance imaging (R-fMRI) data fusion is a fundamental challenge. A comprehensive evaluation of potentially effective harmonization strategies, particularly with specifically collected data, has been scarce, especially for R-fMRI metrics. Here, we comprehensively assess harmonization strategies from multiple perspectives, including tests on residual site effect, individual identification, test-retest reliability, and replicability of group-level statistical results, on widely used R-fMRI metrics across various datasets, including data obtained from participants with repetitive measures at different scanners. For individual identifiability (i.e., whether the same subject could be identified across R-fMRI data scanned across different sites), we found that, while most methods decreased site effects, the Subsampling Maximum-mean-distance based distribution shift correction Algorithm (SMA) and parametric unadjusted CovBat outperformed linear regression models, linear mixed models, ComBat series and invariant conditional variational auto-encoder in clustering accuracy. Test-retest reliability was better for SMA and parametric adjusted CovBat than unadjusted ComBat series and parametric unadjusted CovBat in the number of overlapped voxels. At the same time, SMA was superior to the latter in replicability in terms of the Dice coefficient and the scale of brain areas showing sex differences reproducibly observed across datasets. Furthermore, SMA better detected reproducible sex differences of ALFF under the site-sex confounded situation. Moreover, we designed experiments to identify the best target site features to optimize SMA identifiability, test-retest reliability, and stability. We noted both sample size and distribution of the target site matter and introduced a heuristic formula for selecting the target site. In addition to providing practical guidelines, this work can inform continuing improvements and innovations in harmonizing methodologies for big R-fMRI data.
first_indexed 2024-04-09T12:40:22Z
format Article
id doaj.art-fd03f0c992424a14ba1d652f7f895b46
institution Directory Open Access Journal
issn 1095-9572
language English
last_indexed 2024-04-09T12:40:22Z
publishDate 2023-07-01
publisher Elsevier
record_format Article
series NeuroImage
spelling doaj.art-fd03f0c992424a14ba1d652f7f895b462023-05-15T04:13:47ZengElsevierNeuroImage1095-95722023-07-01274120089Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusionYu-Wei Wang0Xiao Chen1Chao-Gan Yan2CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing 100101, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing 100049, China; International Big-Data Center for Depression Research, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, ChinaCAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing 100101, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing 100049, China; International Big-Data Center for Depression Research, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China; Magnetic Resonance Imaging Research Center, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China.CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing 100101, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing 100049, China; International Big-Data Center for Depression Research, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China; Magnetic Resonance Imaging Research Center, Institute of Psychology, Chinese Academy of Sciences, Beijing 100101, China.; Corresponding author: Chao-Gan Yan, CAS Key Laboratory of Behavioral Science, Institute of Psychology, 16 Lincui Road, Chaoyang District, Beijing 100101, China, Tel: +86-10-64101582, Fax: +86-10-64101582ABSTRACT: To embrace big-data neuroimaging, harmonizing the site effect in resting-state functional magnetic resonance imaging (R-fMRI) data fusion is a fundamental challenge. A comprehensive evaluation of potentially effective harmonization strategies, particularly with specifically collected data, has been scarce, especially for R-fMRI metrics. Here, we comprehensively assess harmonization strategies from multiple perspectives, including tests on residual site effect, individual identification, test-retest reliability, and replicability of group-level statistical results, on widely used R-fMRI metrics across various datasets, including data obtained from participants with repetitive measures at different scanners. For individual identifiability (i.e., whether the same subject could be identified across R-fMRI data scanned across different sites), we found that, while most methods decreased site effects, the Subsampling Maximum-mean-distance based distribution shift correction Algorithm (SMA) and parametric unadjusted CovBat outperformed linear regression models, linear mixed models, ComBat series and invariant conditional variational auto-encoder in clustering accuracy. Test-retest reliability was better for SMA and parametric adjusted CovBat than unadjusted ComBat series and parametric unadjusted CovBat in the number of overlapped voxels. At the same time, SMA was superior to the latter in replicability in terms of the Dice coefficient and the scale of brain areas showing sex differences reproducibly observed across datasets. Furthermore, SMA better detected reproducible sex differences of ALFF under the site-sex confounded situation. Moreover, we designed experiments to identify the best target site features to optimize SMA identifiability, test-retest reliability, and stability. We noted both sample size and distribution of the target site matter and introduced a heuristic formula for selecting the target site. In addition to providing practical guidelines, this work can inform continuing improvements and innovations in harmonizing methodologies for big R-fMRI data.http://www.sciencedirect.com/science/article/pii/S1053811923002355ComparisonHarmonizationMulti-site poolingResting-state fMRI
spellingShingle Yu-Wei Wang
Xiao Chen
Chao-Gan Yan
Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion
NeuroImage
Comparison
Harmonization
Multi-site pooling
Resting-state fMRI
title Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion
title_full Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion
title_fullStr Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion
title_full_unstemmed Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion
title_short Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion
title_sort comprehensive evaluation of harmonization on functional brain imaging for multisite data fusion
topic Comparison
Harmonization
Multi-site pooling
Resting-state fMRI
url http://www.sciencedirect.com/science/article/pii/S1053811923002355
work_keys_str_mv AT yuweiwang comprehensiveevaluationofharmonizationonfunctionalbrainimagingformultisitedatafusion
AT xiaochen comprehensiveevaluationofharmonizationonfunctionalbrainimagingformultisitedatafusion
AT chaoganyan comprehensiveevaluationofharmonizationonfunctionalbrainimagingformultisitedatafusion