protGear: A protein microarray data pre-processing suite

Protein microarrays are versatile tools for high throughput study of the human proteome, but systematic and non-systematic sources of bias constrain optimal interpretation and the ultimate utility of the data. Published guidelines to limit technical variability whilst maintaining important biologica...

Full description

Bibliographic Details
Main Authors: Mwai, K, Kibinge, N, Tuju, J, Kamuyu, G, Kimathi, R, Mburu, J, Chepsat, E, Nyamako, L, Chege, T, Nkumama, I, Kinyanjui, S, Musenge, E, Osier, F
Format: Journal article
Language:English
Published: Elsevier 2021
_version_ 1797083489134182400
author Mwai, K
Kibinge, N
Tuju, J
Kamuyu, G
Kimathi, R
Mburu, J
Chepsat, E
Nyamako, L
Chege, T
Nkumama, I
Kinyanjui, S
Musenge, E
Osier, F
author_facet Mwai, K
Kibinge, N
Tuju, J
Kamuyu, G
Kimathi, R
Mburu, J
Chepsat, E
Nyamako, L
Chege, T
Nkumama, I
Kinyanjui, S
Musenge, E
Osier, F
author_sort Mwai, K
collection OXFORD
description Protein microarrays are versatile tools for high throughput study of the human proteome, but systematic and non-systematic sources of bias constrain optimal interpretation and the ultimate utility of the data. Published guidelines to limit technical variability whilst maintaining important biological variation favour DNA-based microarrays that often differ fundamentally in their experimental design. Rigorous tools to guide background correction, the quantification of within-sample variation, normalisation, and batch correction specifically for protein microarrays are limited, require extensive investigation and are not centrally accessible. Here, we develop a generic one-stop-shop pre-processing suite for protein microarrays that is compatible with data from the major protein microarray scanners. Our graphical and tabular interfaces facilitate a detailed inspection of data and are coupled with supporting guidelines that enable users to select the most appropriate algorithms to systematically address bias arising in customized experiments. The localization and distribution of background signal intensities determine the optimal correction strategy. A novel function overcomes the limitations in the interpretation of the coefficient of variation when signal intensities are at the lower end of the detection threshold. We demonstrate essential considerations in the experimental design and their impact on a range of algorithms for normalization and minimization of batch effects. Our user-friendly interactive web-based platform eliminates the need for prowess in programming. The open-source R interface includes illustrative examples, generates an auditable record, enables reproducibility, and can incorporate additional custom scripts through its online repository. This versatility will enhance its broad uptake in the infectious disease and vaccine development community.
first_indexed 2024-03-07T01:42:21Z
format Journal article
id oxford-uuid:974fc8d8-902e-4019-b1d5-8c1eb17398fb
institution University of Oxford
language English
last_indexed 2024-03-07T01:42:21Z
publishDate 2021
publisher Elsevier
record_format dspace
spelling oxford-uuid:974fc8d8-902e-4019-b1d5-8c1eb17398fb2022-03-26T23:58:40ZprotGear: A protein microarray data pre-processing suiteJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:974fc8d8-902e-4019-b1d5-8c1eb17398fbEnglishSymplectic ElementsElsevier2021Mwai, KKibinge, NTuju, JKamuyu, GKimathi, RMburu, JChepsat, ENyamako, LChege, TNkumama, IKinyanjui, SMusenge, EOsier, FProtein microarrays are versatile tools for high throughput study of the human proteome, but systematic and non-systematic sources of bias constrain optimal interpretation and the ultimate utility of the data. Published guidelines to limit technical variability whilst maintaining important biological variation favour DNA-based microarrays that often differ fundamentally in their experimental design. Rigorous tools to guide background correction, the quantification of within-sample variation, normalisation, and batch correction specifically for protein microarrays are limited, require extensive investigation and are not centrally accessible. Here, we develop a generic one-stop-shop pre-processing suite for protein microarrays that is compatible with data from the major protein microarray scanners. Our graphical and tabular interfaces facilitate a detailed inspection of data and are coupled with supporting guidelines that enable users to select the most appropriate algorithms to systematically address bias arising in customized experiments. The localization and distribution of background signal intensities determine the optimal correction strategy. A novel function overcomes the limitations in the interpretation of the coefficient of variation when signal intensities are at the lower end of the detection threshold. We demonstrate essential considerations in the experimental design and their impact on a range of algorithms for normalization and minimization of batch effects. Our user-friendly interactive web-based platform eliminates the need for prowess in programming. The open-source R interface includes illustrative examples, generates an auditable record, enables reproducibility, and can incorporate additional custom scripts through its online repository. This versatility will enhance its broad uptake in the infectious disease and vaccine development community.
spellingShingle Mwai, K
Kibinge, N
Tuju, J
Kamuyu, G
Kimathi, R
Mburu, J
Chepsat, E
Nyamako, L
Chege, T
Nkumama, I
Kinyanjui, S
Musenge, E
Osier, F
protGear: A protein microarray data pre-processing suite
title protGear: A protein microarray data pre-processing suite
title_full protGear: A protein microarray data pre-processing suite
title_fullStr protGear: A protein microarray data pre-processing suite
title_full_unstemmed protGear: A protein microarray data pre-processing suite
title_short protGear: A protein microarray data pre-processing suite
title_sort protgear a protein microarray data pre processing suite
work_keys_str_mv AT mwaik protgearaproteinmicroarraydatapreprocessingsuite
AT kibingen protgearaproteinmicroarraydatapreprocessingsuite
AT tujuj protgearaproteinmicroarraydatapreprocessingsuite
AT kamuyug protgearaproteinmicroarraydatapreprocessingsuite
AT kimathir protgearaproteinmicroarraydatapreprocessingsuite
AT mburuj protgearaproteinmicroarraydatapreprocessingsuite
AT chepsate protgearaproteinmicroarraydatapreprocessingsuite
AT nyamakol protgearaproteinmicroarraydatapreprocessingsuite
AT cheget protgearaproteinmicroarraydatapreprocessingsuite
AT nkumamai protgearaproteinmicroarraydatapreprocessingsuite
AT kinyanjuis protgearaproteinmicroarraydatapreprocessingsuite
AT musengee protgearaproteinmicroarraydatapreprocessingsuite
AT osierf protgearaproteinmicroarraydatapreprocessingsuite