Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.

Given the expanding availability of scientific data and tools to analyze them, combining different assessments of the same piece of information has become increasingly important for social, biological, and even physical sciences. This task demands, to begin with, a method-independent standard, such...

Full description

Bibliographic Details
Main Authors: Gelio Alves, Yi-Kuo Yu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2011-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3166143?pdf=render
_version_ 1811276571242135552
author Gelio Alves
Yi-Kuo Yu
author_facet Gelio Alves
Yi-Kuo Yu
author_sort Gelio Alves
collection DOAJ
description Given the expanding availability of scientific data and tools to analyze them, combining different assessments of the same piece of information has become increasingly important for social, biological, and even physical sciences. This task demands, to begin with, a method-independent standard, such as the P-value, that can be used to assess the reliability of a piece of information. Good's formula and Fisher's method combine independent P-values with respectively unequal and equal weights. Both approaches may be regarded as limiting instances of a general case of combining P-values from m groups; P-values within each group are weighted equally, while weight varies by group. When some of the weights become nearly degenerate, as cautioned by Good, numeric instability occurs in computation of the combined P-values. We deal explicitly with this difficulty by deriving a controlled expansion, in powers of differences in inverse weights, that provides both accurate statistics and stable numerics. We illustrate the utility of this systematic approach with a few examples. In addition, we also provide here an alternative derivation for the probability distribution function of the general case and show how the analytic formula obtained reduces to both Good's and Fisher's methods as special cases. A C++ program, which computes the combined P-values with equal numerical stability regardless of whether weights are (nearly) degenerate or not, is available for download at our group website http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/CoinedPValues.html.
first_indexed 2024-04-13T00:00:00Z
format Article
id doaj.art-3474a0020ea34c199caca8bdc5a7b233
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-13T00:00:00Z
publishDate 2011-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-3474a0020ea34c199caca8bdc5a7b2332022-12-22T03:11:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032011-01-0168e2264710.1371/journal.pone.0022647Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.Gelio AlvesYi-Kuo YuGiven the expanding availability of scientific data and tools to analyze them, combining different assessments of the same piece of information has become increasingly important for social, biological, and even physical sciences. This task demands, to begin with, a method-independent standard, such as the P-value, that can be used to assess the reliability of a piece of information. Good's formula and Fisher's method combine independent P-values with respectively unequal and equal weights. Both approaches may be regarded as limiting instances of a general case of combining P-values from m groups; P-values within each group are weighted equally, while weight varies by group. When some of the weights become nearly degenerate, as cautioned by Good, numeric instability occurs in computation of the combined P-values. We deal explicitly with this difficulty by deriving a controlled expansion, in powers of differences in inverse weights, that provides both accurate statistics and stable numerics. We illustrate the utility of this systematic approach with a few examples. In addition, we also provide here an alternative derivation for the probability distribution function of the general case and show how the analytic formula obtained reduces to both Good's and Fisher's methods as special cases. A C++ program, which computes the combined P-values with equal numerical stability regardless of whether weights are (nearly) degenerate or not, is available for download at our group website http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/CoinedPValues.html.http://europepmc.org/articles/PMC3166143?pdf=render
spellingShingle Gelio Alves
Yi-Kuo Yu
Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
PLoS ONE
title Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_full Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_fullStr Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_full_unstemmed Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_short Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_sort combining independent weighted p values achieving computational stability by a systematic expansion with controllable accuracy
url http://europepmc.org/articles/PMC3166143?pdf=render
work_keys_str_mv AT gelioalves combiningindependentweightedpvaluesachievingcomputationalstabilitybyasystematicexpansionwithcontrollableaccuracy
AT yikuoyu combiningindependentweightedpvaluesachievingcomputationalstabilitybyasystematicexpansionwithcontrollableaccuracy