Bayesian aggregation of categorical distributions with applications in crowdsourcing

A key problem in crowdsourcing is the aggregation of judgments of proportions. For example, workers might be presented with a news article or an image, and be asked to identify the proportion of each topic, sentiment, object, or colour present in it. These varying judgments then need to be aggregate...

Full description

Bibliographic Details
Main Authors: Augustin, A, Venanzi, M, Hare, J, Rogers, A, Jennings, N
Format: Conference item
Published: AAAI Press / International Joint Conferences on Artificial Intelligence 2017
_version_ 1797051202976874496
author Augustin, A
Venanzi, M
Hare, J
Rogers, A
Jennings, N
author_facet Augustin, A
Venanzi, M
Hare, J
Rogers, A
Jennings, N
author_sort Augustin, A
collection OXFORD
description A key problem in crowdsourcing is the aggregation of judgments of proportions. For example, workers might be presented with a news article or an image, and be asked to identify the proportion of each topic, sentiment, object, or colour present in it. These varying judgments then need to be aggregated to form a consensus view of the document’s contents. Often, however, these judgments can be skewed by workers who provide judgments randomly (i.e. they are spammers). Spammers make the cost of acquiring judgments more expensive and degrade the accuracy of the aggregation. For such cases, we provide a new Bayesian framework for aggregating these responses (expressed in the form of categorical distributions) that for the first time accounts for spammers. We elicit 796 judgments about proportions of objects and colours in images. Experimental results on three real-world datasets show comparable aggregation accuracy when 60% of the workers are spammers, as other state of the art approaches do when there are no spammers.
first_indexed 2024-03-06T18:16:35Z
format Conference item
id oxford-uuid:04d242be-6364-47bc-bb57-41f62a448c61
institution University of Oxford
last_indexed 2024-03-06T18:16:35Z
publishDate 2017
publisher AAAI Press / International Joint Conferences on Artificial Intelligence
record_format dspace
spelling oxford-uuid:04d242be-6364-47bc-bb57-41f62a448c612022-03-26T08:53:51ZBayesian aggregation of categorical distributions with applications in crowdsourcingConference itemhttp://purl.org/coar/resource_type/c_5794uuid:04d242be-6364-47bc-bb57-41f62a448c61Symplectic Elements at OxfordAAAI Press / International Joint Conferences on Artificial Intelligence2017Augustin, AVenanzi, MHare, JRogers, AJennings, NA key problem in crowdsourcing is the aggregation of judgments of proportions. For example, workers might be presented with a news article or an image, and be asked to identify the proportion of each topic, sentiment, object, or colour present in it. These varying judgments then need to be aggregated to form a consensus view of the document’s contents. Often, however, these judgments can be skewed by workers who provide judgments randomly (i.e. they are spammers). Spammers make the cost of acquiring judgments more expensive and degrade the accuracy of the aggregation. For such cases, we provide a new Bayesian framework for aggregating these responses (expressed in the form of categorical distributions) that for the first time accounts for spammers. We elicit 796 judgments about proportions of objects and colours in images. Experimental results on three real-world datasets show comparable aggregation accuracy when 60% of the workers are spammers, as other state of the art approaches do when there are no spammers.
spellingShingle Augustin, A
Venanzi, M
Hare, J
Rogers, A
Jennings, N
Bayesian aggregation of categorical distributions with applications in crowdsourcing
title Bayesian aggregation of categorical distributions with applications in crowdsourcing
title_full Bayesian aggregation of categorical distributions with applications in crowdsourcing
title_fullStr Bayesian aggregation of categorical distributions with applications in crowdsourcing
title_full_unstemmed Bayesian aggregation of categorical distributions with applications in crowdsourcing
title_short Bayesian aggregation of categorical distributions with applications in crowdsourcing
title_sort bayesian aggregation of categorical distributions with applications in crowdsourcing
work_keys_str_mv AT augustina bayesianaggregationofcategoricaldistributionswithapplicationsincrowdsourcing
AT venanzim bayesianaggregationofcategoricaldistributionswithapplicationsincrowdsourcing
AT harej bayesianaggregationofcategoricaldistributionswithapplicationsincrowdsourcing
AT rogersa bayesianaggregationofcategoricaldistributionswithapplicationsincrowdsourcing
AT jenningsn bayesianaggregationofcategoricaldistributionswithapplicationsincrowdsourcing