Imprecise Deep Forest for Partial Label Learning
In partial label (PL) learning, each instance corresponds to a set of candidate labels, among which only one is valid. The objective of PL learning is to obtain a multi-class classifier from the training instances. Because the true label of a PL training instance is hidden in the candidate label set...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9284528/ |
_version_ | 1819276417733492736 |
---|---|
author | Jie Gao Weiping Lin Kunhong Liu Qingqi Hong Guangyi Lin Beizhan Wang |
author_facet | Jie Gao Weiping Lin Kunhong Liu Qingqi Hong Guangyi Lin Beizhan Wang |
author_sort | Jie Gao |
collection | DOAJ |
description | In partial label (PL) learning, each instance corresponds to a set of candidate labels, among which only one is valid. The objective of PL learning is to obtain a multi-class classifier from the training instances. Because the true label of a PL training instance is hidden in the candidate label set and inaccessible to the learning algorithm, the training process of the classifier is significantly challenging. This study proposes a novel deep learning method for PL learning based on the improved error-correcting output codes (ECOC) algorithm and deep forest (DF) framework. For the ECOC algorithm, we extract the prior knowledge of the candidate label sets from the PL training set to optimize the generation of its coding matrix, where different binary training sets can be derived from the PL training set based on the dichotomy corresponding to each column code. For the DF framework, this improved ECOC algorithm is embedded as a unit in its cascade structure; moreover, an imprecise evaluation method is designed to determine the growth of the cascade of the DF. The effectiveness of the proposed method is verified by conducting several experiments on artificial and real-world PL datasets. |
first_indexed | 2024-12-23T23:39:54Z |
format | Article |
id | doaj.art-b434bf36dba948019ca4608bcab2d288 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-23T23:39:54Z |
publishDate | 2020-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-b434bf36dba948019ca4608bcab2d2882022-12-21T17:25:42ZengIEEEIEEE Access2169-35362020-01-01821853021854110.1109/ACCESS.2020.30428389284528Imprecise Deep Forest for Partial Label LearningJie Gao0https://orcid.org/0000-0002-6580-0581Weiping Lin1Kunhong Liu2https://orcid.org/0000-0002-1222-8876Qingqi Hong3https://orcid.org/0000-0002-9996-6870Guangyi Lin4Beizhan Wang5https://orcid.org/0000-0002-2846-5411School of Informatics, Xiamen University, Xiamen, ChinaSchool of Informatics, Xiamen University, Xiamen, ChinaSchool of Informatics, Xiamen University, Xiamen, ChinaSchool of Informatics, Xiamen University, Xiamen, ChinaSchool of Informatics, Xiamen University, Xiamen, ChinaSchool of Informatics, Xiamen University, Xiamen, ChinaIn partial label (PL) learning, each instance corresponds to a set of candidate labels, among which only one is valid. The objective of PL learning is to obtain a multi-class classifier from the training instances. Because the true label of a PL training instance is hidden in the candidate label set and inaccessible to the learning algorithm, the training process of the classifier is significantly challenging. This study proposes a novel deep learning method for PL learning based on the improved error-correcting output codes (ECOC) algorithm and deep forest (DF) framework. For the ECOC algorithm, we extract the prior knowledge of the candidate label sets from the PL training set to optimize the generation of its coding matrix, where different binary training sets can be derived from the PL training set based on the dichotomy corresponding to each column code. For the DF framework, this improved ECOC algorithm is embedded as a unit in its cascade structure; moreover, an imprecise evaluation method is designed to determine the growth of the cascade of the DF. The effectiveness of the proposed method is verified by conducting several experiments on artificial and real-world PL datasets.https://ieeexplore.ieee.org/document/9284528/Deep foresterror-correcting output codespartial label learning |
spellingShingle | Jie Gao Weiping Lin Kunhong Liu Qingqi Hong Guangyi Lin Beizhan Wang Imprecise Deep Forest for Partial Label Learning IEEE Access Deep forest error-correcting output codes partial label learning |
title | Imprecise Deep Forest for Partial Label Learning |
title_full | Imprecise Deep Forest for Partial Label Learning |
title_fullStr | Imprecise Deep Forest for Partial Label Learning |
title_full_unstemmed | Imprecise Deep Forest for Partial Label Learning |
title_short | Imprecise Deep Forest for Partial Label Learning |
title_sort | imprecise deep forest for partial label learning |
topic | Deep forest error-correcting output codes partial label learning |
url | https://ieeexplore.ieee.org/document/9284528/ |
work_keys_str_mv | AT jiegao imprecisedeepforestforpartiallabellearning AT weipinglin imprecisedeepforestforpartiallabellearning AT kunhongliu imprecisedeepforestforpartiallabellearning AT qingqihong imprecisedeepforestforpartiallabellearning AT guangyilin imprecisedeepforestforpartiallabellearning AT beizhanwang imprecisedeepforestforpartiallabellearning |