Intra-Class Mixup for Out-of-Distribution Detection

Deep neural networks (DNNs) have found widespread adoption in solving image recognition and natural language processing tasks. However, they make confident mispredictions when presented with data that does not belong to the training distribution, i.e. out-of-distribution (OoD) samples. Research has...

Full description

Bibliographic Details
Main Authors:	Deepak Ravikumar, Sangamesh Kodge, Isha Garg, Kaushik Roy
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Deep learning deep neural nets mixup intra-class mixup out-of-distribution detection angular margin
Online Access:	https://ieeexplore.ieee.org/document/10064300/

_version_	1827986587163557888
author	Deepak Ravikumar Sangamesh Kodge Isha Garg Kaushik Roy
author_facet	Deepak Ravikumar Sangamesh Kodge Isha Garg Kaushik Roy
author_sort	Deepak Ravikumar
collection	DOAJ
description	Deep neural networks (DNNs) have found widespread adoption in solving image recognition and natural language processing tasks. However, they make confident mispredictions when presented with data that does not belong to the training distribution, i.e. out-of-distribution (OoD) samples. Research has shown that angular representations can be useful to address the curse of dimensionality and improve OoD detection performance. However, when evaluating the angular separability using Fisher’s criterion we find that empirical risk minimization and inter-class mixup trained DNNs have low angular separability between in-distribution data and OoD data. To improve angular separability, we propose intra-class mixup. We provide mathematical reasoning that shows that intra-class mixup results in reduced angular spread because of reduced variance at the input during training. Further, to take full advantage of improved angular separability from intra-class mixup we propose supplementing the separation metric with the cosine of angular margin to improve OoD detection. Angular margin is the angle between the final layer weight vector and the sample representation. The proposed intra-class mixup when applied to various existing OoD detection techniques shows an improvement of 4.21% and 6.21% in AUROC performance over empirical risk minimization and inter-class mixup, respectively. Further, intra-class mixup aided with the cosine of angular margin improves AUROC performance by 6.71% and 8.75% over empirical risk minimization and inter-class mixup, respectively.
first_indexed	2024-04-09T23:33:44Z
format	Article
id	doaj.art-286cd5c4316f4bcf8f72a6fad313fe74
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-04-09T23:33:44Z
publishDate	2023-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-286cd5c4316f4bcf8f72a6fad313fe742023-03-20T23:00:14ZengIEEEIEEE Access2169-35362023-01-0111259682598110.1109/ACCESS.2023.325492010064300Intra-Class Mixup for Out-of-Distribution DetectionDeepak Ravikumar0https://orcid.org/0000-0001-6736-3250Sangamesh Kodge1https://orcid.org/0000-0001-9713-5400Isha Garg2https://orcid.org/0000-0003-4702-9444Kaushik Roy3https://orcid.org/0000-0002-0735-9695School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USASchool of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USASchool of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USASchool of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USADeep neural networks (DNNs) have found widespread adoption in solving image recognition and natural language processing tasks. However, they make confident mispredictions when presented with data that does not belong to the training distribution, i.e. out-of-distribution (OoD) samples. Research has shown that angular representations can be useful to address the curse of dimensionality and improve OoD detection performance. However, when evaluating the angular separability using Fisher’s criterion we find that empirical risk minimization and inter-class mixup trained DNNs have low angular separability between in-distribution data and OoD data. To improve angular separability, we propose intra-class mixup. We provide mathematical reasoning that shows that intra-class mixup results in reduced angular spread because of reduced variance at the input during training. Further, to take full advantage of improved angular separability from intra-class mixup we propose supplementing the separation metric with the cosine of angular margin to improve OoD detection. Angular margin is the angle between the final layer weight vector and the sample representation. The proposed intra-class mixup when applied to various existing OoD detection techniques shows an improvement of 4.21% and 6.21% in AUROC performance over empirical risk minimization and inter-class mixup, respectively. Further, intra-class mixup aided with the cosine of angular margin improves AUROC performance by 6.71% and 8.75% over empirical risk minimization and inter-class mixup, respectively.https://ieeexplore.ieee.org/document/10064300/Deep learningdeep neural netsmixupintra-class mixupout-of-distribution detectionangular margin
spellingShingle	Deepak Ravikumar Sangamesh Kodge Isha Garg Kaushik Roy Intra-Class Mixup for Out-of-Distribution Detection IEEE Access Deep learning deep neural nets mixup intra-class mixup out-of-distribution detection angular margin
title	Intra-Class Mixup for Out-of-Distribution Detection
title_full	Intra-Class Mixup for Out-of-Distribution Detection
title_fullStr	Intra-Class Mixup for Out-of-Distribution Detection
title_full_unstemmed	Intra-Class Mixup for Out-of-Distribution Detection
title_short	Intra-Class Mixup for Out-of-Distribution Detection
title_sort	intra class mixup for out of distribution detection
topic	Deep learning deep neural nets mixup intra-class mixup out-of-distribution detection angular margin
url	https://ieeexplore.ieee.org/document/10064300/
work_keys_str_mv	AT deepakravikumar intraclassmixupforoutofdistributiondetection AT sangameshkodge intraclassmixupforoutofdistributiondetection AT ishagarg intraclassmixupforoutofdistributiondetection AT kaushikroy intraclassmixupforoutofdistributiondetection

Intra-Class Mixup for Out-of-Distribution Detection

Similar Items