A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process

In Thin-Film Transistor Liquid-Crystal Display (TFT-LCD) manufacturing, conducting a machine learning based system with multiple data types has become actively desired to solve complicated problems. This paper proposes a multi-modal learning approach: <italic>TabVisionNet</italic>, which...

Full description

Bibliographic Details
Main Authors:	Yi Liu, Hsueh-Ping Lu, Ching-Hao Lai
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Smart manufacturing TFT-LCD deep learning multi-modality machine learning tabular data image data and attention mechanism
Online Access:	https://ieeexplore.ieee.org/document/9733354/

_version_	1828420323964354560
author	Yi Liu Hsueh-Ping Lu Ching-Hao Lai
author_facet	Yi Liu Hsueh-Ping Lu Ching-Hao Lai
author_sort	Yi Liu
collection	DOAJ
description	In Thin-Film Transistor Liquid-Crystal Display (TFT-LCD) manufacturing, conducting a machine learning based system with multiple data types has become actively desired to solve complicated problems. This paper proposes a multi-modal learning approach: <italic>TabVisionNet</italic>, which is modeled by utilizing the information from both tabular data and image data. A novel attention mechanism called <italic>Sequential Decision Attention</italic> was integrated into the multi-modal modeling framework that improves the comprehension of the information from two modalities. This cross-modal attention mechanism can capture the complex relationship between modalities then gain better generalization and faster convergence in the training process. Conducting an experiment, the performance of our novel approach was significantly better than single-modal and other multi-modal learning approaches in our real case scenario.
first_indexed	2024-12-10T15:11:31Z
format	Article
id	doaj.art-9655db87f2e14edcbc8f5fbd84c711b6
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-10T15:11:31Z
publishDate	2022-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-9655db87f2e14edcbc8f5fbd84c711b62022-12-22T01:43:56ZengIEEEIEEE Access2169-35362022-01-0110330263303610.1109/ACCESS.2022.31589529733354A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair ProcessYi Liu0https://orcid.org/0000-0002-0533-8233Hsueh-Ping Lu1https://orcid.org/0000-0003-0839-9223Ching-Hao Lai2https://orcid.org/0000-0001-9488-1598Department of Data Science Analysis, Division of Digital Technology, AU Optronics Corporation, Taichung, TaiwanDivision of Digital Technology, AU Optronics Corporation, Hsinchu, TaiwanDepartment of Data Science Analysis, Division of Digital Technology, AU Optronics Corporation, Taichung, TaiwanIn Thin-Film Transistor Liquid-Crystal Display (TFT-LCD) manufacturing, conducting a machine learning based system with multiple data types has become actively desired to solve complicated problems. This paper proposes a multi-modal learning approach: <italic>TabVisionNet</italic>, which is modeled by utilizing the information from both tabular data and image data. A novel attention mechanism called <italic>Sequential Decision Attention</italic> was integrated into the multi-modal modeling framework that improves the comprehension of the information from two modalities. This cross-modal attention mechanism can capture the complex relationship between modalities then gain better generalization and faster convergence in the training process. Conducting an experiment, the performance of our novel approach was significantly better than single-modal and other multi-modal learning approaches in our real case scenario.https://ieeexplore.ieee.org/document/9733354/Smart manufacturingTFT-LCDdeep learningmulti-modality machine learningtabular dataimage data and attention mechanism
spellingShingle	Yi Liu Hsueh-Ping Lu Ching-Hao Lai A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process IEEE Access Smart manufacturing TFT-LCD deep learning multi-modality machine learning tabular data image data and attention mechanism
title	A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process
title_full	A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process
title_fullStr	A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process
title_full_unstemmed	A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process
title_short	A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process
title_sort	novel attention based multi modal modeling technique on mixed type data for improving tft lcd repair process
topic	Smart manufacturing TFT-LCD deep learning multi-modality machine learning tabular data image data and attention mechanism
url	https://ieeexplore.ieee.org/document/9733354/
work_keys_str_mv	AT yiliu anovelattentionbasedmultimodalmodelingtechniqueonmixedtypedataforimprovingtftlcdrepairprocess AT hsuehpinglu anovelattentionbasedmultimodalmodelingtechniqueonmixedtypedataforimprovingtftlcdrepairprocess AT chinghaolai anovelattentionbasedmultimodalmodelingtechniqueonmixedtypedataforimprovingtftlcdrepairprocess AT yiliu novelattentionbasedmultimodalmodelingtechniqueonmixedtypedataforimprovingtftlcdrepairprocess AT hsuehpinglu novelattentionbasedmultimodalmodelingtechniqueonmixedtypedataforimprovingtftlcdrepairprocess AT chinghaolai novelattentionbasedmultimodalmodelingtechniqueonmixedtypedataforimprovingtftlcdrepairprocess

A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process

Similar Items