Classification of Indian media titles using deep learning techniques

Automatic speech recognition is now widely used, and language identification is an essential part of it. Our goal is to identify the language of a media title, such as a song name or movie title, to support speech recognition. The focus here is to classify solely using the tit...


Bibliographic Details
Main Authors: Sujit Kumar, Devesh D Rajesh, Sarthak Pranesh, V N Hemanth Kollipara, Gopal Kumar Agrawal, M Anbarasi, Valarmathi J
Format: Article
Language: English
Published: KeAi Communications Co., Ltd. 2022-06-01
Series: International Journal of Cognitive Computing in Engineering
Subjects: N-Grams; SVM; LSTM; MuRIL; Language classification; Indian media titles
Online Access: http://www.sciencedirect.com/science/article/pii/S2666307422000109
author Sujit Kumar
Devesh D Rajesh
Sarthak Pranesh
V N Hemanth Kollipara
Gopal Kumar Agrawal
M Anbarasi
Valarmathi J
collection DOAJ
description Automatic speech recognition is now widely used, and language identification is an essential part of it. Our goal is to identify the language of a media title, such as a song name or movie title, to support speech recognition. The focus is on classifying titles into their original native language using only the transliterated title, without any additional data, by means of natural language processing, machine learning, and deep learning techniques. Transliterated song and movie titles are used as input. This work explores and implements several natural language processing and machine learning methods, including N-grams, SVMs, LSTMs, and MuRIL, to classify the titles by language. The results of the implementations are compared and contrasted, each treated as a separate approach to classifying the data.
first_indexed 2024-04-11T00:29:05Z
format Article
id doaj.art-0f9a35e2b2534347a1cce22b6858f656
institution Directory of Open Access Journals
issn 2666-3074
language English
last_indexed 2024-04-11T00:29:05Z
publishDate 2022-06-01
publisher KeAi Communications Co., Ltd.
record_format Article
series International Journal of Cognitive Computing in Engineering
spelling doaj.art-0f9a35e2b2534347a1cce22b6858f656
2023-01-08T04:15:01Z
eng
KeAi Communications Co., Ltd.
International Journal of Cognitive Computing in Engineering
2666-3074
2022-06-01
Volume 3, pages 114-123
Classification of Indian media titles using deep learning techniques
Sujit Kumar (Vellore Institute of Technology, Vellore, India)
Devesh D Rajesh (Vellore Institute of Technology, Vellore, India)
Sarthak Pranesh (Vellore Institute of Technology, Vellore, India)
V N Hemanth Kollipara (Vellore Institute of Technology, Vellore, India)
Gopal Kumar Agrawal (Architect at Samsung R&D Institute, Bangalore, India)
M Anbarasi (Vellore Institute of Technology, Vellore, India; corresponding author)
Valarmathi J (Vellore Institute of Technology, Vellore, India)
http://www.sciencedirect.com/science/article/pii/S2666307422000109
N-Grams
SVM
LSTM
MuRIL
Language classification
Indian media titles
title Classification of Indian media titles using deep learning techniques
topic N-Grams
SVM
LSTM
MuRIL
Language classification
Indian media titles
url http://www.sciencedirect.com/science/article/pii/S2666307422000109