Classification of Indian media titles using deep learning techniques
Automatic speech recognition is being used everywhere these days. An essential part of this is language identification. Our goal here is to identify the language of the media title, such as song names and movie titles, to help in speech recognition. The focus here is to classify solely using the tit...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
KeAi Communications Co., Ltd.
2022-06-01
|
Series: | International Journal of Cognitive Computing in Engineering |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666307422000109 |
_version_ | 1797959110141935616 |
---|---|
author | Sujit Kumar Devesh D Rajesh Sarthak Pranesh V N Hemanth Kollipara Gopal Kumar Agrawal M Anbarasi Valarmathi J |
author_facet | Sujit Kumar Devesh D Rajesh Sarthak Pranesh V N Hemanth Kollipara Gopal Kumar Agrawal M Anbarasi Valarmathi J |
author_sort | Sujit Kumar |
collection | DOAJ |
description | Automatic speech recognition is being used everywhere these days. An essential part of this is language identification. Our goal here is to identify the language of the media title, such as song names and movie titles, to help in speech recognition. The focus here is to classify solely using the title of the media without any additional data in their transliterated form to classify them into their original native language using natural language processing, machine learning, and deep learning techniques. Transliterated titles of the song and movie names are being used. This work explores and implements various natural language processing and machine learning methods such as N-grams, SVMs, LSTMs, and MuRIL to classify the text titles according to their language. The results of various implementations are compared and contrasted as an approach of its own to classify the data. |
first_indexed | 2024-04-11T00:29:05Z |
format | Article |
id | doaj.art-0f9a35e2b2534347a1cce22b6858f656 |
institution | Directory Open Access Journal |
issn | 2666-3074 |
language | English |
last_indexed | 2024-04-11T00:29:05Z |
publishDate | 2022-06-01 |
publisher | KeAi Communications Co., Ltd. |
record_format | Article |
series | International Journal of Cognitive Computing in Engineering |
spelling | doaj.art-0f9a35e2b2534347a1cce22b6858f6562023-01-08T04:15:01ZengKeAi Communications Co., Ltd.International Journal of Cognitive Computing in Engineering2666-30742022-06-013114123Classification of Indian media titles using deep learning techniquesSujit Kumar0Devesh D Rajesh1Sarthak Pranesh2V N Hemanth Kollipara3Gopal Kumar Agrawal4M Anbarasi5Valarmathi J6Vellore Institute of Technology, Vellore, IndiaVellore Institute of Technology, Vellore, IndiaVellore Institute of Technology, Vellore, IndiaVellore Institute of Technology, Vellore, IndiaArchitect at Samsung R&D Institute, Bangalore, IndiaVellore Institute of Technology, Vellore, India; Corresponding author.Vellore Institute of Technology, Vellore, IndiaAutomatic speech recognition is being used everywhere these days. An essential part of this is language identification. Our goal here is to identify the language of the media title, such as song names and movie titles, to help in speech recognition. The focus here is to classify solely using the title of the media without any additional data in their transliterated form to classify them into their original native language using natural language processing, machine learning, and deep learning techniques. Transliterated titles of the song and movie names are being used. This work explores and implements various natural language processing and machine learning methods such as N-grams, SVMs, LSTMs, and MuRIL to classify the text titles according to their language. The results of various implementations are compared and contrasted as an approach of its own to classify the data.http://www.sciencedirect.com/science/article/pii/S2666307422000109N-GramsSVMLSTMMuRILLanguage classificationIndian media titles |
spellingShingle | Sujit Kumar Devesh D Rajesh Sarthak Pranesh V N Hemanth Kollipara Gopal Kumar Agrawal M Anbarasi Valarmathi J Classification of Indian media titles using deep learning techniques International Journal of Cognitive Computing in Engineering N-Grams SVM LSTM MuRIL Language classification Indian media titles |
title | Classification of Indian media titles using deep learning techniques |
title_full | Classification of Indian media titles using deep learning techniques |
title_fullStr | Classification of Indian media titles using deep learning techniques |
title_full_unstemmed | Classification of Indian media titles using deep learning techniques |
title_short | Classification of Indian media titles using deep learning techniques |
title_sort | classification of indian media titles using deep learning techniques |
topic | N-Grams SVM LSTM MuRIL Language classification Indian media titles |
url | http://www.sciencedirect.com/science/article/pii/S2666307422000109 |
work_keys_str_mv | AT sujitkumar classificationofindianmediatitlesusingdeeplearningtechniques AT deveshdrajesh classificationofindianmediatitlesusingdeeplearningtechniques AT sarthakpranesh classificationofindianmediatitlesusingdeeplearningtechniques AT vnhemanthkollipara classificationofindianmediatitlesusingdeeplearningtechniques AT gopalkumaragrawal classificationofindianmediatitlesusingdeeplearningtechniques AT manbarasi classificationofindianmediatitlesusingdeeplearningtechniques AT valarmathij classificationofindianmediatitlesusingdeeplearningtechniques |