Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine

The recognition of the poetry meter in spoken lines is a natural language processing application that aims to identify a stressed and unstressed syllabic pattern in a line of a poem. Stateof-the-art studies include few works on the automatic recognition of Arud meters, all of which are text-based mo...

Full description

Bibliographic Details
Main Author: Abdulbasit K. Al-Talabani
Format: Article
Language:English
Published: Koya University 2020-04-01
Series:ARO-The Scientific Journal of Koya University
Subjects:
Online Access:https://aro.koyauniversity.org/index.php/aro/article/view/631
_version_ 1797724194076622848
author Abdulbasit K. Al-Talabani
author_facet Abdulbasit K. Al-Talabani
author_sort Abdulbasit K. Al-Talabani
collection DOAJ
description The recognition of the poetry meter in spoken lines is a natural language processing application that aims to identify a stressed and unstressed syllabic pattern in a line of a poem. Stateof-the-art studies include few works on the automatic recognition of Arud meters, all of which are text-based models, and none is voice based. Poetry meter recognition is not easy for an ordinary reader, it is very difficult for the listener and it is usually performed manually by experts. This paper proposes a model to detect the poetry meter from a single spoken line (“Bayt”) of an Arabic poem. Data of 230 samples collected from 10 poems of Arabic poetry, including three meters read by two speakers, are used in this work. The work adopts the extraction of linear prediction cepstrum coefficient and Mel frequency cepstral coefficient (MFCC) features, as a time series input to the proposed long short-term memory (LSTM) classifier, in addition to a global feature set that is computed using some statistics of the features across all of the frames to feed the support vector machine (SVM) classifier. The results show that the SVM model achieves the highest accuracy in the speakerdependent approach. It improves results by 3%, as compared to the state-of-the-art studies, whereas for the speaker-independent approach, the MFCC feature using LSTM exceeds the other proposed models.
first_indexed 2024-03-12T10:13:56Z
format Article
id doaj.art-92bc0e6d61f14c068c5665f002bece74
institution Directory Open Access Journal
issn 2410-9355
2307-549X
language English
last_indexed 2024-03-12T10:13:56Z
publishDate 2020-04-01
publisher Koya University
record_format Article
series ARO-The Scientific Journal of Koya University
spelling doaj.art-92bc0e6d61f14c068c5665f002bece742023-09-02T10:42:30ZengKoya UniversityARO-The Scientific Journal of Koya University2410-93552307-549X2020-04-018110.14500/aro.10631Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector MachineAbdulbasit K. Al-Talabani0Department of Software Engineering, Faculty of Engineering, Koya University, Koya KOY45, Kurdistan RegionThe recognition of the poetry meter in spoken lines is a natural language processing application that aims to identify a stressed and unstressed syllabic pattern in a line of a poem. Stateof-the-art studies include few works on the automatic recognition of Arud meters, all of which are text-based models, and none is voice based. Poetry meter recognition is not easy for an ordinary reader, it is very difficult for the listener and it is usually performed manually by experts. This paper proposes a model to detect the poetry meter from a single spoken line (“Bayt”) of an Arabic poem. Data of 230 samples collected from 10 poems of Arabic poetry, including three meters read by two speakers, are used in this work. The work adopts the extraction of linear prediction cepstrum coefficient and Mel frequency cepstral coefficient (MFCC) features, as a time series input to the proposed long short-term memory (LSTM) classifier, in addition to a global feature set that is computed using some statistics of the features across all of the frames to feed the support vector machine (SVM) classifier. The results show that the SVM model achieves the highest accuracy in the speakerdependent approach. It improves results by 3%, as compared to the state-of-the-art studies, whereas for the speaker-independent approach, the MFCC feature using LSTM exceeds the other proposed models.https://aro.koyauniversity.org/index.php/aro/article/view/631Speech processingLong short-term memorySupport vector machineProsodyCepstral features
spellingShingle Abdulbasit K. Al-Talabani
Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine
ARO-The Scientific Journal of Koya University
Speech processing
Long short-term memory
Support vector machine
Prosody
Cepstral features
title Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine
title_full Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine
title_fullStr Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine
title_full_unstemmed Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine
title_short Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine
title_sort automatic recognition of arabic poetry meter from speech signal using long short term memory and support vector machine
topic Speech processing
Long short-term memory
Support vector machine
Prosody
Cepstral features
url https://aro.koyauniversity.org/index.php/aro/article/view/631
work_keys_str_mv AT abdulbasitkaltalabani automaticrecognitionofarabicpoetrymeterfromspeechsignalusinglongshorttermmemoryandsupportvectormachine