House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1
House Building Tips is the title of a classic text containing historical information on early house construction in Malay communities. These tips were written by a scholar with knowledge of house construction through observation of the surrounding environment. In Malaysia, written sources or records...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2021-06-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340921002973 |
_version_ | 1819240626689933312 |
---|---|
author | Muhamad Fadzllah Zaini Anida Sarudin Mazura Mastura Muhammad Zulkifli Osman Husna Faredza Mohamed Redzwan Muhammad Anas Al-Muhsin |
author_facet | Muhamad Fadzllah Zaini Anida Sarudin Mazura Mastura Muhammad Zulkifli Osman Husna Faredza Mohamed Redzwan Muhammad Anas Al-Muhsin |
author_sort | Muhamad Fadzllah Zaini |
collection | DOAJ |
description | House Building Tips is the title of a classic text containing historical information on early house construction in Malay communities. These tips were written by a scholar with knowledge of house construction through observation of the surrounding environment. In Malaysia, written sources or records of house construction are scarce and underexposed. As such, this research was conducted to guarantee the written legacy of the construction of Malay houses. The purpose of this paper is to introduce a statistical data source of house building tips that is laden with Malay ingenuity and identity. The wordlists generated from this study can become a source of reference for the field of Malay architecture. Accordingly, this study utilises the quantitative method by applying the Linguistic Corpus Statistical Approach; these data utilise specific corpus development procedures, beginning with text collection, scanning and cleaning processes, text annotation, and data storing in plain text. Next, the data analysis procedure utilises a corpus software, LancsBox, to generate specialised wordlists. The bubble graphs are developed based on these wordlists through the Tableau software, and illustrate the most used lexical items with the raw and relative frequency values. This facilitates searches for, and the reading of, architectural words and architectural word references. These data represent written sources that need to be preserved and become points of reference concerning Malay architectural ingenuity and identity. |
first_indexed | 2024-12-23T14:11:01Z |
format | Article |
id | doaj.art-97374043ee96471893f5b6a8ee5438d3 |
institution | Directory Open Access Journal |
issn | 2352-3409 |
language | English |
last_indexed | 2024-12-23T14:11:01Z |
publishDate | 2021-06-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj.art-97374043ee96471893f5b6a8ee5438d32022-12-21T17:44:03ZengElsevierData in Brief2352-34092021-06-0136107013House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1Muhamad Fadzllah Zaini0Anida Sarudin1Mazura Mastura Muhammad2Zulkifli Osman3Husna Faredza Mohamed Redzwan4Muhammad Anas Al-Muhsin5Department of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of English Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Modern Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaHouse Building Tips is the title of a classic text containing historical information on early house construction in Malay communities. These tips were written by a scholar with knowledge of house construction through observation of the surrounding environment. In Malaysia, written sources or records of house construction are scarce and underexposed. As such, this research was conducted to guarantee the written legacy of the construction of Malay houses. The purpose of this paper is to introduce a statistical data source of house building tips that is laden with Malay ingenuity and identity. The wordlists generated from this study can become a source of reference for the field of Malay architecture. Accordingly, this study utilises the quantitative method by applying the Linguistic Corpus Statistical Approach; these data utilise specific corpus development procedures, beginning with text collection, scanning and cleaning processes, text annotation, and data storing in plain text. Next, the data analysis procedure utilises a corpus software, LancsBox, to generate specialised wordlists. The bubble graphs are developed based on these wordlists through the Tableau software, and illustrate the most used lexical items with the raw and relative frequency values. This facilitates searches for, and the reading of, architectural words and architectural word references. These data represent written sources that need to be preserved and become points of reference concerning Malay architectural ingenuity and identity.http://www.sciencedirect.com/science/article/pii/S2352340921002973House building tipsManuscriptsCorpus linguisticsMalay architectureMalay classical lexical trendsMalay identity |
spellingShingle | Muhamad Fadzllah Zaini Anida Sarudin Mazura Mastura Muhammad Zulkifli Osman Husna Faredza Mohamed Redzwan Muhammad Anas Al-Muhsin House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1 Data in Brief House building tips Manuscripts Corpus linguistics Malay architecture Malay classical lexical trends Malay identity |
title | House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1 |
title_full | House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1 |
title_fullStr | House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1 |
title_full_unstemmed | House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1 |
title_short | House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1 |
title_sort | house building tips hbt corpus dataset as a resource to discover malay architectural ingenuity and identity1 |
topic | House building tips Manuscripts Corpus linguistics Malay architecture Malay classical lexical trends Malay identity |
url | http://www.sciencedirect.com/science/article/pii/S2352340921002973 |
work_keys_str_mv | AT muhamadfadzllahzaini housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1 AT anidasarudin housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1 AT mazuramasturamuhammad housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1 AT zulkifliosman housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1 AT husnafaredzamohamedredzwan housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1 AT muhammadanasalmuhsin housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1 |