House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1

House Building Tips is the title of a classic text containing historical information on early house construction in Malay communities. These tips were written by a scholar with knowledge of house construction through observation of the surrounding environment. In Malaysia, written sources or records...

Full description

Bibliographic Details
Main Authors: Muhamad Fadzllah Zaini, Anida Sarudin, Mazura Mastura Muhammad, Zulkifli Osman, Husna Faredza Mohamed Redzwan, Muhammad Anas Al-Muhsin
Format: Article
Language:English
Published: Elsevier 2021-06-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340921002973
_version_ 1819240626689933312
author Muhamad Fadzllah Zaini
Anida Sarudin
Mazura Mastura Muhammad
Zulkifli Osman
Husna Faredza Mohamed Redzwan
Muhammad Anas Al-Muhsin
author_facet Muhamad Fadzllah Zaini
Anida Sarudin
Mazura Mastura Muhammad
Zulkifli Osman
Husna Faredza Mohamed Redzwan
Muhammad Anas Al-Muhsin
author_sort Muhamad Fadzllah Zaini
collection DOAJ
description House Building Tips is the title of a classic text containing historical information on early house construction in Malay communities. These tips were written by a scholar with knowledge of house construction through observation of the surrounding environment. In Malaysia, written sources or records of house construction are scarce and underexposed. As such, this research was conducted to guarantee the written legacy of the construction of Malay houses. The purpose of this paper is to introduce a statistical data source of house building tips that is laden with Malay ingenuity and identity. The wordlists generated from this study can become a source of reference for the field of Malay architecture. Accordingly, this study utilises the quantitative method by applying the Linguistic Corpus Statistical Approach; these data utilise specific corpus development procedures, beginning with text collection, scanning and cleaning processes, text annotation, and data storing in plain text. Next, the data analysis procedure utilises a corpus software, LancsBox, to generate specialised wordlists. The bubble graphs are developed based on these wordlists through the Tableau software, and illustrate the most used lexical items with the raw and relative frequency values. This facilitates searches for, and the reading of, architectural words and architectural word references. These data represent written sources that need to be preserved and become points of reference concerning Malay architectural ingenuity and identity.
first_indexed 2024-12-23T14:11:01Z
format Article
id doaj.art-97374043ee96471893f5b6a8ee5438d3
institution Directory Open Access Journal
issn 2352-3409
language English
last_indexed 2024-12-23T14:11:01Z
publishDate 2021-06-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj.art-97374043ee96471893f5b6a8ee5438d32022-12-21T17:44:03ZengElsevierData in Brief2352-34092021-06-0136107013House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1Muhamad Fadzllah Zaini0Anida Sarudin1Mazura Mastura Muhammad2Zulkifli Osman3Husna Faredza Mohamed Redzwan4Muhammad Anas Al-Muhsin5Department of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of English Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Malay Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaDepartment of Modern Language and Literature, Faculty of Language and Communication, Sultan Idris Education University, 35900 Tanjong Malim, Perak, MalaysiaHouse Building Tips is the title of a classic text containing historical information on early house construction in Malay communities. These tips were written by a scholar with knowledge of house construction through observation of the surrounding environment. In Malaysia, written sources or records of house construction are scarce and underexposed. As such, this research was conducted to guarantee the written legacy of the construction of Malay houses. The purpose of this paper is to introduce a statistical data source of house building tips that is laden with Malay ingenuity and identity. The wordlists generated from this study can become a source of reference for the field of Malay architecture. Accordingly, this study utilises the quantitative method by applying the Linguistic Corpus Statistical Approach; these data utilise specific corpus development procedures, beginning with text collection, scanning and cleaning processes, text annotation, and data storing in plain text. Next, the data analysis procedure utilises a corpus software, LancsBox, to generate specialised wordlists. The bubble graphs are developed based on these wordlists through the Tableau software, and illustrate the most used lexical items with the raw and relative frequency values. This facilitates searches for, and the reading of, architectural words and architectural word references. These data represent written sources that need to be preserved and become points of reference concerning Malay architectural ingenuity and identity.http://www.sciencedirect.com/science/article/pii/S2352340921002973House building tipsManuscriptsCorpus linguisticsMalay architectureMalay classical lexical trendsMalay identity
spellingShingle Muhamad Fadzllah Zaini
Anida Sarudin
Mazura Mastura Muhammad
Zulkifli Osman
Husna Faredza Mohamed Redzwan
Muhammad Anas Al-Muhsin
House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1
Data in Brief
House building tips
Manuscripts
Corpus linguistics
Malay architecture
Malay classical lexical trends
Malay identity
title House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1
title_full House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1
title_fullStr House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1
title_full_unstemmed House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1
title_short House building tips (HBT) corpus dataset as a resource to discover Malay architectural ingenuity and identity1
title_sort house building tips hbt corpus dataset as a resource to discover malay architectural ingenuity and identity1
topic House building tips
Manuscripts
Corpus linguistics
Malay architecture
Malay classical lexical trends
Malay identity
url http://www.sciencedirect.com/science/article/pii/S2352340921002973
work_keys_str_mv AT muhamadfadzllahzaini housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1
AT anidasarudin housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1
AT mazuramasturamuhammad housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1
AT zulkifliosman housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1
AT husnafaredzamohamedredzwan housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1
AT muhammadanasalmuhsin housebuildingtipshbtcorpusdatasetasaresourcetodiscovermalayarchitecturalingenuityandidentity1