Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study

Abstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validit...

Bibliographic Details
Main Authors: Mahmood BijanKhan, Parvaneh ShayesteFar, Hassan Mohebbi
Format: Article
Language: English
Published: SpringerOpen 2023-02-01
Series: Language Testing in Asia
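Subjects: Peykare; Corpus database resource; Persian as a foreign language (PFL); Content typicality; Corpus-informed tests; Persian language proficiency test (PLPT)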
Online Access: https://doi.org/10.1186/s40468-023-00217-5
author Mahmood BijanKhan
Parvaneh ShayesteFar
Hassan Mohebbi
author_facet Mahmood BijanKhan
Parvaneh ShayesteFar
Hassan Mohebbi
author_sort Mahmood BijanKhan
collection DOAJ
description Abstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated Persian written language resource, to evaluate the content (i.e., coverage and typicality) and construct validity of a Persian language proficiency test developed to certify non-native speakers' proficiency in Persian as a foreign language (PFL). Designed at the Research Center for Intelligent Signal Processing (RCISP), Peykare contains 35,058 text files spanning five linguistic varieties and 24 registers of contemporary Persian. This study addresses how corpora, as rich database resources, can be applied in practice to test validation and can inform the test life cycle. The results of the content validity phase revealed evidence supporting the content representativeness, relevance, and typicality of the test. The linkage between the corpus-extracted criterial features or parameters and those covered by the test was not, however, strongly evidenced by items measuring ezafeh constructions, homographs/homophones, PRO (proposition), and POST (postposition). The analysis of content typicality identified chunks that did not closely conform to typical corpus output. The construct validity phase, which assessed the test's hypothesized factor structure (i.e., hierarchical, unitary, correlated, and uncorrelated models) in two randomly split samples of PFL learners from Asian and European countries (N=121), showed that the correlated model fit the data best in both samples. The results supported the presence of distinct factors of receptive skills, providing empirical evidence for score interpretations of the corpus-based test.
first_indexed 2024-04-09T22:42:51Z
format Article
id doaj.art-9ea99248b0724df5aece1e5828b61371
institution Directory Open Access Journal
issn 2229-0443
language English
last_indexed 2024-04-09T22:42:51Z
publishDate 2023-02-01
publisher SpringerOpen
record_format Article
series Language Testing in Asia
spelling doaj.art-9ea99248b0724df5aece1e5828b61371 | 2023-03-22T12:02:42Z | eng | SpringerOpen | Language Testing in Asia | 2229-0443 | 2023-02-01 | 131126 | 10.1186/s40468-023-00217-5 | Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study | Mahmood BijanKhan [0]; Parvaneh ShayesteFar [1]; Hassan Mohebbi | [0] Department of Linguistics, Faculty of Literature and Humanities, University of Tehran; [1] Department of English Language Teaching, Farhangian Teacher Education University | https://doi.org/10.1186/s40468-023-00217-5 | Peykare; Corpus database resource; Persian as a foreign language (PFL); Content typicality; Corpus-informed tests; Persian language proficiency test (PLPT)
spellingShingle Mahmood BijanKhan
Parvaneh ShayesteFar
Hassan Mohebbi
Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
Language Testing in Asia
Peykare
Corpus database resource
Persian as a foreign language (PFL)
Content typicality
Corpus-informed tests
Persian language proficiency test (PLPT)
title Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
title_sort assessing the content typicality and construct of persian language proficiency test plpt for non persian speakers a corpus informed study
topic Peykare
Corpus database resource
Persian as a foreign language (PFL)
Content typicality
Corpus-informed tests
Persian language proficiency test (PLPT)
url https://doi.org/10.1186/s40468-023-00217-5