Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
Abstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validit...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2023-02-01
|
Series: | Language Testing in Asia |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40468-023-00217-5 |
_version_ | 1797863885864173568 |
---|---|
author | Mahmood BijanKhan Parvaneh ShayesteFar Hassan Mohebbi |
author_facet | Mahmood BijanKhan Parvaneh ShayesteFar Hassan Mohebbi |
author_sort | Mahmood BijanKhan |
collection | DOAJ |
description | Abstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validity of a Persian language proficiency test developed for certification of proficiency in Persian as a foreign language (PFL) of non-native speakers. Designed at the Research Center for Intelligent Signal Processing (RCISP), Peykare contains 35,058 text files over five linguistic varieties and 24 different registers of contemporary Persian. This study addresses how corpora, as rich database resources, can practically be applied to test validation purposes and insightfully inform the test life cycle. The results of content validity phase revealed evidence supporting content representativeness, relevance, and typicality of the test. The linkage between the corpus-extracted criterial features or parameters and those covered by the test was not, however, strongly evidenced by items measuring ezafeh constructions, homographs/homophones, PRO (proposition), and POST (postposition). The analysis of content typicality indicated chunks that did not closely conform to the corpus typical output. The construct validity phase, assessing the test hypothesized factor structure (i.e., hierarchical, unitary, correlated, and uncorrelated models) in two randomly split samples of PFL learners from Asian and European countries (N=121), showed that the correlated model fit the data best in both samples. The results supported the presence of distinctive factors of receptive skills, providing empirical evidence for score interpretations of the corpus-based test. |
first_indexed | 2024-04-09T22:42:51Z |
format | Article |
id | doaj.art-9ea99248b0724df5aece1e5828b61371 |
institution | Directory Open Access Journal |
issn | 2229-0443 |
language | English |
last_indexed | 2024-04-09T22:42:51Z |
publishDate | 2023-02-01 |
publisher | SpringerOpen |
record_format | Article |
series | Language Testing in Asia |
spelling | doaj.art-9ea99248b0724df5aece1e5828b613712023-03-22T12:02:42ZengSpringerOpenLanguage Testing in Asia2229-04432023-02-0113112610.1186/s40468-023-00217-5Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed studyMahmood BijanKhan0Parvaneh ShayesteFar1Hassan MohebbiDepartment of Linguistics, Faculty of Literature and Humanities, University of TehranDepartment of English Language Teaching, Farhangian Teacher Education UniversityAbstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validity of a Persian language proficiency test developed for certification of proficiency in Persian as a foreign language (PFL) of non-native speakers. Designed at the Research Center for Intelligent Signal Processing (RCISP), Peykare contains 35,058 text files over five linguistic varieties and 24 different registers of contemporary Persian. This study addresses how corpora, as rich database resources, can practically be applied to test validation purposes and insightfully inform the test life cycle. The results of content validity phase revealed evidence supporting content representativeness, relevance, and typicality of the test. The linkage between the corpus-extracted criterial features or parameters and those covered by the test was not, however, strongly evidenced by items measuring ezafeh constructions, homographs/homophones, PRO (proposition), and POST (postposition). The analysis of content typicality indicated chunks that did not closely conform to the corpus typical output. The construct validity phase, assessing the test hypothesized factor structure (i.e., hierarchical, unitary, correlated, and uncorrelated models) in two randomly split samples of PFL learners from Asian and European countries (N=121), showed that the correlated model fit the data best in both samples. The results supported the presence of distinctive factors of receptive skills, providing empirical evidence for score interpretations of the corpus-based test.https://doi.org/10.1186/s40468-023-00217-5PeykareCorpus database resourcePersian as a foreign language (PFL)Content typicalityCorpus-informed testsPersian language proficiency test (PLPT) |
spellingShingle | Mahmood BijanKhan Parvaneh ShayesteFar Hassan Mohebbi Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study Language Testing in Asia Peykare Corpus database resource Persian as a foreign language (PFL) Content typicality Corpus-informed tests Persian language proficiency test (PLPT) |
title | Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study |
title_full | Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study |
title_fullStr | Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study |
title_full_unstemmed | Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study |
title_short | Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study |
title_sort | assessing the content typicality and construct of persian language proficiency test plpt for non persian speakers a corpus informed study |
topic | Peykare Corpus database resource Persian as a foreign language (PFL) Content typicality Corpus-informed tests Persian language proficiency test (PLPT) |
url | https://doi.org/10.1186/s40468-023-00217-5 |
work_keys_str_mv | AT mahmoodbijankhan assessingthecontenttypicalityandconstructofpersianlanguageproficiencytestplptfornonpersianspeakersacorpusinformedstudy AT parvanehshayestefar assessingthecontenttypicalityandconstructofpersianlanguageproficiencytestplptfornonpersianspeakersacorpusinformedstudy AT hassanmohebbi assessingthecontenttypicalityandconstructofpersianlanguageproficiencytestplptfornonpersianspeakersacorpusinformedstudy |