Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study

Abstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validit...

Full description

Bibliographic Details
Main Authors:	Mahmood BijanKhan, Parvaneh ShayesteFar, Hassan Mohebbi
Format:	Article
Language:	English
Published:	SpringerOpen 2023-02-01
Series:	Language Testing in Asia
Subjects:	Peykare Corpus database resource Persian as a foreign language (PFL) Content typicality Corpus-informed tests Persian language proficiency test (PLPT)
Online Access:	https://doi.org/10.1186/s40468-023-00217-5

_version_	1797863885864173568
author	Mahmood BijanKhan Parvaneh ShayesteFar Hassan Mohebbi
author_facet	Mahmood BijanKhan Parvaneh ShayesteFar Hassan Mohebbi
author_sort	Mahmood BijanKhan
collection	DOAJ
description	Abstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validity of a Persian language proficiency test developed for certification of proficiency in Persian as a foreign language (PFL) of non-native speakers. Designed at the Research Center for Intelligent Signal Processing (RCISP), Peykare contains 35,058 text files over five linguistic varieties and 24 different registers of contemporary Persian. This study addresses how corpora, as rich database resources, can practically be applied to test validation purposes and insightfully inform the test life cycle. The results of content validity phase revealed evidence supporting content representativeness, relevance, and typicality of the test. The linkage between the corpus-extracted criterial features or parameters and those covered by the test was not, however, strongly evidenced by items measuring ezafeh constructions, homographs/homophones, PRO (proposition), and POST (postposition). The analysis of content typicality indicated chunks that did not closely conform to the corpus typical output. The construct validity phase, assessing the test hypothesized factor structure (i.e., hierarchical, unitary, correlated, and uncorrelated models) in two randomly split samples of PFL learners from Asian and European countries (N=121), showed that the correlated model fit the data best in both samples. The results supported the presence of distinctive factors of receptive skills, providing empirical evidence for score interpretations of the corpus-based test.
first_indexed	2024-04-09T22:42:51Z
format	Article
id	doaj.art-9ea99248b0724df5aece1e5828b61371
institution	Directory Open Access Journal
issn	2229-0443
language	English
last_indexed	2024-04-09T22:42:51Z
publishDate	2023-02-01
publisher	SpringerOpen
record_format	Article
series	Language Testing in Asia
spelling	doaj.art-9ea99248b0724df5aece1e5828b613712023-03-22T12:02:42ZengSpringerOpenLanguage Testing in Asia2229-04432023-02-0113112610.1186/s40468-023-00217-5Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed studyMahmood BijanKhan0Parvaneh ShayesteFar1Hassan MohebbiDepartment of Linguistics, Faculty of Literature and Humanities, University of TehranDepartment of English Language Teaching, Farhangian Teacher Education UniversityAbstract Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted Peykare, a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validity of a Persian language proficiency test developed for certification of proficiency in Persian as a foreign language (PFL) of non-native speakers. Designed at the Research Center for Intelligent Signal Processing (RCISP), Peykare contains 35,058 text files over five linguistic varieties and 24 different registers of contemporary Persian. This study addresses how corpora, as rich database resources, can practically be applied to test validation purposes and insightfully inform the test life cycle. The results of content validity phase revealed evidence supporting content representativeness, relevance, and typicality of the test. The linkage between the corpus-extracted criterial features or parameters and those covered by the test was not, however, strongly evidenced by items measuring ezafeh constructions, homographs/homophones, PRO (proposition), and POST (postposition). The analysis of content typicality indicated chunks that did not closely conform to the corpus typical output. The construct validity phase, assessing the test hypothesized factor structure (i.e., hierarchical, unitary, correlated, and uncorrelated models) in two randomly split samples of PFL learners from Asian and European countries (N=121), showed that the correlated model fit the data best in both samples. The results supported the presence of distinctive factors of receptive skills, providing empirical evidence for score interpretations of the corpus-based test.https://doi.org/10.1186/s40468-023-00217-5PeykareCorpus database resourcePersian as a foreign language (PFL)Content typicalityCorpus-informed testsPersian language proficiency test (PLPT)
spellingShingle	Mahmood BijanKhan Parvaneh ShayesteFar Hassan Mohebbi Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study Language Testing in Asia Peykare Corpus database resource Persian as a foreign language (PFL) Content typicality Corpus-informed tests Persian language proficiency test (PLPT)
title	Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
title_full	Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
title_fullStr	Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
title_full_unstemmed	Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
title_short	Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study
title_sort	assessing the content typicality and construct of persian language proficiency test plpt for non persian speakers a corpus informed study
topic	Peykare Corpus database resource Persian as a foreign language (PFL) Content typicality Corpus-informed tests Persian language proficiency test (PLPT)
url	https://doi.org/10.1186/s40468-023-00217-5
work_keys_str_mv	AT mahmoodbijankhan assessingthecontenttypicalityandconstructofpersianlanguageproficiencytestplptfornonpersianspeakersacorpusinformedstudy AT parvanehshayestefar assessingthecontenttypicalityandconstructofpersianlanguageproficiencytestplptfornonpersianspeakersacorpusinformedstudy AT hassanmohebbi assessingthecontenttypicalityandconstructofpersianlanguageproficiencytestplptfornonpersianspeakersacorpusinformedstudy

Assessing the content typicality and construct of Persian language proficiency test (PLPT) for non-Persian speakers: a corpus-informed study

Similar Items