MoReThesisCorpus

The article discusses the ongoing creation of the MoReThesisCorpus, outlining its major characteristics and giving an account of the considerations and issues encountered so far. The corpus, composed of the theses submitted to the University of Modena and Reggio Emilia between 2011 a...


Bibliographic Details
Main Authors: Marina Bondi, Matteo Di Cristofaro
Format: Article
Language: English
Published: Department of Foreign Languages and Literatures at the University of Verona 2023-06-01
Series: Iperstoria
Subjects: corpus linguistics, eap, academic discourse, academic writing
Online Access: https://iperstoria.it/article/view/1265
_version_ 1827914919416168448
author Marina Bondi
Matteo Di Cristofaro
author_facet Marina Bondi
Matteo Di Cristofaro
author_sort Marina Bondi
collection DOAJ
description The article discusses the ongoing creation of the MoReThesisCorpus, outlining its major characteristics and giving an account of the considerations and issues encountered so far. The corpus, composed of the theses submitted to the University of Modena and Reggio Emilia between 2011 and 2020, is being developed as part of the project CAP (‘Comunicazione Accademica e Professionale’, Academic and Professional Communication). It is meant to foster research into academic language from a cross-disciplinary discourse perspective and to facilitate the production of educational materials aimed at university students. It aims to support the acquisition of discipline-related vocabularies and styles, improving the learning of academic writing through corpus tools and resources in a data-driven learning approach. Technical details of the acquisition and subsequent processing of the data are discussed, along with a number of issues pertaining to both computer science and linguistics that directly affect the capability of the corpus to support an investigation of academic discourse across different languages and disciplines.
first_indexed 2024-03-13T02:52:06Z
format Article
id doaj.art-042ceaa09721440aa29d2a0379a403ef
institution Directory of Open Access Journals
issn 2281-4582
language English
last_indexed 2024-03-13T02:52:06Z
publishDate 2023-06-01
publisher Department of Foreign Languages and Literatures at the University of Verona
record_format Article
series Iperstoria
spelling doaj.art-042ceaa09721440aa29d2a0379a403ef | 2023-06-28T09:45:39Z | eng | Department of Foreign Languages and Literatures at the University of Verona | Iperstoria | 2281-4582 | 2023-06-01 | 21 | 10.13136/2281-4582/2023.i21.1265 | 1216 | MoReThesisCorpus | Marina Bondi | Matteo Di Cristofaro | https://iperstoria.it/article/view/1265 | corpus linguistics | eap | academic discourse | academic writing
title MoReThesisCorpus
topic corpus linguistics
eap
academic discourse
academic writing
url https://iperstoria.it/article/view/1265