A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus

The use of specialised corpora as support material in ESP/EAP is a widespread phenomenon yet, to the best of our knowledge, the amount of corpus-based materials in the area of legal English is considerably reduced. This study presents the proposal of several corpus-based activities to foster the ac...

Full description

Bibliographic Details
Main Author: María José Marín Pérez
Format: Article
Language:English
Published: Universidad de Zaragoza 2014-12-01
Series:Miscelánea: A Journal of English and American Studies
Subjects:
Online Access:https://papiro.unizar.es/ojs/index.php/misc/article/view/8774
_version_ 1797786758976372736
author María José Marín Pérez
author_facet María José Marín Pérez
author_sort María José Marín Pérez
collection DOAJ
description The use of specialised corpora as support material in ESP/EAP is a widespread phenomenon yet, to the best of our knowledge, the amount of corpus-based materials in the area of legal English is considerably reduced. This study presents the proposal of several corpus-based activities to foster the acquisition of legal terminology. To that end, an 8.85 million-word legal English corpus (BLaRC) was compiled and processed to identify its most relevant terms using P.Drouin's (2003) TermoStat software. A corpus comprising three legal English textbooks (LegTeXT) was created to establish a comparison between the term inventories obtained from both corpora and thus determine the usefulness of the term list obtained from the former corpus to be used as support teaching material. Several tests were carried out finding 67% overlap between the two term inventories. Furthermore, using Heatley and Nation's (1996) Range software, it was found that 12.37% of the running words in the legal English textbooks were covered by the term inventory obtained from BLaRC. After reviewing the literature on the subject, a taxonomy is also suggested for the classification of legal terms followed by the proposal of several vocabulary activities focusing on varied linguistic levels, namely, morphological, syntactic, semantic and discursive.
first_indexed 2024-03-13T01:12:26Z
format Article
id doaj.art-341ceaf212c5495487052502d45fb5fd
institution Directory Open Access Journal
issn 1137-6368
2386-4834
language English
last_indexed 2024-03-13T01:12:26Z
publishDate 2014-12-01
publisher Universidad de Zaragoza
record_format Article
series Miscelánea: A Journal of English and American Studies
spelling doaj.art-341ceaf212c5495487052502d45fb5fd2023-07-05T16:56:59ZengUniversidad de ZaragozaMiscelánea: A Journal of English and American Studies1137-63682386-48342014-12-014910.26754/ojs_misc/mj.20148774A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English CorpusMaría José Marín Pérez0Universidad de Murcia The use of specialised corpora as support material in ESP/EAP is a widespread phenomenon yet, to the best of our knowledge, the amount of corpus-based materials in the area of legal English is considerably reduced. This study presents the proposal of several corpus-based activities to foster the acquisition of legal terminology. To that end, an 8.85 million-word legal English corpus (BLaRC) was compiled and processed to identify its most relevant terms using P.Drouin's (2003) TermoStat software. A corpus comprising three legal English textbooks (LegTeXT) was created to establish a comparison between the term inventories obtained from both corpora and thus determine the usefulness of the term list obtained from the former corpus to be used as support teaching material. Several tests were carried out finding 67% overlap between the two term inventories. Furthermore, using Heatley and Nation's (1996) Range software, it was found that 12.37% of the running words in the legal English textbooks were covered by the term inventory obtained from BLaRC. After reviewing the literature on the subject, a taxonomy is also suggested for the classification of legal terms followed by the proposal of several vocabulary activities focusing on varied linguistic levels, namely, morphological, syntactic, semantic and discursive. https://papiro.unizar.es/ojs/index.php/misc/article/view/8774Specialised corporaEnglish for specialised purposes (ESP)datadriven learning (DDL)legal English
spellingShingle María José Marín Pérez
A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus
Miscelánea: A Journal of English and American Studies
Specialised corpora
English for specialised purposes (ESP)
datadriven learning (DDL)
legal English
title A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus
title_full A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus
title_fullStr A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus
title_full_unstemmed A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus
title_short A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus
title_sort proposal to exploit legal term repertoires extracted automatically from a legal english corpus
topic Specialised corpora
English for specialised purposes (ESP)
datadriven learning (DDL)
legal English
url https://papiro.unizar.es/ojs/index.php/misc/article/view/8774
work_keys_str_mv AT mariajosemarinperez aproposaltoexploitlegaltermrepertoiresextractedautomaticallyfromalegalenglishcorpus
AT mariajosemarinperez proposaltoexploitlegaltermrepertoiresextractedautomaticallyfromalegalenglishcorpus