Corpulyzer: A Novel Framework for Building Low Resource Language Corpora
The rapid proliferation of artificial intelligence has led to the development of sophisticated cutting-edge systems in natural language processing and computational linguistics domains. These systems heavily rely on high-quality dataset/corpora for the training of deep-learning algorithms to develop...
Main Authors: | Bilal Tahir, Muhammad Amir Mehmood |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9316706/ |
Similar Items
-
<span style="font-variant: small-caps">esCorpius-m</span>: A Massive Multilingual Crawling Corpus with a Focus on Spanish
by: Asier Gutiérrez-Fandiño, et al.
Published: (2023-11-01) -
Steps for Creating two Persian Specialized Corpora
by: Elham Alayiaboozar, et al.
Published: (2022-10-01) -
Design and Implementation of a Language Specific Crawler to Improve Crawling of Persian Web Documents
by: Masomeh Azimzadeh, et al.
Published: (2009-12-01) -
THE WEB AS CORPUS AND ONLINE CORPORA FOR LEGAL TRANSLATIONS
by: Patrizia GIAMPIERI
Published: (2019-05-01) -
THE WEB AS CORPUS AND ONLINE CORPORA FOR LEGAL TRANSLATIONS
by: Patrizia GIAMPIERI
Published: (2019-05-01)