Dataset of stopwords extracted from Uzbek texts
Filtering stop words is an important task when processing text queries to search for information in large data sets. It enables a reduction of the search space without losing the semantic meaning. The stop words, which have only grammatical roles and not contributing to information content still add...
Main Authors: | Khabibulla Madatov, Shukurla Bekchanov, Jernej Vičič |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2022-08-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340922005522 |
Similar Items
-
Dataset of Karakalpak language stop words
by: Khabibulla Madatov, et al.
Published: (2023-06-01) -
Text Mining technologies in sociological analysis (using the example of studying students`ideas about the mission of a modern university)
by: Antonina N. Pinchuk, et al.
Published: (2024-03-01) -
Dataset for Analysis of Russian-Language Reviews on MOOCs Extracted from Stepik
by: Yulia Dyulicheva
Published: (2022-12-01) -
STRUCTURAL PECULIARITIES OF BIGRAM-COLLOCATIONS IN LEGAL ENGLISH
by: Ol’ga M. Litvishko
Published: (2019-08-01) -
Classifying cuneiform symbols using machine learning algorithms with unigram features on a balanced dataset
by: Mahmood Maha, et al.
Published: (2023-09-01)