A set theory based similarity measure for text clustering and classification

Abstract Similarity measures have long been utilized in information retrieval and machine learning domains for multi-purposes including text retrieval, text clustering, text summarization, plagiarism detection, and several other text-processing applications. However, the problem with these measures...

Full description

Bibliographic Details
Main Authors: Ali A. Amer, Hassan I. Abdalla
Format: Article
Language:English
Published: SpringerOpen 2020-09-01
Series:Journal of Big Data
Subjects:
Online Access:http://link.springer.com/article/10.1186/s40537-020-00344-3