The N-Grams Based Text Similarity Detection Approach Using Self-Organizing Maps and Similarity Measures

This paper proposes a word-level n-gram approach for finding similarity between texts. The approach combines two separate, independent techniques: the self-organizing map (SOM) and text similarity measures. SOM’s uniqueness is that the obtained results of data clusterin...

Bibliographic Details
Main Authors: Pavel Stefanovič, Olga Kurasova, Rokas Štrimaitis
Format: Article
Language: English
Published: MDPI AG 2019-05-01
Series: Applied Sciences
Online Access: https://www.mdpi.com/2076-3417/9/9/1870