Comparative analysis of similarity measures for sentence level semantic measurement of text

The accuracy of similarity measurement between sentences is critical to the performance of several applications such as text mining, question answering, and text summarization. This paper focuses on calculating semantic similarities between sentences and performing a comparative analysis among ident...

Full description

Bibliographic Details
Main Authors: Mohd Saad, Sazianti, Kamarudin, Siti Sakira
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/10259/1/22.pdf
Description
Summary:The accuracy of similarity measurement between sentences is critical to the performance of several applications such as text mining, question answering, and text summarization. This paper focuses on calculating semantic similarities between sentences and performing a comparative analysis among identified similarity measurement techniques.Comparison between three popular similarity measurements which are Jaccard, Cosine and Dice similarity measures has been conducted.The performance of each identified measurement was evaluated and recorded.In this paper, we use a large lexical database of English known as WordNet to calculate the world-toward semantic similarity.The result of this research concludes that the Jaccard and Dice performs better in measuring the semantic similarity between sentences.