The Fractal Patterns of Words in a Text: A Method for Automatic Keyword Extraction.

A text can be considered as a one-dimensional array of words. The locations of each word type in this array form a fractal pattern with a certain fractal dimension. We observe that important words responsible for conveying the meaning of a text have dimensions considerably different from one, while th...

Bibliographic Details
Main Authors: Elham Najafi, Amir H Darooneh
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2015-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC4474631?pdf=render
_version_ 1818584016687726592
author Elham Najafi
Amir H Darooneh
author_facet Elham Najafi
Amir H Darooneh
author_sort Elham Najafi
collection DOAJ
description A text can be considered as a one-dimensional array of words. The locations of each word type in this array form a fractal pattern with a certain fractal dimension. We observe that important words responsible for conveying the meaning of a text have dimensions considerably different from one, while the fractal dimensions of unimportant words are close to one. We introduce an index that quantifies the importance of the words in a given text using their fractal dimensions, and then rank the words by this importance. The index measures the difference between the fractal pattern of a word in the original text and in a shuffled version. Because the shuffled text is meaningless (i.e., words have no importance), the difference between the original and shuffled text can be used to ascertain the degree of fractality. The degree of fractality may be used for automatic keyword detection: words whose degree of fractality exceeds a threshold value are taken to be the retrieved keywords of the text. We measure the efficiency of our method for keyword extraction by comparing it with two other well-known methods of automatic keyword extraction.
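The procedure outlined in the description can be illustrated with a minimal Python sketch. This record does not give the article's exact definition of the fractal dimension estimator or of the degree of fractality, so everything below is an assumption made for illustration: the dimension is estimated by plain box counting over word positions, the score is the absolute difference between a word's dimension in the original text and its mean dimension over a few shuffles, and the names `box_counting_dimension`, `fractality_scores`, and `extract_keywords` are hypothetical.

```python
import math
import random


def box_counting_dimension(positions, text_length):
    """Estimate a box-counting dimension for word positions in [0, text_length).

    Assumption: boxes are runs of consecutive word slots with sizes 1, 2, 4, ...
    and the dimension is the least-squares slope of log N(s) against log(1/s).
    """
    if len(positions) < 2:
        return 0.0
    xs, ys = [], []
    size = 1
    while size < text_length:
        occupied = {p // size for p in positions}  # boxes holding the word
        xs.append(math.log(1.0 / size))
        ys.append(math.log(len(occupied)))
        size *= 2
    n = len(xs)
    if n < 2:
        return 0.0
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def _dimensions(words):
    """Map each word type to the estimated dimension of its positions."""
    positions = {}
    for index, word in enumerate(words):
        positions.setdefault(word, []).append(index)
    return {w: box_counting_dimension(p, len(words)) for w, p in positions.items()}


def fractality_scores(words, n_shuffles=5, seed=0):
    """Score each word by |D_original - mean D_shuffled| (an assumed proxy)."""
    rng = random.Random(seed)
    original = _dimensions(words)
    shuffled_total = dict.fromkeys(original, 0.0)
    for _ in range(n_shuffles):
        shuffled = list(words)
        rng.shuffle(shuffled)  # destroys word order, keeps word frequencies
        for word, dim in _dimensions(shuffled).items():
            shuffled_total[word] += dim
    return {w: abs(original[w] - shuffled_total[w] / n_shuffles) for w in original}


def extract_keywords(words, threshold, n_shuffles=5, seed=0):
    """Return words scoring above the threshold, highest score first."""
    scores = fractality_scores(words, n_shuffles=n_shuffles, seed=seed)
    selected = [w for w, s in scores.items() if s > threshold]
    return sorted(selected, key=lambda w: -scores[w])
```

A call such as `extract_keywords(text.lower().split(), threshold=0.1)` would rank the word types of a tokenized text; the tokenization and the threshold value are placeholders, and a suitable threshold depends on the text and on the scoring definition actually used.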
first_indexed 2024-12-16T08:14:28Z
format Article
id doaj.art-77ebc7fc6c3d4451a4b5cf0d676a5247
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-16T08:14:28Z
publishDate 2015-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-77ebc7fc6c3d4451a4b5cf0d676a5247 | 2022-12-21T22:38:17Z | eng | Public Library of Science (PLoS) | PLoS ONE | 1932-6203 | 2015-01-01 | 10(6): e0130617 | 10.1371/journal.pone.0130617 | The Fractal Patterns of Words in a Text: A Method for Automatic Keyword Extraction. | Elham Najafi | Amir H Darooneh | http://europepmc.org/articles/PMC4474631?pdf=render
title The Fractal Patterns of Words in a Text: A Method for Automatic Keyword Extraction.
title_sort fractal patterns of words in a text a method for automatic keyword extraction
url http://europepmc.org/articles/PMC4474631?pdf=render