A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval

Although timely access to information is becoming increasingly important and gaining such access is no longer a problem, the capacity for humans to assimilate such huge amounts of information is limited. Topic Detection(TD) is then a promising research area that addresses speedy access of desired in...

Full description

Bibliographic Details
Main Authors: Steve Kansheng Shi, Lemin Li
Format: Article
Language:English
Published: Springer 2012-08-01
Series:International Journal of Computational Intelligence Systems
Subjects:
Online Access:https://www.atlantis-press.com/article/25868005.pdf
_version_ 1811291363091677184
author Steve Kansheng Shi
Lemin Li
author_facet Steve Kansheng Shi
Lemin Li
author_sort Steve Kansheng Shi
collection DOAJ
description Although timely access to information is becoming increasingly important and gaining such access is no longer a problem, the capacity for humans to assimilate such huge amounts of information is limited. Topic Detection(TD) is then a promising research area that addresses speedy access of desired information. However, ironically, the time complexity of existing TD algorithms themselves is usually up to the -th power of . Linear performance requirement of real world topic detection has not been significantly addressed. This paper reveals a new patented topic detection algorithm called that combines elevance odel with nformation etrieval technique to improve on time efficiency. Relevance Model(RM) is a theoretical extension of statistical language modeling that was developed for the task of document retrieval. To reduce the costs of fetching RM, we reduce the number of comparisons for stories by a query-based approach that makes similar stories exist in the top-k query results. We also build our query based on inverted indices, which have the complexity close to linear. The time cost of rest of operations in the topic detection process is a constant. Hence, the total complexity of topic detection algorithm should be close to linear as shown in experimental results. In addition, also gains better detection rates and robustness by relative entropy based topic model design
first_indexed 2024-04-13T04:28:14Z
format Article
id doaj.art-029d7f223ec7469a98007201b11029d8
institution Directory Open Access Journal
issn 1875-6883
language English
last_indexed 2024-04-13T04:28:14Z
publishDate 2012-08-01
publisher Springer
record_format Article
series International Journal of Computational Intelligence Systems
spelling doaj.art-029d7f223ec7469a98007201b11029d82022-12-22T03:02:24ZengSpringerInternational Journal of Computational Intelligence Systems1875-68832012-08-015410.1080/18756891.2012.718156A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices RetrievalSteve Kansheng ShiLemin LiAlthough timely access to information is becoming increasingly important and gaining such access is no longer a problem, the capacity for humans to assimilate such huge amounts of information is limited. Topic Detection(TD) is then a promising research area that addresses speedy access of desired information. However, ironically, the time complexity of existing TD algorithms themselves is usually up to the -th power of . Linear performance requirement of real world topic detection has not been significantly addressed. This paper reveals a new patented topic detection algorithm called that combines elevance odel with nformation etrieval technique to improve on time efficiency. Relevance Model(RM) is a theoretical extension of statistical language modeling that was developed for the task of document retrieval. To reduce the costs of fetching RM, we reduce the number of comparisons for stories by a query-based approach that makes similar stories exist in the top-k query results. We also build our query based on inverted indices, which have the complexity close to linear. The time cost of rest of operations in the topic detection process is a constant. Hence, the total complexity of topic detection algorithm should be close to linear as shown in experimental results. In addition, also gains better detection rates and robustness by relative entropy based topic model designhttps://www.atlantis-press.com/article/25868005.pdfTopic DetectionLink Topic DetectionRetrospective Event DetectionInformation RetrievalRelevance ModelsInverted Indices
spellingShingle Steve Kansheng Shi
Lemin Li
A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval
International Journal of Computational Intelligence Systems
Topic Detection
Link Topic Detection
Retrospective Event Detection
Information Retrieval
Relevance Models
Inverted Indices
title A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval
title_full A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval
title_fullStr A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval
title_full_unstemmed A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval
title_short A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval
title_sort close to linear topic detection algorithm using relative entropy based relevance model and inverted indices retrieval
topic Topic Detection
Link Topic Detection
Retrospective Event Detection
Information Retrieval
Relevance Models
Inverted Indices
url https://www.atlantis-press.com/article/25868005.pdf
work_keys_str_mv AT stevekanshengshi aclosetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval
AT leminli aclosetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval
AT stevekanshengshi closetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval
AT leminli closetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval