Summary: | One way to improve the performance of a search engine is to increase the hit ratio of its cache. A widely used approach combines a result cache, a posting list cache, and an intersection cache into a three-level cache architecture. However, existing multi-level cache architectures do not consider the dependencies among the content cached at different levels. As a result, the same content may be stored more than once, producing duplicate hits and wasting a large amount of cache space. In this paper, we propose a new static content-based three-level cache architecture with a content-based three-level Web cache filling algorithm that takes these dependencies into account. The proposed architecture reduces duplicate hits and increases the hit ratio. Extensive experiments on a real data set show a significant improvement in hit ratio compared with two existing approaches.
|
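To make the idea of duplicate hits concrete, the following is a minimal sketch of a static three-level cache fill. It is not the paper's algorithm, which the summary does not specify. The function names (`fill`, `lookup`, `hit_ratio`), the slot-based capacities, the frequency-based ranking, and the rule that a query counts as a posting-list hit when any of its terms is cached are all assumptions made for illustration. With `content_aware=False`, each level is filled independently from the query log. With `content_aware=True`, queries already served by a higher level are excluded when ranking candidates for the lower levels.

```python
from collections import Counter
from itertools import combinations


def top_keys(counter, n):
    """Return the n most frequent keys, breaking ties by key order."""
    ranked = sorted(counter.items(), key=lambda kv: (-kv[1], kv[0]))
    return {key for key, _ in ranked[:n]}


def pairs(query):
    """All unordered term pairs of a query (a tuple of terms)."""
    return set(combinations(sorted(set(query)), 2))


def fill(query_freq, slots, content_aware):
    """Hypothetical static fill of the result, intersection, and posting caches.

    query_freq maps a query (tuple of terms) to its frequency in the log.
    slots maps each level name to the number of entries it can hold.
    """
    # Level 1: cache results of the most frequent queries.
    result = top_keys(Counter(query_freq), slots["result"])

    # Level 2: rank term pairs; optionally ignore queries level 1 already serves.
    pair_freq = Counter()
    for query, freq in query_freq.items():
        if content_aware and query in result:
            continue
        for pair in pairs(query):
            pair_freq[pair] += freq
    intersection = top_keys(pair_freq, slots["intersection"])

    # Level 3: rank terms; optionally ignore queries levels 1 or 2 already serve.
    term_freq = Counter()
    for query, freq in query_freq.items():
        if content_aware and (query in result or pairs(query) & intersection):
            continue
        for term in set(query):
            term_freq[term] += freq
    posting = top_keys(term_freq, slots["posting"])

    return result, intersection, posting


def lookup(query, caches):
    """Return the first level that can serve the query, or 'miss'."""
    result, intersection, posting = caches
    if query in result:
        return "result"
    if pairs(query) & intersection:
        return "intersection"
    if set(query) & posting:  # simplification: any cached term counts as a hit
        return "posting"
    return "miss"


def hit_ratio(query_freq, caches):
    """Fraction of query volume served by any cache level."""
    total = sum(query_freq.values())
    hits = sum(f for q, f in query_freq.items() if lookup(q, caches) != "miss")
    return hits / total
```

On a toy log such as `{("a", "b"): 10, ("c", "d"): 4, ("c", "e"): 3}` with one slot per level, the independent fill spends all three slots on content related to the query `("a", "b")`, so the lower levels only duplicate hits that the result cache already provides. The content-aware fill instead uses the lower levels for the remaining queries.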