Cloud Information Retrieval: Model Description and Scheme Design

The fast development of cloud technology has brought about a new trend in the field of information service: more and more information is being transferred to the cloud as requested. However, the data, such as texts, images, sounds, and videos, before being moved to the cloud, in most cases, has to b...

Full description

Bibliographic Details
Main Authors: Zhen Yang, Jiliang Tang, Huan Liu
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8272322/
_version_ 1819163098540408832
author Zhen Yang
Jiliang Tang
Huan Liu
author_facet Zhen Yang
Jiliang Tang
Huan Liu
author_sort Zhen Yang
collection DOAJ
description The fast development of cloud technology has brought about a new trend in the field of information service: more and more information is being transferred to the cloud as requested. However, the data, such as texts, images, sounds, and videos, before being moved to the cloud, in most cases, has to be encrypted so that intelligible information will not be obtained from unauthorized accesses. While having done a nice work in protecting the data privacy of its owners, this encrypting process, has produced a great challenge for retrieval of the document stored via traditional IR model based on document, query and relevance. In order to retrieve encrypted information from cloud, an alternative retrieval system is needed. To satisfy such a need, we have: 1) build a cloud information retrieval framework characterized by its retrieval risk formula, which, enables, for the very first time to the best of our knowledge, an effective retrieval of keywords from encrypted cloud data without undermining key word privacy and retrieval performance; and 2) upgraded the existing searchable encryption scheme that can only support simple equality queries on encrypted data and has been proved to perform slightly better than random selection, so that it can now support the state-of-art information retrieval methods, such as vector space, probabilistic, and language model. To evaluate the effect of the system proposed above, we've conducted a wide range of experiments on benchmark data sets, of which the results shows that solution can fulfill its purposes quite well in various settings.
first_indexed 2024-12-22T17:38:44Z
format Article
id doaj.art-50aac4748c3a41e8a5a0216e495010a8
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-22T17:38:44Z
publishDate 2018-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-50aac4748c3a41e8a5a0216e495010a82022-12-21T18:18:28ZengIEEEIEEE Access2169-35362018-01-016154201543010.1109/ACCESS.2018.27971318272322Cloud Information Retrieval: Model Description and Scheme DesignZhen Yang0https://orcid.org/0000-0002-6058-0217Jiliang Tang1Huan Liu2College of Computer Science, Beijing University of Technology, Beijing, ChinaDepartment of Computer Science and Engineering, Michigan State University, East Lansing, MI, USASchool of Computing, Informatics, and Decision Systems Engineering, Ira A. Fulton Schools of Engineering, Arizona State University, Tempe, AZ, USAThe fast development of cloud technology has brought about a new trend in the field of information service: more and more information is being transferred to the cloud as requested. However, the data, such as texts, images, sounds, and videos, before being moved to the cloud, in most cases, has to be encrypted so that intelligible information will not be obtained from unauthorized accesses. While having done a nice work in protecting the data privacy of its owners, this encrypting process, has produced a great challenge for retrieval of the document stored via traditional IR model based on document, query and relevance. In order to retrieve encrypted information from cloud, an alternative retrieval system is needed. To satisfy such a need, we have: 1) build a cloud information retrieval framework characterized by its retrieval risk formula, which, enables, for the very first time to the best of our knowledge, an effective retrieval of keywords from encrypted cloud data without undermining key word privacy and retrieval performance; and 2) upgraded the existing searchable encryption scheme that can only support simple equality queries on encrypted data and has been proved to perform slightly better than random selection, so that it can now support the state-of-art information retrieval methods, such as vector space, probabilistic, and language model. To evaluate the effect of the system proposed above, we've conducted a wide range of experiments on benchmark data sets, of which the results shows that solution can fulfill its purposes quite well in various settings.https://ieeexplore.ieee.org/document/8272322/Information retrievalcloud computingsearchable encryptionkeyword extractionquery expansion
spellingShingle Zhen Yang
Jiliang Tang
Huan Liu
Cloud Information Retrieval: Model Description and Scheme Design
IEEE Access
Information retrieval
cloud computing
searchable encryption
keyword extraction
query expansion
title Cloud Information Retrieval: Model Description and Scheme Design
title_full Cloud Information Retrieval: Model Description and Scheme Design
title_fullStr Cloud Information Retrieval: Model Description and Scheme Design
title_full_unstemmed Cloud Information Retrieval: Model Description and Scheme Design
title_short Cloud Information Retrieval: Model Description and Scheme Design
title_sort cloud information retrieval model description and scheme design
topic Information retrieval
cloud computing
searchable encryption
keyword extraction
query expansion
url https://ieeexplore.ieee.org/document/8272322/
work_keys_str_mv AT zhenyang cloudinformationretrievalmodeldescriptionandschemedesign
AT jiliangtang cloudinformationretrievalmodeldescriptionandschemedesign
AT huanliu cloudinformationretrievalmodeldescriptionandschemedesign