EFFECTIVE SUMMARY FOR MASSIVE DATA SET

The research efforts attempt to investigate size of the data increasing interest in designing the effective algorithm for space and time reduction. Providing high-dimensional technique over large data set is difficult. However, Randomized techniques are used for analyzing the data set where the perf...

Full description

Bibliographic Details
Main Author: A. Radhika
Format: Article
Language:English
Published: ICT Academy of Tamil Nadu 2015-07-01
Series:ICTACT Journal on Soft Computing
Subjects:
Online Access:http://ictactjournals.in/paper/IJSC_Paper_7_pp_1046_1056.pdf
Description
Summary:The research efforts attempt to investigate size of the data increasing interest in designing the effective algorithm for space and time reduction. Providing high-dimensional technique over large data set is difficult. However, Randomized techniques are used for analyzing the data set where the performance of the data from part of storage in networks needs to be collected and analyzed continuously. Previously collaborative filtering approach is used for finding the similar patterns based on the user ranking but the outcomes are not observed yet. Linear approach requires high running time and more space. To overcome this sketching technique is used to represent massive data sets. Sketching allows short fingerprints of the item sets of users which allow approximately computing similarity between sets of different users. The concept of sketching is to generate minimum subset of record that executes all the original records. Sketching performs two techniques dimensionality reduction which reduces rows or columns and data reduction. It is proved that sketching can be performed using Principal Component Analysis for finding index value
ISSN:0976-6561
2229-6956