Journal: International Journal of Computer Trends and Technology
Electronic ISSN: 2231-2803
Publication year: 2016
Volume: 35
Issue: 2
Pages: 114-117
DOI: 10.14445/22312803/IJCTT-V35P120
Publisher: Seventh Sense Research Group
Abstract: Skyline is an important operation in many applications: it returns a set of interesting points from a potentially huge data space. This survey paper highlights the characteristics of big data and the challenges they pose, and discusses the tools and techniques of big data. Existing algorithms such as SaLSa and SSPL are novel skyline computation algorithms. SaLSa exploits the idea of presorting the input data so as to effectively limit the number of tuples to be read and compared. SSPL utilizes sorted positional index lists, which require low space overhead, to reduce I/O cost significantly. SSPL consists of two phases. In phase 1, SSPL computes the scan depth of the involved sorted positional index lists. While retrieving the lists in a round-robin fashion, SSPL performs pruning on each candidate positional index to discard any candidate whose corresponding tuple is not a skyline result. Phase 1 ends when a candidate positional index has been seen in all of the involved lists. In phase 2, SSPL exploits the obtained candidate positional indexes to get the skyline results by a selective and sequential scan on the table.
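The skyline operation and the presorting idea mentioned in the abstract can be illustrated with a minimal Python sketch. It is not SaLSa or SSPL: it shows only a generic sort-then-filter skyline and omits SaLSa's early-termination condition and SSPL's positional index lists. The function names, the choice of the coordinate sum as the sort key, and the assumption that smaller values are better in every dimension are all illustrative.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(p: Point, q: Point) -> bool:
    """p dominates q if p is no worse in every dimension and strictly
    better in at least one (assumption: smaller is better)."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def presorted_skyline(points: Sequence[Point]) -> List[Point]:
    """Generic sort-then-filter skyline (hypothetical helper, not SaLSa).

    Sorting by the coordinate sum guarantees that a point can only be
    dominated by points that appear earlier in the order, so each point
    needs to be compared only against the skyline found so far.
    """
    ordered = sorted(points, key=sum)
    skyline: List[Point] = []
    for p in ordered:
        if not any(dominates(s, p) for s in skyline):
            skyline.append(p)
    return skyline


# Made-up example data: (price, distance) pairs, lower is better for both.
hotels = [(50, 8), (60, 3), (80, 1), (70, 5), (90, 4)]
result = presorted_skyline(hotels)
```

Here `(70, 5)` and `(90, 4)` are both dominated by `(60, 3)`, so `result` holds the three remaining points. Presorting does not change the result; it only ensures that a point, once admitted to the skyline, is never evicted later.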