首页    期刊浏览 2024年12月01日 星期日
登录注册

文章基本信息

  • 标题:DMBVA - A Compression-Based Distributed Data Warehouse Management In Parallel Environment
  • 本地全文:下载
  • 作者:Fazlul Hasan Siddiqui ; Abu Sayed Md. Latiful Hoque
  • 期刊名称:Malaysian Journal of Computer Science
  • 印刷版ISSN:0127-9084
  • 出版年度:2007
  • 卷号:20
  • 期号:1
  • 出版社:University of Malaya * Faculty of Computer Science and Information Technology
  • 摘要:Parallel and distributed data warehouse architectures have been evolved to support online queries on massive data in a short time. Unfortunately, the emergence of eapplication has been creating extremely high volume of data that reaches to terabyte threshold. The conventional data warehouse management system is costlier in terms of storage space and processing speed and sometimes it is unable to handle such huge amount of data. As a result, there is a crucial need for the new algorithms and techniques to store and manipulate these data. In this paper, we have presented a compressionbased distributed data warehouse architecture – ‘DMBVA’ for storage of warehouse data, and support online queries efficiently. We have achieved a factor of 2530 compression compared to SQL server data warehouse. The main computational component of data warehouse is the generation and querying on the data cube. Our algorithm – ‘PCVDC’ generates data cube directly from the compressed form of data in parallel. The reduction in the size of data cube is a factor of 3045 compared to existing methods. The response time has also been significantly improved. These improvements are achieved by eliminating the suffix and prefix redundancy, virtual nature of the data cube, direct addressability of compressed form of data and parallel computation. Experimental evaluation shows the improved performance over the existing systems.
  • 关键词:Data Warehouse; Compression; Parallel; Virtual Data Cube
国家哲学社会科学文献中心版权所有