文章基本信息

标题：Clustering Files with Extended File Attributes in Metadata
本地全文：下载
作者：Han, Lin ; Huang, Hao ; Xie, Changsheng 等
期刊名称：Journal of Multimedia
印刷版ISSN：1796-2048
出版年度：2014
卷号：9
期号：2
页码：278-285
DOI：10.4304/jmm.9.2.278-285
语种：English
出版社：Academy Publisher
摘要：Classification and searching play an important role in modern file systems and file clustering is an effective approach to do this. This paper presents a new labeling system by making use of the Extended File Attributes [1] of file system, and a simple file clustering algorithm based on this labeling system is also introduced. By regarding attributes and attribute-value pairs as labels of files, features of a file can be represented as binary vectors of labels. And some well-known binary vector dissimilarity measures can be performed on this binary vector space, so clustering based on these measures can be done also. This approach is evaluated with several real-life datasets, and results indicate that precise clustering of files is achieved at an acceptable cost
关键词：File Clustering;Extended File Attributes;File System;Binary Vector;Dissimilarity Measure