期刊名称:Bulletin of the Technical Committee on Data Engineering
出版年度:2013
卷号:36
期号:3
出版社:IEEE Computer Society
摘要:Crisis informatics is a field of research that investigates the use of computer-mediated communication—including social media—by members of the public and other entities during times of mass emergency.Supporting this type of research is challenging because large amounts of ephemeral event data canbe generated very quickly and so must then be just as rapidly captured. Such data sets are challeng-ing to analyze because of their heterogeneity and size. We have been designing, developing, anddeploying software infrastructure to enable the large-scale collection and analysis of social mediadata during crisis events. We report on the challenges encountered when working in this space,the desired characteristics of such infrastructure, and the techniques, technology, and architecturesthat have been most useful in providing both scalability and .exibility. We also discuss the types ofanalytics this infrastructure supports and implications for future crisis informatics research