首页    期刊浏览 2025年03月02日 星期日
登录注册

文章基本信息

  • 标题:TREND DETECTION IN THE ARABIC SOCIAL MEDIA USING VOTING COMBINATION
  • 本地全文:下载
  • 作者:ALI SABAH ABDULAMEER ; SAIDAH SAAD ; LAILATUL QADRI ZAKARIA
  • 期刊名称:Journal of Theoretical and Applied Information Technology
  • 印刷版ISSN:1992-8645
  • 电子版ISSN:1817-3195
  • 出版年度:2015
  • 卷号:81
  • 期号:3
  • 出版社:Journal of Theoretical and Applied
  • 摘要:The amount of information has been increasing tremendously, especially with the use of social media applications, such as Twitter, Facebook, and YouTube. Twitter is a common social application that enables users to share their current thoughts and actions, comment on breaking news, and engage in discussions. Trends are typically driven by emerging events, breaking news, and general topics that attract the attention of a large fraction of Twitter users. Thus, trend detection is highly valuable to news reporters and analysts because they may point to fast-evolving news stories. Researchers have been attempting to detect trends using machine-learning techniques, such as clustering method based on major languages (e.g., English, German, and French). The Arabic language remains in its infancy, but the Arabic social media have been contributing to a large amount of data because of the significant events in the Middle East. The present research aims to detect trends in the Arabic social media. However, this research must overcome several issues such as processing of Arabic user-generated content and lack of resources. To solve these issues, this research presents a voting combination clustering approach, which is divided into six phases, namely, dataset collection from Twitter, text pre-processing, spam filtering using Na�ve Bayes, feature selection based on term frequency�inverse document frequency and entropy, statistical analyses, and evaluation. Three statistical approaches for clustering are used, namely, co-occurrence, k-means, and voting combination. The analyses are performed to classify the trends into three categories, namely, Arabic nationality events, personal events, and other events. Experimental results indicate that the voting combination clustering achieved 93%, 87%, and 90% for precision, recall, and f-measure in trend detection, respectively. Finally, trend detection of events is important to companies, governments, national security agencies, and journalists to develop strategies to rectify them.
  • 关键词:Trend detection; Arabic social media; Term Clustering; K-means and Voting Combination.
国家哲学社会科学文献中心版权所有