首页    期刊浏览 2024年12月04日 星期三
登录注册

文章基本信息

  • 标题:Improving Arabic Cognitive Distortion Classification in Twitter using BERTopic
  • 本地全文:下载
  • 作者:Fatima Alhaj ; Ali Al-Haj ; Ahmad Sharieh
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2022
  • 卷号:13
  • 期号:1
  • DOI:10.14569/IJACSA.2022.0130199
  • 语种:English
  • 出版社:Science and Information Society (SAI)
  • 摘要:Social media platforms allow users to share thoughts, experiences, and beliefs. These platforms represent a rich resource for natural language processing techniques to make inferences in the context of cognitive psychology. Some inaccurate and biased thinking patterns are defined as cognitive distortions. Detecting these distortions helps users restructure how to perceive thoughts in a healthier way. This paper proposed a machine learning-based approach to improve cognitive distortions’ classi-fication of the Arabic content over Twitter. One of the challenges that face this task is the text shortness, which results in a sparsity of co-occurrence patterns and a lack of context information (semantic features). The proposed approach enriches text rep-resentation by defining the latent topics within tweets. Although classification is a supervised learning concept, the enrichment step uses unsupervised learning. The proposed algorithm utilizes a transformer-based topic modeling (BERTopic). It employs two types of document representations and performs averaging and concatenation to produce contextual topic embeddings. A comparative analysis of F1-score, precision, recall, and accuracy is presented. The experimental results demonstrate that our enriched representation outperformed the baseline models by different rates. These encouraging results suggest that using latent topic distribution, obtained from the BERTopic technique, can improve the classifier’s ability to distinguish between different CD categories.
  • 关键词:Arabic tweets; cognitive distortions’ classification; machine learning; social media; supervised learning; unsupervised learning; transformers; BERTopic; topic modeling
国家哲学社会科学文献中心版权所有