期刊名称:TELKOMNIKA (Telecommunication Computing Electronics and Control)
印刷版ISSN:2302-9293
出版年度:2017
卷号:15
期号:2
页码:656-664
DOI:10.12928/telkomnika.v15i2.4061
语种:English
出版社:Universitas Ahmad Dahlan
其他摘要:In this paper, we proposed a work onrhetorical corpus construction andsentence classification model experiment that specifically could be incorporated in automatic paper title generation task for scientific article. Rhetorical classification is treated as sequence labeling. Rhetorical sentence classification model is useful in task which considers document’s discourse structure. We performed experiments using two domains of datasets: computer science (CS dataset), and chemistry (GaN dataset). We evaluated the models using 10-fold-cross validation (0.70-0.79 weighted average F-measure) as well as on-the-run (0.30-0.36 error rate at best). We argued that our models performed best when handled using SMOTE filter for imbalanced data
关键词:rhetorical corpus construction; rhetorical classification; automatic title generation; scientific article