文章基本信息

标题：Identification of Spontaneous Spoken Texts in Slovak
本地全文：下载
作者：Róbert Sabo ; Peter Krammer ; Ján Mojžiš 等
期刊名称：Journal of Linguistics/Jazykovedný casopis
印刷版ISSN：0021-5597
出版年度：2019
卷号：70
期号：2
页码：481-490
DOI：10.2478/jazcas-2019-0076
出版社：Walter de Gruyter GmbH
摘要：We propose a text classification method for the purpose of creating a language model for automatic recognition of spontaneous spoken speech. Transcripts from our departmental speech database served as spontaneous spoken texts. Using supervised machine learning methods, we have created multiple classification models (including neural networks), that were able to distinguish them from written texts with high accuracy. We subsequently verified the accuracy of our trained models on a database of texts containing direct speech extracted from newspaper articles.
关键词：spontaneous speech ; text classification ; supervised machine learning ; neural networks ; Slovak language