首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:Htab2RDF: Mapping HTML Tables to RDF Triples
  • 作者:Bouchiha, Djelloul ; Malki, Mimoun ; Alghamdi, Abdullah
  • 期刊名称:COMPUTING AND INFORMATICS
  • 印刷版ISSN:1335-9150
  • 出版年度:2018
  • 卷号:36
  • 期号:6
  • 页码:1467-1491
  • 语种:English
  • 出版社:COMPUTING AND INFORMATICS
  • 其他摘要:The Web has become a tremendously huge data source hidden under linked documents. A significant number of Web documents include HTML tables generated dynamically from relational databases. Often, there is no direct public access to the databases themselves. On the other hand, RDF (Resource Description Framework) gives an efficient mechanism to represent directly data on the Web based on a Web-scalable architecture for identification and interpretation of terms. This leads to the concept of Linked Data on the Web. To allow direct access to data on the Web as Linked Data, we propose in this paper an approach to transform HTML tables into RDF triples. It consists of three main phases: refining, pre-treatment and mapping. The whole process is assisted by a domain ontology and the WordNet lexical database. A tool called Htab2RDF has been implemented. Experiments have been carried out to evaluate and show efficiency of the proposed approach.
  • 关键词:HTML tables; RDF; relational databases; Linked Data; domain ontology; WordNet
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有