摘要:Documents use tables to communicate multidimensional information clearly, summarise and present data in aneasy-to-interpret way. Tabular information in scanned PDF due to its nature without further processing is notaccessible for vision-impaired people who use assistive technology such as screen readers. The lack of access to tablecontents limits educational and workplace opportunities for people with vision impairment. They require a completeequivalent to access table. This paper describes techniques which apply to scanned PDF document for table detection,extraction and cell segmentation to retrieve cell contents and represent them in a navigable manner tovision-impaired.The output is in mark-up format and provides navigation ability to access content of a table.
其他摘要:Documents use tables to communicate multidimensional information clearly, summarise and present data in aneasy-to-interpret way. Tabular information in scanned PDF due to its nature without further processing is notaccessible for vision-impaired people who use assistive technology such as screen readers. The lack of access to tablecontents limits educational and workplace opportunities for people with vision impairment. They require a completeequivalent to access table. This paper describes techniques which apply to scanned PDF document for table detection,extraction and cell segmentation to retrieve cell contents and represent them in a navigable manner tovision-impaired.The output is in mark-up format and provides navigation ability to access content of a table.