Abstract: A trie is an ordered tree data structure for storing keywords, used in natural language processing and other applications. A trie can be represented by a double array, which supports fast retrieval because each transition between nodes takes O(1) time. The double array using linear functions (DALF) has been proposed as a compression method for the double array; it reduces space usage to 60% of that of the double array. DALF is built using parameters, and both its space usage and its construction time depend on them. However, appropriate values for these parameters have not been determined. This paper examines the parameters and evaluates them experimentally. The experiments identify appropriate parameter values and show that DALF can be built more efficiently for keyword sets that include multibyte characters.
Other keywords: Trie, double array, construction method, keyword search.
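
For readers unfamiliar with the structure, the O(1) transition of a plain double array can be sketched as follows. This is a minimal illustration of the standard BASE/CHECK scheme, not DALF itself and not the construction method evaluated in the paper. The function names, the byte-plus-one character codes, the zero-byte terminator, and the first-fit search for a base value are all assumptions made for this example.

```python
from collections import deque

FREE = -1   # marks an unused slot in CHECK (assumption of this sketch)
ROOT = 0    # index of the root state


def to_codes(word):
    """Encode a keyword as UTF-8 bytes plus a zero terminator; codes are byte + 1."""
    return [b + 1 for b in word.encode("utf-8") + b"\0"]


def build_double_array(keywords):
    """Build BASE and CHECK arrays from a keyword set (naive first-fit placement)."""
    # Step 1: build an ordinary trie as nested dicts keyed by character code.
    trie = {}
    for word in keywords:
        node = trie
        for c in to_codes(word):
            node = node.setdefault(c, {})

    # Step 2: place each node's children into the arrays, breadth first.
    base = [0]
    check = [-2]  # the root slot is occupied and has no parent
    queue = deque([(ROOT, trie)])
    while queue:
        state, node = queue.popleft()
        if not node:
            continue  # leaf reached through the terminator
        codes = list(node)
        b = 1
        # First fit: smallest base such that every child slot is free.
        while any(b + c < len(check) and check[b + c] != FREE for c in codes):
            b += 1
        top = b + max(codes)
        if top >= len(check):
            grow = top + 1 - len(check)
            base.extend([0] * grow)
            check.extend([FREE] * grow)
        base[state] = b
        for c, child in node.items():
            t = b + c
            check[t] = state  # record the parent so transitions can be verified
            queue.append((t, child))
    return base, check


def contains(base, check, word):
    """Return True if word was stored; each step is one addition and one comparison."""
    state = ROOT
    for c in to_codes(word):
        t = base[state] + c
        if t >= len(check) or check[t] != state:
            return False
        state = t
    return True


base, check = build_double_array(["trie", "tree", "日本"])
print(contains(base, check, "tree"), contains(base, check, "tr"))
```

Each transition costs constant time, so looking up a keyword takes time proportional to its length in bytes. A multibyte character such as 日 is simply followed as several byte-level transitions in this sketch.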