Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Design and Implementation of Full Text Search System for Structured Language Resources
MASAYA YAMAGUCHI, MAKIRO TANAKA

2005 Volume 12 Issue 4 Pages 55-77

Abstract
In this paper, we design and implement a full-text search system, “Himawari”. Himawari is designed to handle the varied structures and usages of language resources built for language study and research. To address the variety of structures, Himawari can search language resources structured in XML, extracting tagged information that can be used to constrain the results. It provides several kinds of indexes, such as suffix arrays, to speed up the search process. To address the variety of usages, users can define queries and methods of reference suited to the target language resource. Search results are displayed as a table that includes KWIC (Key Word In Context) and, when a result cannot be displayed as text data, can be passed to an external reference system such as an HTML browser or a sound player. By applying our system to a Japanese thesaurus, “Bunrui Goi Hyo”, and to the “Corpus of Spontaneous Japanese”, we verify its adaptability to these varieties.
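As a rough illustration of the two ideas named in the abstract, suffix-array indexing and KWIC display, the following is a minimal sketch in Python. It is not Himawari's implementation: the function names (`build_suffix_array`, `find_occurrences`, `kwic`), the naive index construction, and the fixed context width are all assumptions made for this example, and XML tag handling is left out entirely.

```python
def build_suffix_array(text):
    """Return the start positions of all suffixes of text, sorted lexicographically.

    Naive construction (sorting full suffixes); adequate for a small demo only.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, suffix_array, pattern):
    """Return every start position of pattern in text, in ascending order.

    Two binary searches locate the block of suffixes that begin with pattern.
    """
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[suffix_array[mid]:suffix_array[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[suffix_array[mid]:suffix_array[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(suffix_array[start:lo])


def kwic(text, positions, pattern, width=10):
    """Build KWIC rows as (left context, keyword, right context) tuples."""
    rows = []
    for p in positions:
        left = text[max(0, p - width):p]
        right = text[p + len(pattern):p + len(pattern) + width]
        rows.append((left, pattern, right))
    return rows
```

Calling `find_occurrences` on a text and its suffix array and passing the resulting positions to `kwic` yields one table row per hit, with the keyword in the middle column. A production index would use a faster construction algorithm and would store the array on disk rather than rebuild it for each search.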
© The Association for Natural Language Processing