IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Mining Text Documents for Thematic Hierarchies Using Self-Organizing Maps

Mining Text Documents for Thematic Hierarchies Using Self-Organizing Maps
View Sample PDF
Author(s): Hsin-Chang Yang (Chang Jung University, Taiwan)and Chung-Hong Lee (Chang Jung University, Taiwan)
Copyright: 2003
Pages: 21
Source title: Data Mining: Opportunities and Challenges
Source Author(s)/Editor(s): John Wang (Montclair State University, USA)
DOI: 10.4018/978-1-59140-051-6.ch008

Purchase

View Mining Text Documents for Thematic Hierarchies Using Self-Organizing Maps on the publisher's website for pricing and purchasing information.

Abstract

Recently, many approaches have been devised for mining various kinds of knowledge from texts. One important application of text mining is to identify themes and the semantic relations among these themes for text categorization. Traditionally, these themes were arranged in a hierarchical manner to achieve effective searching and indexing as well as easy comprehension for human beings. The determination of category themes and their hierarchical structures was mostly done by human experts. In this work, we developed an approach to automatically generate category themes and reveal the hierarchical structure among them. We also used the generated structure to categorize text documents. The document collection was trained by a self-organizing map to form two feature maps. We then analyzed these maps and obtained the category themes and their structure. Although the test corpus contains documents written in Chinese, the proposed approach can be applied to documents written in any language, and such documents can be transformed into a list of separated terms.

Related Content

. © 2023. 34 pages.
. © 2023. 15 pages.
. © 2023. 15 pages.
. © 2023. 18 pages.
. © 2023. 24 pages.
. © 2023. 32 pages.
. © 2023. 21 pages.
Body Bottom