IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Lexical Granularity for Automatic Indexing and Means to Achieve It: The Case of Swedish MeSH®

Lexical Granularity for Automatic Indexing and Means to Achieve It: The Case of Swedish MeSH®
View Sample PDF
Author(s): Dimitrios Kokkinakis (University of Gothenburg, Sweden)
Copyright: 2009
Pages: 27
Source title: Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration
Source Author(s)/Editor(s): Violaine Prince (University Montpellier 2, France)and Mathieu Roche (University Montpellier 2, France)
DOI: 10.4018/978-1-60566-274-9.ch002

Purchase

View Lexical Granularity for Automatic Indexing and Means to Achieve It: The Case of Swedish MeSH® on the publisher's website for pricing and purchasing information.

Abstract

The identification and mapping of terminology from large repositories of life science data onto concept hierarchies constitute an important initial step for a deeper semantic exploration of unstructured textual content. Accurate and efficient mapping of this kind is likely to provide better means of enhancing indexing and retrieval of text, uncovering subtle differences, similarities and useful patterns, and hopefully new knowledge, among complex surface realisations, overlooked by shallow techniques based on various forms of lexicon look-up approaches. However, a finer-grained level of mapping between terms as they occur in natural language and domain concepts is a cumbersome enterprise that requires various levels of processing in order to make explicit relevant linguistic structures. This chapter highlights some of the challenges encountered in the process of bridging free text to controlled vocabularies and thesauri and vice versa. The author investigates how the extensive variability of lexical terms in authentic data can be efficiently projected to hierarchically structured codes, while means to increase the coverage of the underlying lexical resources are also investigated.

Related Content

Rahul Kumar, Devvret Verma, Bahman Khoshru, Adeyemi Nurudeen Olatunbosun. © 2026. 36 pages.
S. Ida Evangeline. © 2026. 34 pages.
Rahul Kumar, Rachan Karmakar, Sanja Živković, Tanja Vasić. © 2026. 42 pages.
Poonam K. Verma, Nisha Chandran. © 2026. 20 pages.
Odangowei Inetiminebi Ogidi, Shoheb Shakil Shaikh, Mukul Machhindra Barwant. © 2026. 42 pages.
Harsh Virendrabhai Purohit, Veda Pandya. © 2026. 30 pages.
Rachan Karmakar, Divya Gunsola, Debasis Mitra, Viralkumar B. Mandaliya, Arti Thakur, Addisu Assefa, Sourav Chattaraj, Mukul Machhindra Barwant, Uma Eswaranpillai, Ponmurugan Karuppiah. © 2026. 28 pages.
Body Bottom