IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Word Sense Disambiguation in Biomedical Applications: A Machine Learning Approach

Word Sense Disambiguation in Biomedical Applications: A Machine Learning Approach
View Sample PDF
Author(s): Torsten Schiemann (Humboldt-Universität zu Berlin, Germany), Ulf Leser (Humboldt-Universität zu Berlin, Germany)and Jörg Hakenberg (Arizona State University, USA)
Copyright: 2009
Pages: 20
Source title: Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration
Source Author(s)/Editor(s): Violaine Prince (University Montpellier 2, France)and Mathieu Roche (University Montpellier 2, France)
DOI: 10.4018/978-1-60566-274-9.ch008

Purchase

View Word Sense Disambiguation in Biomedical Applications: A Machine Learning Approach on the publisher's website for pricing and purchasing information.

Abstract

Ambiguity is a common phenomenon in text, especially in the biomedical domain. For instance, it is frequently the case that a gene, a protein encoded by the gene, and a disease associated with the protein share the same name. Resolving this problem, that is, assigning to an ambiguous word in a given context its correct meaning is called word sense disambiguation (WSD). It is a pre-requisite for associating entities in text to external identifiers and thus to put the results from text mining into a larger knowledge framework. In this chapter, we introduce the WSD problem and sketch general approaches for solving it. The authors then describe in detail the results of a study in WSD using classification. For each sense of an ambiguous term, they collected a large number of exemplary texts automatically and used them to train an SVM-based classifier. This method reaches a median success rate of 97%. The authors also provide an analysis of potential sources and methods to obtain training examples, which proved to be the most difficult part of this study.

Related Content

Rahul Kumar, Devvret Verma, Bahman Khoshru, Adeyemi Nurudeen Olatunbosun. © 2026. 36 pages.
S. Ida Evangeline. © 2026. 34 pages.
Rahul Kumar, Rachan Karmakar, Sanja Živković, Tanja Vasić. © 2026. 42 pages.
Poonam K. Verma, Nisha Chandran. © 2026. 20 pages.
Odangowei Inetiminebi Ogidi, Shoheb Shakil Shaikh, Mukul Machhindra Barwant. © 2026. 42 pages.
Harsh Virendrabhai Purohit, Veda Pandya. © 2026. 30 pages.
Rachan Karmakar, Divya Gunsola, Debasis Mitra, Viralkumar B. Mandaliya, Arti Thakur, Addisu Assefa, Sourav Chattaraj, Mukul Machhindra Barwant, Uma Eswaranpillai, Ponmurugan Karuppiah. © 2026. 28 pages.
Body Bottom