The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Word Sense Disambiguation in Biomedical Applications: A Machine Learning Approach
Abstract
Ambiguity is a common phenomenon in text, especially in the biomedical domain. For instance, it is frequently the case that a gene, a protein encoded by the gene, and a disease associated with the protein share the same name. Resolving this problem, that is, assigning to an ambiguous word in a given context its correct meaning is called word sense disambiguation (WSD). It is a pre-requisite for associating entities in text to external identifiers and thus to put the results from text mining into a larger knowledge framework. In this chapter, we introduce the WSD problem and sketch general approaches for solving it. The authors then describe in detail the results of a study in WSD using classification. For each sense of an ambiguous term, they collected a large number of exemplary texts automatically and used them to train an SVM-based classifier. This method reaches a median success rate of 97%. The authors also provide an analysis of potential sources and methods to obtain training examples, which proved to be the most difficult part of this study.
Related Content
|
Rahul Kumar, Devvret Verma, Bahman Khoshru, Adeyemi Nurudeen Olatunbosun.
© 2026.
36 pages.
|
|
S. Ida Evangeline.
© 2026.
34 pages.
|
|
Rahul Kumar, Rachan Karmakar, Sanja Živković, Tanja Vasić.
© 2026.
42 pages.
|
|
Poonam K. Verma, Nisha Chandran.
© 2026.
20 pages.
|
|
Odangowei Inetiminebi Ogidi, Shoheb Shakil Shaikh, Mukul Machhindra Barwant.
© 2026.
42 pages.
|
|
Harsh Virendrabhai Purohit, Veda Pandya.
© 2026.
30 pages.
|
|
Rachan Karmakar, Divya Gunsola, Debasis Mitra, Viralkumar B. Mandaliya, Arti Thakur, Addisu Assefa, Sourav Chattaraj, Mukul Machhindra Barwant, Uma Eswaranpillai, Ponmurugan Karuppiah.
© 2026.
28 pages.
|
|
|