Document Classification

Copyright: 2021
Pages: 5
Source title: Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities
Source Author(s)/Editor(s): Dimple Valayil Paul (Department of Computer Science, Dnyanprassarak Mandal's College and Research Centre, Goa University, Goa, India)
DOI: 10.4018/978-1-7998-3772-5.ch007

Keywords: Data Mining / Data Mining and Databases / Information Science Reference / Library & Information Science

Abstract

Keywords can be used as attributes for mining rules or as a basis for measuring the similarity of new (unclassified) documents to existing (classified) ones. The chapter focuses on the problem of extracting keywords from a document collection so that they can serve as attributes for document classification. Document classification is an active research topic in machine learning. Typical approaches extract “features,” generally words, from each document and use the resulting feature vectors as input to a machine learning scheme that learns how to classify documents. This “bag of keywords” model neglects keyword order and contextual effects.
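
The idea in the abstract can be illustrated with a minimal sketch, which is not the chapter's own method. It assumes a small hand-picked keyword list (`KEYWORDS`), two made-up labeled documents (`LABELED`), and cosine similarity as the similarity measure; all of these names and choices are hypothetical. Each document becomes a vector of keyword counts, and a new document takes the label of the most similar classified one.

```python
import math
import re
from collections import Counter

# Hypothetical keyword list; in practice it would come from a keyword extractor.
KEYWORDS = ["goal", "match", "team", "election", "vote", "party"]

# Hypothetical classified documents as (text, label) pairs.
LABELED = [
    ("The team scored a late goal to win the match", "sports"),
    ("The party lost the vote in the election", "politics"),
]


def keyword_vector(text, keywords=KEYWORDS):
    """Count how often each keyword occurs in text, ignoring word order."""
    counts = Counter(re.findall(r"[a-z]+", text.lower()))
    return [counts[k] for k in keywords]


def cosine(u, v):
    """Cosine similarity of two count vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def classify(text, labeled_docs=LABELED):
    """Return the label of the classified document most similar to text."""
    vec = keyword_vector(text)
    best = max(labeled_docs, key=lambda pair: cosine(vec, keyword_vector(pair[0])))
    return best[1]


print(classify("Every team wants a goal in the first match"))
print(classify("Who will vote for which party"))
```

Because only counts are kept, `keyword_vector("team goal match")` and `keyword_vector("match goal team")` are identical, which is the loss of keyword order and context that the abstract describes.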
