Handling Large Databases in Data Mining

View Free PDF

Author(s): M. Mehdi Owrang (American University, USA)
Copyright: 2000
Pages: 5
Source title: Challenges of Information Technology Management in the 21st Century
Source Editor(s): Mehdi Khosrow-Pour, D.B.A. (Information Resources Management Association, USA)
DOI: 10.4018/978-1-878289-84-1.ch012
ISBN13: 9781878289841
EISBN13: 9781466665316

Keywords: Engineering Science Reference / IT Research & Theory / IT Research and Theory / Library & Information Science

Abstract

Current database technology involves processing a large volume of data in order to discover new knowledge. The high volume of data makes discovery process computationally expensive. In addition, real-world databases tend to be incomplete, redundant, and inconsistent that could lead to discovering redundant and inconsistent knowledge. We propose to use domain knowledge to reduce the size of the database being considered for discovery and to optimize the hypothesis (representing the pattern to be discovered) by eliminating implied, unnecessary, and redundant conditions from the hypothesis. The benefits can be greater efficiency and the discovery of more meaningful, non-redundant, non-trivial, and consistent rules.

IRMA Offers Over 2,500 Full Text Open Access Research Papers for Free Download Click to Start Searching Free IRM Research!

IRMA Sponsors

Encyclopedia of Information Science and Technology, Fourth Edition