Data Quality in Data Warehouses
Abstract
Fayyad and Uthurusamy (2002) have stated that the majority of the work (representing months or years) in creating a data warehouse goes into cleaning up duplicates and resolving other anomalies. This paper provides an overview of two methods for improving data quality. The first is record linkage, for finding duplicates within or across files. The second is edit/imputation, for enforcing business rules and for filling in missing data. The fastest record linkage methods are suitable for files with hundreds of millions of records (Winkler, 2004a, 2008). The fastest edit/imputation methods are suitable for files with millions of records (Winkler, 2004b, 2007a).
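The record linkage idea described above can be illustrated with a minimal sketch: records are first grouped into blocks on a cheap key (blocking is what makes linkage feasible on very large files), and only within-block pairs are compared with a string-similarity score. The data, the blocking key (ZIP code), and the use of `difflib` as a stand-in for a Jaro-Winkler style comparator are all illustrative assumptions, not the methods of the paper itself.

```python
from difflib import SequenceMatcher

# Toy records: (id, name, zip). Real files may have hundreds of millions of rows.
records = [
    (1, "John Smith", "20233"),
    (2, "Jon Smith", "20233"),
    (3, "Mary Jones", "10001"),
    (4, "Mary Jone", "10001"),
    (5, "Alan Turing", "90210"),
]

def block_key(rec):
    # Blocking: compare only records that share a cheap key (here, ZIP code),
    # so the number of pairwise comparisons stays manageable.
    return rec[2]

def similarity(a, b):
    # Stand-in for a Jaro-Winkler style comparator; difflib's ratio is a
    # simpler stdlib approximation used purely for illustration.
    return SequenceMatcher(None, a, b).ratio()

def find_duplicates(recs, threshold=0.85):
    # Group records into blocks, then score all within-block pairs.
    blocks = {}
    for r in recs:
        blocks.setdefault(block_key(r), []).append(r)
    pairs = []
    for group in blocks.values():
        for i in range(len(group)):
            for j in range(i + 1, len(group)):
                if similarity(group[i][1], group[j][1]) >= threshold:
                    pairs.append((group[i][0], group[j][0]))
    return pairs

print(find_duplicates(records))
```

A production system would replace the similarity function with calibrated comparators and probabilistic match weights, but the block-then-compare structure is the same.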