The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Data Lakes
|
|
Author(s): Anjani Kumar (University of Nebraska at Omaha, USA)and Parvathi Chundi (University of Nebraska at Omaha, USA)
Copyright: 2023
Pages: 15
Source title:
Encyclopedia of Data Science and Machine Learning
Source Author(s)/Editor(s): John Wang (Montclair State University, USA)
DOI: 10.4018/978-1-7998-9220-5.ch025
PurchaseView Data Lakes on the publisher's website for pricing and purchasing information.
|
Abstract
Data lake (DL) technology is popular for its flexibility to handle different raw data formats at the ingestion time as well as at the time of retrieval from the data lake. It typically includes the following five layers data ingestion, staging, processed data, storage and visualization, and analytics. These five layers together provide access to seemingly infinite computation and storage resources for democratizing data access and for supporting a wide variety of analytics tasks in an enterprise. This work is going to explain the four steps approach for doing the analysis task. It will describe the three pillars for building a DL. Then, it will give a brief history of the evolution from Excel Sheet to DL. It will explain the five layers: data ingestion, staging, processed data, storage and visualization, and analytics. It will briefly explain three DL systems, Snowflake, Databricks, and Redshift, and then nine important metrics for these three DL systems will be compared.
Related Content
|
G. Boopathy, Balaji Ganesan, P. Sivaprakasam, T. Kumaran.
© 2026.
42 pages.
|
|
G. Prasad.
© 2026.
14 pages.
|
|
Kishorebabu Dasari, Sujana Parry, Srinivas Mekala.
© 2026.
30 pages.
|
|
Chikesh Ranjan, Jonnalagadda Srinivas, P. S. Balaji, Kaushik Kumar.
© 2026.
24 pages.
|
|
G. Ananthi, S. Mehala Shevani, P. Priyadharshini Devi.
© 2026.
24 pages.
|
|
G. Prasad, Snehal Malik, Aadya Gupta, Yash Nigam.
© 2026.
26 pages.
|
|
Dhirendra Patel, M. L. Azad.
© 2026.
36 pages.
|
|
|