The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Big Data Techniques for Supporting Official Statistics: The Use of Web Scraping for Collecting Price Data
Abstract
Following the advent of Big Data, statistical offices have been largely exploring the use of Internet as data source for modernizing their data collection process. Particularly, prices are collected online in several statistical institutes through a technique known as web scraping. The objective of the chapter is to discuss the challenges of web scraping for setting up a continuous data collection process, exploring and classifying the more widespread techniques and presenting how they are used in practical cases. The main technical notions behind web scraping are presented and explained in order to give also to readers with no background in IT the sufficient elements to fully comprehend scraping techniques, promoting the building of mixed skills that is at the core of the spirit of modern data science. Challenges for official statistics deriving from the use of web scraping are briefly sketched. Finally, research ideas for overcoming the limitations of current techniques are presented and discussed.
Related Content
N. Geethanjali, K. M. Ashifa, Avantika Raina, Jayashree Patil, Rameshwaran Byloppilly, S. Suman Rajest.
© 2024.
19 pages.
|
Praveen Kakada, Muhammed Shafi M. K..
© 2024.
14 pages.
|
P. S. Venkateswaran, Divya Marupaka, Sachin Parate, Amit Bhanushali, Latha Thammareddi, P. Paramasivan.
© 2024.
15 pages.
|
M. Lishmah Dominic, P. S. Venkateswaran, Latha Thamma Reddi, Sandeep Rangineni, R. Regin, S. Suman Rajest.
© 2024.
15 pages.
|
S. Sivabala, P. Vidyasri.
© 2024.
23 pages.
|
H. Hajra, G. Jayalakshmi.
© 2024.
22 pages.
|
Anusha Thakur.
© 2024.
15 pages.
|
|
|