The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Search Engine-Based Web Information Extraction
|
Author(s): Gijs Geleijnse (Philips Research, The Netherlands)and Jan Korst (Philips Research, The Netherlands)
Copyright: 2010
Pages: 34
Source title:
Web Technologies: Concepts, Methodologies, Tools, and Applications
Source Author(s)/Editor(s): Arthur Tatnall (Victoria University, Australia)
DOI: 10.4018/978-1-60566-982-3.ch109
PurchaseView on the publisher's website for pricing and purchasing information.
|
Abstract
In this chapter we discuss approaches to find, extract, and structure information from natural language texts on the Web. Such structured information can be expressed and shared using the standard Semantic Web languages and hence be machine interpreted. In this chapter we focus on two tasks in Web information extraction. The first part focuses on mining facts from the Web, while in the second part, we present an approach to collect community-based meta-data. A search engine is used to retrieve potentially relevant texts. From these texts, instances and relations are extracted. The proposed approaches are illustrated using various case-studies, showing that we can reliably extract information from the Web using simple techniques.
Related Content
Dina Darwish.
© 2024.
28 pages.
|
Dina Darwish.
© 2024.
28 pages.
|
Muhammad Ahmed, Adnan Ahmad, Furkh Zeshan, Hamid Turab.
© 2024.
33 pages.
|
Pankaj Bhambri.
© 2024.
17 pages.
|
Kaushikkumar Patel.
© 2024.
20 pages.
|
Vijaya Kittu Manda, Arnold Mashud Abukari, Vivek Gupta, Madavarapu Jhansi Bharathi.
© 2024.
24 pages.
|
Pankaj Bhambri.
© 2024.
17 pages.
|
|
|