IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Social Information Retrieval Systems: Emerging Technologies and Applications for Searching the Web Effectively

Social Information Retrieval Systems: Emerging Technologies and Applications for Searching the Web Effectively
Author(s)/Editor(s): Dion Goh (Nanyang Technological University, Singapore)and Schubert Foo (Nanyang Technological University, Singapore)
Copyright: ©2008
DOI: 10.4018/978-1-59904-543-6
ISBN13: 9781599045436
ISBN10: 1599045435
EISBN13: 9781599045450

Purchase

View Social Information Retrieval Systems: Emerging Technologies and Applications for Searching the Web Effectively on the publisher's website for pricing and purchasing information.


Description

The wealth of information accessible on the Internet has grown exponentially since its advent. This mass of content must be systemically sifted to glean pertinent data, and the utilization of the collective intelligence of other users, or social information retrieval, is an innovative, emerging technique.

Social Information Retrieval Systems: Emerging Technologies & Applications for Searching the Web Effectively provides relevant content in the areas of information retrieval systems, services, and research; covering topics such as social tagging, collaborative querying, social network analysis, subjective relevance judgments, and collaborative filtering. Answering the increasing demand for authoritative resources on Internet technologies, this Premier Reference Source will make an indispensable addition to any library collection.



Preface

The popularity of the Web has led to a tremendous growth in the volume of available online information. The result is that people have now come to depend on the Web to meet their information needs via search engines, portals, digital libraries and other information retrieval systems. However, the amount of information and its growth is a double-edged sword due to the problem of information overload, exacerbated by the fact that not all content on the Web is relevant or of acceptable quality to information seekers. Information overload has led to a situation where users are swamped with too much information and have to sift through the materials in search of relevant content.

A variety of techniques have been adopted on the Web to address these problems inherent in information search, drawing from the fields of information retrieval, information filtering, human computer interaction and the study of information seeking behavior. Work in these areas has yielded in many novel and useful algorithmic and user interface techniques. Research in information seeking behavior suggests yet an alternative promising approach in helping users meet their information needs. Many studies have found that interaction and collaboration with other people is an important part in the process of information seeking and use. It is not uncommon that in searching for information, we tap on our social networks – friends, colleagues, librarians, to help locate what we need.

Social information retrieval refers to a family of techniques that assist users in meeting their information needs by harnessing the collective intelligence of other users – their expert knowledge or search experience. Elements of social information retrieval may be found on the Web through the hyperlinks that connect different Web sites (e.g. bookmark lists), subject directories (e.g. Yahoo, Open Directory Project), Google’s PageRank algorithm, and user annotations of resources (e.g. Amazon.com’s book reviews and ratings). More contemporary techniques include social tagging, collaborative querying, social network analysis, subjective relevance judgments, and collaborative filtering.

Social information retrieval is an emerging area and a promising avenue for the design and implementation of a new generation of information retrieval systems. It has drawn interest in academia as well as in industry. This book introduces readers to this area as well as discusses the state-of-the-art techniques in social information retrieval. It serves as a resource for those dealing with information retrieval systems, services and research and is written for academics, researchers, information retrieval product managers and software developers, librarians and students.

Organization

The book is organized around five sections into chapters with the following themes:

  • Collaborative querying
  • Collaborative classification and organization
  • Social networks for information retrieval
  • Social information seeking models
  • Social issues
  • Applications and case studies

Section I deals with Collaborative Querying, discussing various techniques that support searching by harnessing other users’ search experiences. This section consists of two chapters:

    Chapter 1, “Collaborating to search effectively in different searcher modes through cues and specialty search” by Naresh Kumar Agarwal and Danny Poo argues that searchers generally have difficulty searching in knowledge repositories because of the quantity of data involved and because searcher mechanisms are not tailored to their differing needs. Collaboration among searchers is one possible solution. They review concepts related to social information retrieval and other collaborative mechanisms, and discusses two collaborative searching mechanisms – cues and specialty search. A case study of an online educational portal is also presented to show how collaboration could enhance learning and search efficacy.

    Chapter 2, “A hybrid clustering approach for collaborative querying” by Ray Chandrani Sinha, Dion Goh, Schubert Foo, Nyein Chan Soe Win and Khasfariyati Razikin describes the concept of collaborative querying, a technique which makes use of past users’ search experiences in order to help the current user formulate an appropriate query. Here, related queries are extracted from query logs, clustered and these queries are used as candidates for recommendations. Query similarity is determined using a combination of query terms as well as search result documents. For the latter, features such as titles, URLs and snippets from the result documents are used. Experimental results reveal that the best clusters are obtained by using a combination of these sources rather than using only query terms or only result URLs alone.

Collaborative Classification and Organization is covered in Section II. Chapters in this section examine how classification schemes and metadata can be constructed collaboratively. The section consists of three chapters:

    Chapter 3, “Collaborative classification for group-oriented organization of search results” by Keiichi Nakata and Amrish Singh begins this section by examining the use of collaborative classification to support social information retrieval by organizing search results. Two approaches, collaborative indexing and search result classification based on shared classification schemes, are described and compared.

    Chapter 4, “A case study of use centered descriptions: Archival descriptions of what can be done with a collection” by Richard Butterworth argues that there is a mismatch between current metadata standards for the description of archival holdings and what many users actually want to know about a collection. Use centered descriptions are proposed as a way of systematically describing what can be done with a collection, and are, in effect, an encoding of library staff’s knowledge about what can be done with a collection. An example of its use by the University of London to encourage wider access to their archival holdings is presented.

    Chapter 5, “Metadata for social recommendations: Storing, sharing and reusing evaluations of learning resources” by Riina Vuorikari, Nikos Manouselis, and Erik Duval discusses how social information retrieval systems can benefit greatly from sharable and reusable evaluations of online resources in the form of metadata. To achieve interoperability among various systems, a common framework to describe the evaluation of such resources is required. Through a review of various approaches, they present an evaluation framework and apply it to learning resources.

Section III focuses on Using Social Networks for Information Retrieval. Although the idea of using social networks to find information is not new, it has gained popularity since the introduction of the Google search engine and therefore warrants an in-depth examination of the techniques involved. There are three chapters in this section:

    Chapter 6, “Social network models for enhancing reference based search engine rankings” by Nikolaos Korfiatis, Miguel-Angel Sicilia, Claudia Hess, Klaus Stein, and Christoph Schlieder begins this section by discussing the integration of information retrieval information from two sources: a social network and a document reference network for enhancing reference based search engine rankings. They elaborate on the basic intuitions that highlight the contribution of the social context, that can be mined from social networks, into the improvement of the rankings provided in reference based search engines. A case study on the Web-based encyclopedia Wikipedia is presented as an illustration of the ideas introduced in this chapter.

    Chapter 7, “From PageRank to Social Rank: Authority-based retrieval in social information spaces” by Sebastian Marius Kirsch, Melanie Gnasa, Markus Won, and Armin Cremers presents methods for utilizing social networks for information retrieval, by applying graph authority measures to the social network. They present techniques for integrating authority measures in an information retrieval algorithm. To demonstrate the applicability of their algorithm the authors examine the structure and statistical properties of social networks, present examples of social networks as well as evaluation results.

    Chapter 8, “Adaptive peer to peer social networks for distributed content based web search” by Le-Shin Wu, Ruj Akavipat, Ana Maguitman, and Filippo Menczer employs social networks in information retrieval from the perspective of collaborative peer to peer networks. Their system, called 6Search (6S) aims to address the scalability limitations of centralized search engines. Each peer crawls the Web in a focused way, guided by its user's information context. Peers also act as a search “servent” by submitting and responding to queries to/from its neighbors. Prototypes of the 6S system are evaluated via simulations which model users based on actual Web crawls. The quality of the results obtained are also compared against centralized search engines such as Google.

Section IV shifts its attention to examine Social Issues pertaining to social information retrieval systems. This section consists of two chapters:

    Chapter 9, “The Ethics of Social Information Retrieval” by Brendan Luyt and Chu-Keong Lee attempts to examine social networking and social information retrieval in the context of Habermas’s concepts of public sphere and communication actions against the problem of homophily (where a contact between similar people occurs at a higher rate than among dissimilar people), and posits that such activity is likely to increase the fragmentation of society and a reduction in social diversity as groups become more homogenous and isolated from rest of society who also have important roles to play towards learning, among others. They conclude with a call for more responsible and aware technological designs with an emphasis on values so that the effects of homophily are addressed.

    Chapter 10, “The Social Context of Knowledge“ by Daniel Memmi demonstrates that collaborative information management is now the dominant type of occupation with human information processing predominately taking place in a large social context. Interactions are supported by a range of collaborative information tools and systems which are designed to support the seeking, diffusion and management of explicit and tacit knowledge. While some of these systems are virtually grouping similar users together (thereby promoting the problem of homophily as discussed in Chapter 9!), there is also a need to find new solutions by better understanding and modeling human cognitive processes in information processing from diverse heterogeneous sources to enable the creation of new design ideas for future systems.

Section V on Social Information Seeking Models presents a set of different information seeking models in different contexts and highlights the wide ranging applicability of this new emerging field of social information retrieval. This section consists of three chapters:

    Chapter 11, “Social Information Seeking in Digital Libraries” by George Buchanan and Annika Hinze, first showed a number of contrasting uses of the social aspects of information seeking and abstracts them over a number of different approaches to present underlying principles, architectures and models. Using digital library as a technological platform, the authors propose the provision of social context by the addition and integration of recommendation, alerting and communication services into the architecture. They suggest that effective social information seeking pivots on closing the gap between human communication and the digital library.

    Chapter 12, “Relevant Intra-actions in Networked Environments” by Theresa Anderson provides a conceptual framework for relevance as a socially-situated phenomenon, and goes on to describe an ethnographic study of academics engaged in research projects making relevant judgements of information when working with networked information systems. Relevance assessment when theorized as intra-action shows such judgments as emergent constructions that arose as a result of interplay between social and personal, technical and human elements in such a networked environment. The understanding of such a perspective can enable better collaborative systems to be designed, such as the facilitation for creating collaborative metadata schemes to enable alternative representations of content and to cater for different information seeking behaviours in different contexts.

    Chapter 13, “Publication and Citation Analysis as a Tool for Information Retrieval” by Ronald Rousseau defines citation analysis in the context of applied social network theory and highlights the relationship between citation linking as a source for information retrieval based on social interaction where authors cite and co-cite each other’s publications. Relations between citation analysis and web links in the emerging field of Webometrics are also distinguished. The author takes a peek into the future where he envisages the integration of local and regional citations indexes into a virtual world citation atlas; the spot translation of existing scientific non-English literature on the web to increase the knowledge base and visibility and citation levels of such authors; and the establishment of global repositories for research, and others.

Section VI concludes this book by presenting Applications and Case Studies in Social Information Retrieval. The focus of this chapter is to examine where and how social information retrieval systems have been applied. There are three chapters in this section:

    Chapter 14, “Personalized information retrieval in a semantic-based learning environment” by Antonella Carbonaro and Rodolfo Ferrini apply social information retrieval techniques to the education domain. They introduce a personalized learning environment providing intelligent support to achieve the expectations of active learning. The system exploits collaborative and semantic approaches to extract concepts from documents, and maintain user and resources profiles based on domain ontologies. With this approach, the information retrieval process is able to produce personalized views of the learning environment.

    Chapter 15, “Multi-Agent Tourism System (MATS)” by Soe Yu Maw and Myo-Myo Naing addresses the tourism domain. They argue that tourism information on the Web is dynamic and it is not easy to obtain relevant and updated information to meet an individual’s needs. To address this issue, they developed the Multi-Agent Tourism System (MATS) with the goal of providing relevant and updated information tailored to the user's interests. Key to the system is the Rule-based Personalization with Collaborative Filtering Technique for personalization in MATS. The technique is able to address the limitations of pure collaborative filtering which includes scalability, sparsity and cold-start problems.

    Chapter 16, “Hybrid recommendation systems: A case study on the movies domain” by Konstantinos Markellos, Penelope Markellou, Aristotelis Mertis, Ioanna Mousourouli, Angeliki Panayiotaki, and Athanasios Tsakalidis examine how social information retrieval can applied to e-commerce sites, focusing in particular on recommendation systems. They investigate hybrid recommendation systems and the way they can support movie e-shops to suggest movies to customers. Specifically, the authors introduce a recommendation approach where knowledge about customers and movies is extracted from usage mining and ontological data in conjunction with customer-movie ratings and matching techniques between customers.

More...
Less...

Reviews and Testimonials

This book introduces readers to this area as well as discusses the state-of-the-art techniques in social information retrieval.

– Dion Goh, Nanyang Technological University, Singapore

With the increasing demand for authoritative resources on internet technologies, this reference source will be indispensible for appropriate libraries.

– APADE (2007)

This book's 17 chapters have a consistent style: an introduction describing what will be presented, often followed by relevant definitions, then the body of the chapter, a summary conclusion, and references. Summing Up: Recommended.

– CHOICE, Vol. 45, No. 08 (April 2008)

Author's/Editor's Biography

Dion Goh (Ed.)
Dion Goh has a PhD in computer science and is an Associate Professor at the Division of Information Studies, Wee Kim Wee School of Communication and Information. He is also Director of the school’s Master of Science in Information Systems program. His research areas are in the areas of information retrieval, digital libraries, text mining, use of information technology in education and user interfaces.

Schubert Foo (Ed.)
Dr. Schubert Foo is Professor and Vice-Dean of the School of Communication and Information at Nanyang Technological University, Singapore. He was the former Head of the Division of Information Studies that offers two Master of Science programmes in Information Studies and Knowledge Management. Dr Foo holds a PhD in Materials Engineering, M.B.A. and a B.Sc. (Hons) in Mechanical Engineering from the University of Strathclyde, UK. He is Fellow of the Institution of Mechanical Engineers, UK, and Fellow of the British Computer Society, UK. He sits on the editorial boards of Journal of Information Science, Journal of Information & Knowledge Management, and International Yearbook of Library and Information Management. He has published more than 110 international journal and conference papers and book chapters in the areas of multimedia technology, Internet technology, multilingual information retrieval, digital libraries and knowledge management.

More...
Less...

Body Bottom