IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Data Intensive Computing for Bioinformatics

Data Intensive Computing for Bioinformatics
View Sample PDF
Author(s): Judy Qiu (Indiana University - Bloomington, USA), Jaliya Ekanayake (Indiana University - Bloomington, USA), Thilina Gunarathne (Indiana University - Bloomington, USA), Jong Youl Choi (Indiana University - Bloomington, USA), Seung-Hee Bae (Indiana University - Bloomington, USA), Yang Ruan (Indiana University - Bloomington, USA), Saliya Ekanayake (Indiana University - Bloomington, USA), Stephen Wu (Indiana University - Bloomington, USA), Scott Beason (Computer Sciences Corporation, USA), Geoffrey Fox (Indiana University - Bloomington, USA), Mina Rho (Indiana University - Bloomington, USA)and Haixu Tang (Indiana University - Bloomington, USA)
Copyright: 2013
Pages: 35
Source title: Bioinformatics: Concepts, Methodologies, Tools, and Applications
Source Author(s)/Editor(s): Information Resources Management Association (USA)
DOI: 10.4018/978-1-4666-3604-0.ch016

Purchase

View Data Intensive Computing for Bioinformatics on the publisher's website for pricing and purchasing information.

Abstract

Data intensive computing, cloud computing, and multicore computing are converging as frontiers to address massive data problems with hybrid programming models and/or runtimes including MapReduce, MPI, and parallel threading on multicore platforms. A major challenge is to utilize these technologies and large-scale computing resources effectively to advance fundamental science discoveries such as those in Life Sciences. The recently developed next-generation sequencers have enabled large-scale genome sequencing in areas such as environmental sample sequencing leading to metagenomic studies of collections of genes. Metagenomic research is just one of the areas that present a significant computational challenge because of the amount and complexity of data to be processed. This chapter discusses the use of innovative data-mining algorithms and new programming models for several Life Sciences applications. The authors particularly focus on methods that are applicable to large data sets coming from high throughput devices of steadily increasing power. They show results for both clustering and dimension reduction algorithms, and the use of MapReduce on modest size problems. They identify two key areas where further research is essential, and propose to develop new O(NlogN) complexity algorithms suitable for the analysis of millions of sequences. They suggest Iterative MapReduce as a promising programming model combining the best features of MapReduce with those of high performance environments such as MPI.

Related Content

Alessandra Lima da Silva, Diego Mariano, Mariana Parise, Angie L. A. Puelles, Tatiane Senna Bialves, Luana Luiza Bastos, Lucas Santos, Rafael Pereira Lemos. © 2025. 22 pages.
Seyyed Mohammad Amin Mousavi Sagharchi, Mohsen Sheykhhasan, Atousa Ghorbani, Elina Afrazeh, Naresh Poondla, Naser Kalhor, Hamid Tanzadehpanah, Hanie Mahaki, Hamed Manoochehri. © 2025. 46 pages.
Eduarda Guimarães Sousa, Lucas Gabriel Rodrigues Gomes, Fernanda Diniz Prates, Talita Pereira Gomes, Gabriel Camargos Gomes, Janaíne Aparecida de Paula, Ana Lua de Oliveira Vinhal, Bernardo Buhr Alves Mendonça, Mariana Letícia Costa Pedrosa, Luiza Pereira Reis, Aline Ferreira Maciel de Oliveira, Marcus Vinicius Canário Viana, Arun Kumar Jaiswal, Siomar de Castro Soares, Vasco Ariston de Carvalho Azevedo. © 2025. 38 pages.
Diego Mariano, Lucas Moraes dos Santos, Raquel Cardoso de Melo-Minardi. © 2025. 30 pages.
Alessandra G. Cioletti, Frederico C. Carvalho, Lucas M. Dos Santos, Raquel C. M. Minardi. © 2025. 32 pages.
Leandro Morais de Oliveira, Luana Luiza Bastos, Vivian Morais Paixão, Leticia Aparecida Gontijo, Tatiane Senna Bialves, Diego Mariano, Raquel Cardoso de Melo Minardi. © 2025. 40 pages.
Angie Atoche Puelles, Luana Luiza Bastos, Vivian Morais Paixão, Sheila Cruz Araujo, Raquel Cardoso de Melo Minardi. © 2025. 28 pages.
Body Bottom