IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Semi-Supervised Clustering for the Identification of Different Cancer Types Using the Gene Expression Profiles

Semi-Supervised Clustering for the Identification of Different Cancer Types Using the Gene Expression Profiles
View Sample PDF
Author(s): Manuel Martín-Merino (University Pontificia of Salamanca, Spain)
Copyright: 2013
Pages: 17
Source title: Bioinformatics: Concepts, Methodologies, Tools, and Applications
Source Author(s)/Editor(s): Information Resources Management Association (USA)
DOI: 10.4018/978-1-4666-3604-0.ch084

Purchase


Abstract

DNA Microarrays allow for monitoring the expression level of thousands of genes simultaneously across a collection of related samples. Supervised learning algorithms such as k-NN or SVM (Support Vector Machines) have been applied to the classification of cancer samples with encouraging results. However, the classification algorithms are not able to discover new subtypes of diseases considering the gene expression profiles. In this chapter, the author reviews several supervised clustering algorithms suitable to discover new subtypes of cancer. Next, he introduces a semi-supervised clustering algorithm that learns a linear combination of dissimilarities from the a priory knowledge provided by human experts. A priori knowledge is formulated in the form of equivalence constraints. The minimization of the error function is based on a quadratic optimization algorithm. A L2 norm regularizer is included that penalizes the complexity of the family of distances and avoids overfitting. The method proposed has been applied to several benchmark data sets and to human complex cancer problems using the gene expression profiles. The experimental results suggest that considering a linear combination of heterogeneous dissimilarities helps to improve both classification and clustering algorithms based on a single similarity.

Related Content

Alessandra Lima da Silva, Diego Mariano, Mariana Parise, Angie L. A. Puelles, Tatiane Senna Bialves, Luana Luiza Bastos, Lucas Santos, Rafael Pereira Lemos. © 2025. 22 pages.
Seyyed Mohammad Amin Mousavi Sagharchi, Mohsen Sheykhhasan, Atousa Ghorbani, Elina Afrazeh, Naresh Poondla, Naser Kalhor, Hamid Tanzadehpanah, Hanie Mahaki, Hamed Manoochehri. © 2025. 46 pages.
Eduarda Guimarães Sousa, Lucas Gabriel Rodrigues Gomes, Fernanda Diniz Prates, Talita Pereira Gomes, Gabriel Camargos Gomes, Janaíne Aparecida de Paula, Ana Lua de Oliveira Vinhal, Bernardo Buhr Alves Mendonça, Mariana Letícia Costa Pedrosa, Luiza Pereira Reis, Aline Ferreira Maciel de Oliveira, Marcus Vinicius Canário Viana, Arun Kumar Jaiswal, Siomar de Castro Soares, Vasco Ariston de Carvalho Azevedo. © 2025. 38 pages.
Diego Mariano, Lucas Moraes dos Santos, Raquel Cardoso de Melo-Minardi. © 2025. 30 pages.
Alessandra G. Cioletti, Frederico C. Carvalho, Lucas M. Dos Santos, Raquel C. M. Minardi. © 2025. 32 pages.
Leandro Morais de Oliveira, Luana Luiza Bastos, Vivian Morais Paixão, Leticia Aparecida Gontijo, Tatiane Senna Bialves, Diego Mariano, Raquel Cardoso de Melo Minardi. © 2025. 40 pages.
Angie Atoche Puelles, Luana Luiza Bastos, Vivian Morais Paixão, Sheila Cruz Araujo, Raquel Cardoso de Melo Minardi. © 2025. 28 pages.
Body Bottom