UDC 004.423, DOI: 10.2298/csis0902165H
Microarray Missing Values Imputation Methods: Critical Analysis Review
- Faculty of Information Technology, Al Ahliyya Amman University
Amman, Jordan
Mouath.hourani@gmail.com - Mouath.hourani@gmail.com
Amman, Jordan
omary57@hotmail.com
Abstract
Gene expression data often contain missing expression values. For the purpose of conducting an effective clustering analysis and since many algorithms for gene expression data analysis require a complete matrix of gene array values, choosing the most effective missing value estimation method is necessary. In this paper, the most commonly used imputation methods from literature are critically reviewed and analyzed to explain the proper use, weakness and point the observations on each published method. From the conducted analysis, we conclude that the Local Least Square (LLS) and Support Vector Regression (SVR) algorithms have achieved the best performances. SVR can be considered as a complement algorithm for LLS especially when applied to noisy data. However, both algorithms suffer from some deficiencies presented in choosing the value of Number of Selected Genes (K) and the appropriate kernel function. To overcome these drawbacks, the need for new method that automatically chooses the parameters of the function and it also has an appropriate computational complexity is imperative.
Key words
Completely at random (MCAR), Missing At Random (MAR), Sequential K-Nearest Neighbors (SKNN), Gene Ontology (GO), Singular Value Decomposition (SVD), Least Squares Imputation (LSI), Local Least Square Imputation (LLSI), Bayesian Principal Component Analysis (BPCA) and Fixed Rank Approximation Method (FRAA)
Digital Object Identifier (DOI)
https://doi.org/10.2298/csis0902165H
Publication information
Volume 6, Issue 2 (December 2009)
Year of Publication: 2009
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium
Full text
Available in PDF
Portable Document Format
How to cite
Hourani, M., Emary, I. M. M. E.: Microarray Missing Values Imputation Methods: Critical Analysis Review. Computer Science and Information Systems, Vol. 6, No. 2, 165-190. (2009), https://doi.org/10.2298/csis0902165H