Quad-Tree Based Multiple Kernel Fuzzy C-Means Clustering for Gene Expression Data
E. Monica Sushil Cynthia, S. Kannan "Quad-Tree Based Multiple Kernel Fuzzy C-Means Clustering for Gene Expression Data". International Journal of Computer Trends and Technology (IJCTT) V27(3):121-125, September 2015. ISSN:2231-2803. www.ijcttjournal.org. Published by Seventh Sense Research Group.
Abstract -
Minute variations in genes can have a
major impact on how humans respond to disease,
environmental factors such as bacteria, viruses,
toxins, chemicals and drugs and other therapies..
Cluster analysis seeks to partition a given data set
into groups based on specified features so that the
data points within a group are more similar to each
other than the points in different groups. The
clustering algorithms have been proven useful for
identifying biologically relevant groups of genes and
samples. Hence in this paper we propose a new
clustering algorithm for gene expression data
associated to three different types of cancer and also
compare with the existing approaches to prove the
novel approach proposed here, has a better
performance, reliability and provide more
meaningful biological significance.
References
[1]. Duda , R. O. & Hart, P. E.(1973), Pattern classification and
scene analysis, Wiley, New York.
[2]. Everitt, B.S. Cluster Analysis. 1993. Third Edition. (New
York and Toronto: Halsted Press, of John Wiley & Sons
Inc.).
[3]. M Telgarsky, A Vattani Hartigan`s Method: k-means
Clustering without Voronoi.
[4]. Mirkin CA, Letsinger RL, Mucic RC, Storhoff JJ. A DNAbased
method for rationally assembling nanoparticles into
macroscopic materials.
[5]. Jiang, D., Tang, C. and Zhang, A.(2004) „Cluster analysis for
gene expression data: a survey?, IEEE Transaction on
Knowledge and Data Engineering, Vol. 16, No. 11, pp.1370-
1386.
[6]. Selva Kumar, S. and Hannah Inbarani, H. (2013) „Analysis of
mixed C-means clustering approach for brain tumor gene
expression data?, Int, J. Data Analysis Techniques and
Strategies, Vol. 5, No. 2, pp.214-228.
[7]. J. C. Dunn A Fuzzy Relative of the ISODATA Process and
Its Use in Detecting Compact Well-Separated Clusters.
[8]. Dezdek, J. C., 1981, Pattern Recognition with fuzzy objective
function algorithms, Plenum press, Newyork, NY.
[9]. Tomida S, et al. (2002) Analysis of expression profile using
fuzzy adaptive resonance theory. Bioinformatics 18(8):1073-
83
[10]. Bishnu PS, Bhattacherjee V (2012) Software fault prediction
using quad tree-based K-means clustering algorithm. IEEE
Trans Knowl Data Eng 24(6):1146–1150
[11]. H.-C. Huang, Y.-Y. Chuang, and C.-S. Chen, “Multiple
kernel fuzzy clustering,” Fuzzy Systems, IEEE Transactions
on, vol. 20, no. 1, pp. 120 –134, feb. 2012.
[12]. X.L. Xie and G. Beni. A validity measure for fuzzy
clustering. IEEE transactions on Pattern Analysis and
Machine Intelligence, 13(4):841-846, 1991.
[13]. D. L. Davies and D. W. Bouldin. A cluster separation
measures IEEE Transactions and Pattern Analysis and
Machine Intelligence, PAMI-1, no. 2:222-227,1979.
[14]. Sitansu Mohanty, Kaberi Das, Debahuti Mishra, Ruchi
Ranjan"Cluster Validity Indices for Gene Expression
Data"International Journal of Computer Trends and
Technology (IJCTT),V4(5):1465-1470 May 2013.ISSN
2231-2803.
[15]. D.Vanisri, Dr.C.Loganathan An Efficient Fuzzy Possibilistic
C-Means with Penalized and Compensated Constraints
Keywords
Clustering, Clustering Algorithms, Gene
Expression analysis, Fuzzy C-Means, Hierarchical
Clustering, Gene Clustering, Gene Expression data,
Quad Tree, Kernel fuzzy C-means.