Biclustering Using Modified Matrix Bandwidth Minimization and Biogeography-based Optimization.

Briti Deb, Indrajit Mukherjee

arXiv: Data Structures and Algorithms (2018)

Abstract
A data matrix whose rows and columns represent different sets of entities is known as two-mode data or affiliation data. Many practical problems require finding relationships between the two modes by simultaneously clustering the rows and columns, a problem commonly known as biclustering. We propose a novel biclustering algorithm that uses the matrix reordering approach introduced by the Cuthill-McKee bandwidth minimization algorithm, adapted to operate on non-square and non-binary matrices, without needing to know a priori the number of naturally occurring biclusters. This transforms a two-mode matrix into an almost block-diagonal form, where the blocks indicate the clusters between the two modes of the matrix. To optimize the bandwidth minimization problem, we adapted the Biogeography-based Optimization algorithm, using the logistic equation to model its migration rates. Preliminary studies indicate that this technique can reveal the underlying biclusters in the data and has potential for further research in two-mode data analysis.
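To make the reordering idea concrete, here is a minimal sketch of a Cuthill-McKee-style breadth-first ordering applied to a rectangular matrix. It is not the authors' algorithm: it assumes that any nonzero entry counts as an edge of a bipartite row/column graph (so non-binary weights are ignored), it leaves out the Biogeography-based Optimization step entirely, and the function names `bipartite_cuthill_mckee` and `reorder` are hypothetical.

```python
from collections import deque


def bipartite_cuthill_mckee(matrix):
    """Order rows and columns by a degree-sorted BFS over the bipartite
    graph whose edges are the nonzero entries (an assumption of this sketch)."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    row_nbrs = [[j for j in range(n_cols) if matrix[i][j]] for i in range(n_rows)]
    col_nbrs = [[i for i in range(n_rows) if matrix[i][j]] for j in range(n_cols)]

    def degree(node):
        kind, idx = node
        return len(row_nbrs[idx]) if kind == "r" else len(col_nbrs[idx])

    def neighbours(node):
        kind, idx = node
        if kind == "r":
            return [("c", j) for j in row_nbrs[idx]]
        return [("r", i) for i in col_nbrs[idx]]

    nodes = [("r", i) for i in range(n_rows)] + [("c", j) for j in range(n_cols)]
    visited, order = set(), []
    # Start each connected component from a lowest-degree unvisited node.
    for start in sorted(nodes, key=degree):
        if start in visited:
            continue
        visited.add(start)
        queue = deque([start])
        while queue:
            node = queue.popleft()
            order.append(node)
            # Visit unvisited neighbours in order of increasing degree.
            for nb in sorted(neighbours(node), key=degree):
                if nb not in visited:
                    visited.add(nb)
                    queue.append(nb)

    row_order = [idx for kind, idx in order if kind == "r"]
    col_order = [idx for kind, idx in order if kind == "c"]
    return row_order, col_order


def reorder(matrix, row_order, col_order):
    """Apply the row and column permutations to the matrix."""
    return [[matrix[i][j] for j in col_order] for i in row_order]


# Toy two-mode matrix with two interleaved biclusters.
matrix = [
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
]
row_order, col_order = bipartite_cuthill_mckee(matrix)
blocked = reorder(matrix, row_order, col_order)
for row in blocked:
    print(row)
```

On this toy input the two interleaved biclusters come out as two 2x2 blocks on the diagonal. Real data with noise or weighted entries would need the adaptations the abstract mentions, which this sketch does not attempt.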