TotalPLS: Local Dimension Reduction for Multicategory Microarray Data

IEEE Transactions on Human-Machine Systems (2014)

Abstract
Dimension reduction is an important topic in data mining and is widely used in genetics, medicine, and bioinformatics. We propose a new local dimension reduction algorithm, TotalPLS, that operates in a unified partial least squares (PLS) framework and implements an information fusion of PLS-based feature selection and feature extraction. This paper focuses on extracting the potential structure hidden in high-dimensional multicategory microarray data, and on interpreting and understanding the results that this structure information provides. First, we propose using PLS-based recursive feature elimination (PLSRFE) in multicategory problems. Then, we perform feature importance analysis based on PLSRFE for high-dimensional microarray data to determine the subset of informative features (biomarkers) relevant to the tumor subtypes under study. Finally, PLS-based supervised feature extraction is conducted on the selected gene subset to extract comprehensive features that best reflect the nature of the classification and have discriminating ability. The proposed algorithm is compared with several state-of-the-art methods on multiple high-dimensional multicategory microarray datasets. The comparison is performed in terms of recognition accuracy, relevance, and redundancy. Experimental results show that the proposed algorithm can improve the recognition rate and computational efficiency. Furthermore, mining potential structure information improves the interpretability and understandability of the recognition results. The proposed algorithm can be effectively applied to microarray data analysis for the discovery of gene coexpression and coregulation.
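
The two-stage idea in the abstract (PLS-based recursive feature elimination, then PLS-based feature extraction on the surviving genes) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the importance criterion or the elimination schedule, so the sketch assumes features are ranked by the sum of squared PLS weights and that a fixed fraction (`step`) is dropped per round. The function names `one_hot`, `pls_fit`, and `pls_rfe`, and the synthetic data, are hypothetical.

```python
import numpy as np

def one_hot(labels):
    """Encode integer class labels 0..K-1 as an indicator matrix."""
    labels = np.asarray(labels)
    return np.eye(labels.max() + 1)[labels]

def pls_fit(X, Y, n_components=2):
    """NIPALS-style PLS: return weights W (features x comps) and scores T (samples x comps)."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    W, T = [], []
    for _ in range(n_components):
        # The dominant left singular vector of X^T Y maximizes cov(Xw, Yc).
        w = np.linalg.svd(X.T @ Y, full_matrices=False)[0][:, 0]
        t = X @ w
        tt = t @ t
        if tt < 1e-12:
            break
        # Deflate X and Y by the part explained by the score vector t.
        X = X - np.outer(t, X.T @ t / tt)
        Y = Y - np.outer(t, Y.T @ t / tt)
        W.append(w)
        T.append(t)
    return np.column_stack(W), np.column_stack(T)

def pls_rfe(X, Y, n_keep, n_components=2, step=0.5):
    """Recursively drop the lowest-weighted features until n_keep remain.

    Assumption: importance = sum of squared PLS weights per feature.
    """
    active = np.arange(X.shape[1])
    while active.size > n_keep:
        W, _ = pls_fit(X[:, active], Y, n_components)
        importance = (W ** 2).sum(axis=1)
        n_drop = max(1, min(int(active.size * step), active.size - n_keep))
        keep = np.argsort(importance)[n_drop:]
        active = np.sort(active[keep])
    return active

# Synthetic stand-in for microarray data: 60 samples, 200 "genes", 3 classes.
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 20)
X = rng.normal(size=(60, 200))
# Make the first 6 features class-dependent (two per class).
X[:, :6] += 2.0 * np.repeat(np.eye(3), 2, axis=1)[labels]

selected = pls_rfe(X, one_hot(labels), n_keep=10)                      # feature selection
_, scores = pls_fit(X[:, selected], one_hot(labels), n_components=2)  # feature extraction
print(selected, scores.shape)
```

Here `selected` plays the role of the candidate biomarker subset, and `scores` holds the low-dimensional extracted features that a downstream classifier would consume. In practice both stages would be fitted inside cross-validation so that selection does not see test samples.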
Keywords
local dimension reduction algorithm, gene coexpression, microarray data analysis, high-dimensional multicategory microarray data, PLS-based recursive feature elimination, learning (artificial intelligence), genetics, PLS-based feature selection, information fusion, pattern classification, data analysis, TotalPLS, unified partial least squares framework, biology computing, dimension reduction, least squares approximations, medicine, feature extraction, classification nature, PLSRFE, partial least squares (PLS), PLS framework, gene coregulation, data mining, feature selection, PLS-based supervised feature extraction, bioinformatics, potential structure information