正在加载图片...
第1期 陈爱国,等:基于极大嫡的知识迁移模糊聚类算法 ·103. 的数据集上的实验结果,可以得出本文所提 International Conference on Management of Data MEKTFCA算法较6种相关算法在聚类性能上有明 Philadelphia,Pennsylvania,USA,1999:49-60. 显的优越性。本文的创新点主要是,在MECA算法 [12]ARIAS-CASTRO E,CHEN Guangliang,LERMAN G. Spectral Clustering based on local linear approximations 的基础了提出了一个新的知识迁移方法,该方法能 [J].Electronic journal of statistics,2011,5:1537 有效解决实际生活中数据短缺和有噪声数据场景 -1587. 下的聚类性能问题。此外,本文所提的MEKTFCA [13]PAN S J,YANG Qiang.A survey on transfer learning[J]. IEEE transactions on knowledge and data engineering, 算法也存在着局限性,如在不同应用场景下两个平衡 2010,22(10):1345-1359. 参数如何进行有效确定的问题。这也是我们对该算法 [14]GU Quanquan,ZHOU Jie.Learning the shared subspace 进行进一步研究的方向。 for multi-task clustering and transductive transfer classification [C ]//Proceedings of Ninth IEEE 参考文献: International Conference on Data Mining.Miami,FL, USA,2009:159-168. [1]CARIOU C,CHEHDI K.Unsupervised nearest neighbors [15 DAI Wenyuan,YANG Qiang,XUE Guirong,et al. clustering with application to hyperspectral images[J]. Self-taught clustering C ]//Proceedings of the 25th IEEE journal of selected topics in signal processing,2015, International Conference on Machine Leaming.New 9(6):1105-1116. York,NY.USA,2008:200-207. [ALI A,BOYACI A,BAYNAL K.Data mining application [16]GU Quanquan,ZHOU Jie.Co-clustering on manifolds in banking sector with clustering and classification methods [C ]//Proceedings of the 15th ACM SIGKDD [C]//Proceedings of 2015 International Conference on International Conference on Knowledge Discovery and Industrial Engineering and Operations Management.Dubai, Data Mining.New York,USA,2009:359-368. 0AE,2015:1-8. [17]JIANG Wenhao,CHUNG F L.Transfer spectral clustering [3]LI Shuai,ZHOU Xiaofeng,SHI Haibo,et al.An efficient [M]//FLACH P A,BIE T D,CRISTIANINI N.Machine clustering method for medical data applications [C] Learning and Knowledge Discovery in Databases.Berlin Proceedings of 2015 IEEE International Conference on Cyber Heidelberg:Springer,2012:789-803. Technology in Automation,Control,and Intelligent System. [18]JING Liping,NG K M,HUANG JZ.An entropy weighting Shenyang,China,2015:133-138. k-means algorithm for subspace clustering of high- [4]LIKAS A,VLASSIS N,VERBEEK JJ.The global k-means dimensional sparse data [J].IEEE transactions on clustering algorithm[J].Pattern recognition,2003,36(2): knowledge and data engineering,2007,19 (8): 451-461 1026-1041. [5]BEZDEK J C.Pattern recognition with fuzzy objective [19]LIU Jun,MOHAMMED J,CARTER J,et al.Distance- function algorithms M ]New York:Springer,1981: based clustering of CGH data[J].Bioinformatics,2006, 43-93. 22(16):1971-1978. [6]KARAYIANNIS N B.MECA:maximum entropy clustering [20]DAI Wenyuan,XUE Guirong,YANG Qiang,et al.Co- algorithm[C]//Proceedings of the 3rd IEEE International clustering based classification for out-of-domain documents Conference on Fuzzy Systems.Orlando,USA,1994,1: [C]//Proceedings of the 13th ACM SIGKDD International 630-635 Conference on Knowledge Discovery and Data Mining.New [7 LI Ruiping,MUKAIDONO M.A maximum-entropy York,NY,USA,2007:210-219. approach to fuzzy clustering[C]//Proceedings of 1995 the [21]MCCALLUM A K.Bow:a toolkit for statistical language 4th IEEE International Conference on Fuzzy System. modeling,text retrieval,classification and clustering[EB/ Yokohama,Japan,1995,4:2227-2232. OL].1996.http://www.cs.cmu.edu/mccallum/bow [8]ZHANG Tian,RAMAKRISHNAN R,LIVNY M.BIRCH: [22]BAY S D,KIBLER D,PAZZANI M J,et al.The UCI an efficient data clustering method for very large databases [C]//Proceedings of the 1996 ACM SIGMOD International KDD archive of large data sets for data mining research and experimentation [J].ACM SIGKDD explorations Conference on Management of Data.New York,NY,USA, newsletter,2000.2(2):81-85. 1996:103-114. 作者简介: [9]GUHA S,RASTOGI R,SHIM K.CURE:an efficient 陈爱国,男,1975年生,博士研究 clustering algorithm for large databases[C]//Proceedings 生,主要研究方向为模式识别与机器 of the 1998 ACM SIGMOD International Conference on 学习。 Management of Data.New York,NY,USA,1998:73-84. [10]ESTER M,KRIEGEL H P,SANDER J,et al.A density-based algorithm for discovering clusters in large spatial databases with noise [C]//Proceeding of the Second International Conference on Knowledge Discovery 王士同,男.1964年生.教授.博士 and Data Mining.Portland,Oregon,USA,1996:226 生导师,中国离散数学学会常务理事 -231. 中国机器学习学会常务理事,主要研究 [11]ANKERST M,BREUNIG MM,KRIEGEL H P,et al. 方向为人工智能、模式识别和生物信 OPTICS:ordering Points to Identify the Clustering 息。发表学术论文近百篇,其中被SC Structure [C]//Proceedings of the 1999 ACM SIGMOD EI检索50余篇。的数 据 集 上 的 实 验 结 果, 可 以 得 出 本 文 所 提 MEKTFCA 算法较 6 种相关算法在聚类性能上有明 显的优越性。 本文的创新点主要是,在 MECA 算法 的基础了提出了一个新的知识迁移方法,该方法能 有效解决实际生活中数据短缺和有噪声数据场景 下的聚类性能问题。 此外,本文所提的 MEKTFCA 算法也存在着局限性,如在不同应用场景下两个平衡 参数如何进行有效确定的问题。 这也是我们对该算法 进行进一步研究的方向。 参考文献: [1] CARIOU C, CHEHDI K. Unsupervised nearest neighbors clustering with application to hyperspectral images [ J ]. IEEE journal of selected topics in signal processing, 2015, 9(6): 1105-1116. [2]ALI A, BOYACI A, BAYNAL K. Data mining application in banking sector with clustering and classification methods [C ] / / Proceedings of 2015 International Conference on Industrial Engineering and Operations Management. Dubai, UAE, 2015: 1-8. [3]LI Shuai, ZHOU Xiaofeng, SHI Haibo, et al. An efficient clustering method for medical data applications [ C ] / / Proceedings of 2015 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent System. Shenyang, China, 2015: 133-138. [4]LIKAS A, VLASSIS N, VERBEEK J J. The global k⁃means clustering algorithm[J]. Pattern recognition, 2003, 36(2): 451-461. [ 5 ] BEZDEK J C. Pattern recognition with fuzzy objective function algorithms [ M ]. New York: Springer, 1981: 43-93. [6] KARAYIANNIS N B. MECA: maximum entropy clustering algorithm[ C] / / Proceedings of the 3rd IEEE International Conference on Fuzzy Systems. Orlando, USA, 1994, 1: 630-635. [ 7 ] LI Ruiping, MUKAIDONO M. A maximum⁃entropy approach to fuzzy clustering[C] / / Proceedings of 1995 the 4th IEEE International Conference on Fuzzy System. Yokohama, Japan, 1995, 4: 2227-2232. [8] ZHANG Tian, RAMAKRISHNAN R, LIVNY M. BIRCH: an efficient data clustering method for very large databases [C] / / Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data. New York, NY, USA, 1996: 103-114. [9] GUHA S, RASTOGI R, SHIM K. CURE: an efficient clustering algorithm for large databases [ C] / / Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data. New York, NY, USA, 1998: 73-84. [10 ] ESTER M, KRIEGEL H P, SANDER J, et al. A density⁃based algorithm for discovering clusters in large spatial databases with noise [ C ] / / Proceeding of the Second International Conference on Knowledge Discovery and Data Mining. Portland, Oregon, USA, 1996: 226 -231. [11] ANKERST M, BREUNIG M M, KRIEGEL H P, et al. OPTICS: ordering Points to Identify the Clustering Structure [ C] / / Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data. Philadelphia, Pennsylvania, USA, 1999: 49-60. [12] ARIAS⁃CASTRO E, CHEN Guangliang, LERMAN G. Spectral Clustering based on local linear approximations [ J ]. Electronic journal of statistics, 2011, 5: 1537 -1587. [13]PAN S J, YANG Qiang. A survey on transfer learning[J]. IEEE transactions on knowledge and data engineering, 2010, 22(10): 1345-1359. [14]GU Quanquan, ZHOU Jie. Learning the shared subspace for multi⁃task clustering and transductive transfer classification [ C ] / / Proceedings of Ninth IEEE International Conference on Data Mining. Miami, FL, USA, 2009: 159-168. [ 15 ] DAI Wenyuan, YANG Qiang, XUE Guirong, et al. Self⁃taught clustering [ C ] / / Proceedings of the 25th International Conference on Machine Learning. New York, NY, USA, 2008: 200-207. [ 16 ] GU Quanquan, ZHOU Jie. Co⁃clustering on manifolds [ C ] / / Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA, 2009: 359-368. [17]JIANG Wenhao, CHUNG F L. Transfer spectral clustering [M] / / FLACH P A, BIE T D, CRISTIANINI N. Machine Learning and Knowledge Discovery in Databases. Berlin Heidelberg: Springer, 2012: 789-803. [18]JING Liping, NG K M, HUANG J Z. An entropy weighting k⁃means algorithm for subspace clustering of high⁃ dimensional sparse data [ J ]. IEEE transactions on knowledge and data engineering, 2007, 19 ( 8 ): 1026-1041. [19] LIU Jun, MOHAMMED J, CARTER J, et al. Distance⁃ based clustering of CGH data[ J]. Bioinformatics, 2006, 22(16): 1971-1978. [20]DAI Wenyuan, XUE Guirong, YANG Qiang, et al. Co- clustering based classification for out⁃of⁃domain documents [C] / / Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY, USA, 2007: 210-219. [21] MCCALLUM A K. Bow: a toolkit for statistical language modeling, text retrieval, classification and clustering[EB/ OL]. 1996. http: / / www.cs.cmu.edu / mccallum/ bow. [22] BAY S D, KIBLER D, PAZZANI M J, et al. The UCI KDD archive of large data sets for data mining research and experimentation [ J ]. ACM SIGKDD explorations newsletter, 2000, 2(2): 81-85. 作者简介: 陈爱国,男,1975 年生,博士研究 生,主要研究方向为模式识别与机器 学习。 王士同,男,1964 年生,教授,博士 生导师,中国离散数学学会常务理事, 中国机器学习学会常务理事,主要研究 方向为人工智能、模式识别和生物信 息。 发表学术论文近百篇,其中被 SCI、 EI 检索 50 余篇。 第 1 期 陈爱国,等:基于极大熵的知识迁移模糊聚类算法 ·103·
<<向上翻页
©2008-现在 cucdc.com 高等教育资讯网 版权所有