DOI: 10.11992/tis.201905046 网络出版地址: h

正在加载图片...

第15卷第2期智能系统学报 Vol.15 No.2 2020年3月 CAAI Transactions on Intelligent Systems Mar.2020 D0:10.11992/tis.201905046 网络出版地址：http:/kns.cnki.net/kcms/detail/23.1538.TP.20200312.1147.002.html 基于模糊不一致对的多标记属性约简高琪，李德玉2，王素格2 (1,山西大学计算机科学与信息技术学院，山西太原030006：2.山西大学计算智能与中文信息处理教育部重点实验室，山西太原030006) 摘要：在实际生活当中，存在着大量的高维多标记数据，为解决维度灾难问题，通常需要约简属性集。针对目前的多标记属性约简算法未考虑标记关系问题.本文提出了一种融合标记关系的模糊不一致对多标记属性约简算法。利用相对嫡(KL散度)度量标记之间的关系，定义标记权重，结合标记权重，定义模糊不一致对，考虑到属性对于模糊不一致对的区分性，定义属性重要性并进行属性约简。在8个数据集上的对比实验表明，所提基于模糊不一致对的多标记属性约简算法优于当前的多标记属性约简算法。关键词：多标记数据：属性约简：模糊不一致对：标记权重；KL散度：标记关系：模糊粗糙集；区分矩阵中图分类号：TP391文献标志码：A 文章编号：1673-4785(2020)02-0374-12 中文引用格式：高琪，李德玉，王素格.基于模糊不一致对的多标记属性约简.智能系统学报，2020,15(2)：374-385. 英文引用格式：GAO Qi,LI Deyu,WANG Suge..Multi-label attribute reduction based on fuzzy inconsistency pairsJ.CAAI trans- actions on intelligent systems,2020,15(2):374-385. Multi-label attribute reduction based on fuzzy inconsistency pairs GAO Qi',LI Deyu'2,WANG Suge2 (1.School of Computer and Information Technology,Shanxi University,Taiyuan 030006,China;2.Key Laboratory of Computation- al Intelligence and Chinese Information Processing of Ministry of Education University,Shanxi University,Taiyuan 030006,China) Abstract:In real life,there is a large amount of multi-label data,and in multi-label data processing,attribute reduction is one of the important methods to solve the high-dimensional disaster of multi-label data.Because there is a relation- ship between labels,in this paper we firstly use the KL divergence metric to determine the relationship between labels, then define the label weight,and then combine the label weight to define the fuzzy inconsistency pairs.Finally,consid- ering the distingishing ability of attributes to the fuzzy inconsistency pairs,we propose a multi-label attribute reduction algorithm based on fuzzy inconsistency pairs.Extensive experiments carried out on eight publicly available data sets verify effectiveness of the proposed algorithm named MLAR-FL by comparing it with some state-of-the-art approaches. Keywords:multi-label data;attribute reduction;fuzzy inconsistency pairs;label weight;Kullback-Leibler divergence; the relationship of labels;fuzzy rough sets;distinguished matrix 传统的监督学习问题中，数据只有一个类别股票等几个主题；一个图片可以有蓝天、湖泊、树标记变量，通常称为单标记学习问题。但是，在木、绿地等多个语义标注。因此，对于多标记数现实世界当中，每个数据对象可能同时具有多个据的学习，成为近些年的机器学习领域关注重点语义项。例如，一个新闻可以包含经济、体育、之一。收稿日期：2019-05-24.网络出版日期：2020-03-12. 在多标记学习当中，数据的高维性会严重影基金项目：国家自然科学基金项目(61672331,61573231，响多标记分类器的性能，因而降维是解决该问题 61432011,61802237):山西省重点研发计划项目 (201803D421024,201903D42I041):山西省高等学校的重要手段。多标记的特征降维的方法主要有特优秀成果培育项目(2019SK036):山西省高等学校青年科研人员培育计划. 征提取和特征选择。特征提取是通过转换或者映通信作者：李德玉.E-mail:Iidy@sxu.edu.cn 射的方法，将原始高维特征空间转换到一个新的DOI: 10.11992/tis.201905046 网络出版地址: http://kns.cnki.net/kcms/detail/23.1538.TP.20200312.1147.002.html 基于模糊不一致对的多标记属性约简高琪1 ，李德玉1,2，王素格1,2 （1. 山西大学计算机科学与信息技术学院，山西太原 030006; 2. 山西大学计算智能与中文信息处理教育部重点实验室，山西太原 030006）摘要：在实际生活当中，存在着大量的高维多标记数据，为解决维度灾难问题，通常需要约简属性集。针对目前的多标记属性约简算法未考虑标记关系问题，本文提出了一种融合标记关系的模糊不一致对多标记属性约简算法。利用相对熵 (KL 散度) 度量标记之间的关系，定义标记权重，结合标记权重，定义模糊不一致对，考虑到属性对于模糊不一致对的区分性，定义属性重要性并进行属性约简。在 8 个数据集上的对比实验表明，所提基于模糊不一致对的多标记属性约简算法优于当前的多标记属性约简算法。关键词：多标记数据；属性约简；模糊不一致对；标记权重；KL 散度；标记关系；模糊粗糙集；区分矩阵中图分类号：TP391 文献标志码：A 文章编号：1673−4785(2020)02−0374−12 中文引用格式：高琪, 李德玉, 王素格. 基于模糊不一致对的多标记属性约简 [J]. 智能系统学报, 2020, 15(2): 374–385. 英文引用格式：GAO Qi, LI Deyu, WANG Suge. Multi-label attribute reduction based on fuzzy inconsistency pairs[J]. CAAI transactions on intelligent systems, 2020, 15(2): 374–385. Multi-label attribute reduction based on fuzzy inconsistency pairs GAO Qi1 ，LI Deyu1,2 ，WANG Suge1,2 (1. School of Computer and Information Technology, Shanxi University, Taiyuan 030006, China; 2. Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education University, Shanxi University, Taiyuan 030006, China) Abstract: In real life, there is a large amount of multi-label data, and in multi-label data processing, attribute reduction is one of the important methods to solve the high-dimensional disaster of multi-label data. Because there is a relationship between labels, in this paper we firstly use the KL divergence metric to determine the relationship between labels, then define the label weight, and then combine the label weight to define the fuzzy inconsistency pairs. Finally, considering the distingishing ability of attributes to the fuzzy inconsistency pairs, we propose a multi-label attribute reduction algorithm based on fuzzy inconsistency pairs. Extensive experiments carried out on eight publicly available data sets verify effectiveness of the proposed algorithm named MLAR-FL by comparing it with some state-of-the-art approaches. Keywords: multi-label data; attribute reduction; fuzzy inconsistency pairs; label weight; Kullback-Leibler divergence; the relationship of labels; fuzzy rough sets; distinguished matrix 传统的监督学习问题中，数据只有一个类别标记变量，通常称为单标记学习问题[1]。但是，在现实世界当中，每个数据对象可能同时具有多个语义项[2]。例如，一个新闻可以包含经济、体育、股票等几个主题；一个图片可以有蓝天、湖泊、树木、绿地等多个语义标注。因此，对于多标记数据的学习，成为近些年的机器学习领域关注重点之一。在多标记学习当中，数据的高维性会严重影响多标记分类器的性能，因而降维是解决该问题的重要手段。多标记的特征降维的方法主要有特征提取和特征选择。特征提取是通过转换或者映射的方法，将原始高维特征空间转换到一个新的收稿日期：2019−05−24. 网络出版日期：2020−03−12. 基金项目：国家自然科学基金项目 (61672331, 61573231, 61432011, 61802237)；山西省重点研发计划项目 (201803D421024, 201903D421041)；山西省高等学校优秀成果培育项目 (2019SK036)；山西省高等学校青年科研人员培育计划. 通信作者：李德玉. E-mail：lidy@sxu.edu.cn. 第 15 卷第 2 期智能系统学报 Vol.15 No.2 2020 年 3 月 CAAI Transactions on Intelligent Systems Mar. 2020

向下翻页>>

点击下载：【人工智能】基于模糊不一致对的多标记属性约简