0.8 0.9 1.0 0.7 0.6 分类准确率 0 200 400 6

正在加载图片...

第4期李顺勇，等：一种新的最大相关最小冗余特征选择算法 ·659· 1.0 上比传统算法选择结果更好。 0.9 1.0 0.9 08 Laplacian-Score 0.7 MRMR New-MRMR-L-NI .New-MRMR-L-P 0.7 0.6 .Information-Gain 200 400 600 800 MRMR 数据维度K Ne-MRMR-G- 0.6 图I6New-MRMR-L-NI、New-MRMR-L-P、Laplacian- 0 200 400 600 800 数据维度K Score、MRMR在数据集Parkinson's Disease 上分类准确率的变化趋势图17 New-MRMR-lG-NI、New-MRMR-IG-P、Informa- Fig.16 Correct classification trend of New-MRMR-L-NI, tion-Gain、MRMR在数据集Parkinson'sDis- New-MRMR-L-P,Laplacian-Score,MRMR ease上分类准确率的变化趋势 on the Parkinson's Disease dataset Fig.17 Correct classification trend of New-MRMR- IG-NI,New-MRMR-IG-P,Information-Gain, 由图17可以看出，在维度为120和540时， MRMR on the Parkinson's Disease dataset New-MRMR-IG-P的分类准确率与算法MRMR的分类准确率较为接近，但在其余维度上，其分类 2.3实验结果的T检验准确率均大于MRMR的分类准确率。而且，在分为更加有效地证明本文新提的8种特征选择类准确率达到最大时，New-MRMR-IG-P所选择算法的有效性，以下采用成对单边T检验来证明的特征子集数仅为180，远小于MRMR的最优特其有效性。原假设为：本文新提算法与传统算法征子集数。此外，New-MRMR-IG-NI的分类准确的特征选择效果相同：备择假设为：本文新提算率的曲线高于算法MRMR、Information-Gain的分法的特征选择效果优于传统特征选择算法。表7 类准确率曲线。由上述分析可知，针对数据集为假设检验结果，其中包含了检验的统计量，置 Parkinson's Disease而言，本文提出算法在整体信区间以及P值。表7新提算法与传统算法的成对单边T检验的检验结果 Table 7 Test results of paired unilateral T-test between the new algorithm and the traditional algorithm 成对T检验 P值 T统计量 95%置信区间 New-MRMR-F-NI vs MRMR 0.0268 3.0890 (0.7987,Inf New-MRMR-F-P vs MRMR 0.0088 4.7566 (0.8302,lnf) New-MRMR-K-NI vs MRMR 0.0271 3.0765 (0.7966.,Inf) New-MRMR-K-P vs MRMR 0.0152 3.8754 (0.8126,Inf) New-MRMR-L-NI vs MRMR 0.0105 4.4561 (0.8230,lnf New-MRMR-L-P vs MRMR 0.0321 2.8649 (0.7933,Inf) New-MRMR-IG-NI vs MRMR 0.0206 3.4353 (0.8078.lnf) New-MRMR-IG-P vs-MRMR 0.0189 3.5595 (0.8089,Inf) New-MRMR-F-NI vs Fisher-Score 0.0134 4.0658 (0.7987,1nf) New-MRMR-F-P vs Fisher-Score 0.0005 5.7347 (0.8302,Inf New-MRMR-K-NI vs Chi-Square-Test 0.0030 6.9165 (0.7966,InfD New-MRMR-K-P vs Chi-Square-Test 0.0026 7.3174 (0.8126,1nf) New-MRMR-L-NI vs Laplacian-Score 0.0328 2.8405 (0.7730,Inf) New-MRMR-L-P vs Laplacian-Score 0.0130 4.1147 (0.7933,Inf) New-MRMR-IG-NI vs Information-Gain 0.0064 5.3303 (0.8068,Inf New-MRMR-IG-P vs Information-Gain 0.0056 5.5739 (0.8089,1nf0.8 0.9 1.0 0.7 0.6 分类准确率 0 200 400 600 800 数据维度 K MRMR Laplacian-Score New-MRMR-L-NI New-MRMR-L-P 图 16 New-MRMR-L-NI、New-MRMR-L-P、LaplacianScore、MRMR 在数据集 Parkinson’s Disease 上分类准确率的变化趋势 Fig. 16 Correct classification trend of New-MRMR-L-NI, New-MRMR-L-P, Laplacian-Score, MRMR on the Parkinson’s Disease dataset 由图 17 可以看出，在维度为 120 和 540 时， New-MRMR-IG-P 的分类准确率与算法 MRMR 的分类准确率较为接近，但在其余维度上，其分类准确率均大于 MRMR 的分类准确率。而且，在分类准确率达到最大时，New-MRMR-IG-P 所选择的特征子集数仅为 180，远小于 MRMR 的最优特征子集数。此外，New-MRMR-IG-NI 的分类准确率的曲线高于算法 MRMR、Information-Gain 的分类准确率曲线。由上述分析可知，针对数据集 Parkinson’s Disease 而言，本文提出算法在整体上比传统算法选择结果更好。 0.8 0.9 1.0 0.7 0.6 分类准确率 0 200 400 600 800 数据维度 K MRMR Information-Gain New-MRMR-IG-NI New-MRMR-IG-P 图 17 New-MRMR-IG-NI、New-MRMR-IG-P、Information- Gain、MRMR 在数据集 Parkinson’s Disease 上分类准确率的变化趋势 Fig. 17 Correct classification trend of New-MRMRIG-NI, New-MRMR-IG-P, Information-Gain, MRMR on the Parkinson’s Disease dataset 2.3 实验结果的 T 检验为更加有效地证明本文新提的 8 种特征选择算法的有效性，以下采用成对单边 T 检验来证明其有效性。原假设为：本文新提算法与传统算法的特征选择效果相同；备择假设为：本文新提算法的特征选择效果优于传统特征选择算法。表 7 为假设检验结果，其中包含了检验的统计量，置信区间以及 P 值。表 7 新提算法与传统算法的成对单边 T 检验的检验结果 Table 7 Test results of paired unilateral T-test between the new algorithm and the traditional algorithm 成对T检验 P值 T统计量 95%置信区间 New-MRMR-F-NI vs MRMR 0.026 8 3.089 0 (0.7987, Inf) New-MRMR-F-P vs MRMR 0.008 8 4.756 6 (0.8302, Inf) New-MRMR-K-NI vs MRMR 0.027 1 3.076 5 (0.7966, Inf) New-MRMR-K-P vs MRMR 0.015 2 3.875 4 (0.8126, Inf) New-MRMR-L-NI vs MRMR 0.010 5 4.456 1 (0.8230, Inf) New-MRMR-L-P vs MRMR 0.032 1 2.864 9 (0.7933, Inf) New-MRMR-IG-NI vs MRMR 0.020 6 3.435 3 (0.8078, Inf) New-MRMR-IG-P vs-MRMR 0.018 9 3.559 5 (0.8089, Inf) New-MRMR-F-NI vs Fisher-Score 0.013 4 4.065 8 (0.7987, Inf) New-MRMR-F-P vs Fisher-Score 0.000 5 5.734 7 (0.8302, Inf) New-MRMR-K-NI vs Chi-Square-Test 0.003 0 6.916 5 (0.7966, Inf) New-MRMR-K-P vs Chi-Square-Test 0.002 6 7.317 4 (0.8126, Inf) New-MRMR-L-NI vs Laplacian-Score 0.032 8 2.840 5 (0.7730, Inf) New-MRMR-L-P vs Laplacian-Score 0.013 0 4.114 7 (0.7933, Inf) New-MRMR-IG-NI vs Information-Gain 0.006 4 5.330 3 (0.8068, Inf) New-MRMR-IG-P vs Information-Gain 0.005 6 5.573 9 (0.8089, Inf) 第 4 期李顺勇，等：一种新的最大相关最小冗余特征选择算法 ·659·

<<向上翻页向下翻页>>

点击下载：【机器学习】一种新的最大相关最小冗余特征选择算法