Chinese Journal of Engineering, Vol. 42, No. 4: 476-482, April 2020
https://doi.org/10.13374/j.issn2095-9389.2019.09.17.002; http://cje.ustb.edu.cn

Named entity recognition based on equipment and fault field of CNC machine tools

WANG Huan 1,2), ZHU Wen-qiu 1,2), WU Yue-zhong 1,2) ✉, HE Pin-jie 1,2), WAN Lan-jun 1,2)
1) School of Computer, Hunan University of Technology, Zhuzhou 412008, China
2) Hunan Key Laboratory of Intelligent Information Perception and Processing Technology, Zhuzhou 412008, China
✉ Corresponding author, E-mail: yuezhong.wu@163.com

Abstract  To support accurate fault diagnosis of CNC machine tools and to extend their service life, named entity recognition in the CNC machine tool equipment fault domain was studied, taking the historical maintenance records of CNC machine tools as the research object. After analyzing the characteristics of the fault descriptions in these records, a named entity recognition method was proposed that combines a bidirectional long short-term memory network (BLSTM) with a conditional random field with loop (L-CRF). First, the input sentence is segmented and labeled, the Skip-gram model in Word2vec is pre-trained on the labeled corpus, and the resulting character vectors are converted into a character vector sequence by the word embedding layer. Next, the character vector sequence is fed into the BLSTM to learn long-term dependency information. Finally, the sentence representation is fed into the L-CRF to obtain the globally optimal sequence. Experimental results show that the method clearly outperforms other named entity recognition methods, laying a solid foundation for intelligent maintenance and real-time diagnosis of CNC machine tool equipment.

Keywords  CNC machine tools; equipment fault; bidirectional long short-term memory network; conditional random field with loop; named entity recognition
Classification number  TP391.1

ABSTRACT  With the advent of intelligent manufacturing and big data, the Made in China 2025 Initiative and Industry 4.0 have paid increasing attention to automated and intelligent industrial equipment. Against this background, computer numerical control (CNC) machine tools have become steadily more complex and intelligent, and the types and descriptions of their faults have multiplied, posing serious challenges to the maintenance and diagnosis of CNC machine tools. To support accurate fault diagnosis of CNC machine tools and to prolong their service life, the performance of named entity recognition systems must be improved. Accordingly, named entity recognition in the equipment and fault field of CNC machine tools was studied, taking the historical inspection and repair records of CNC machine tools as the research object. After analyzing the characteristics of the fault descriptions in these records, a named entity recognition method was proposed that combines bidirectional long short-term memory (BLSTM) with a conditional random field with loop (L-CRF). In the first step, the input sentence is segmented and labeled. The Skip-gram model in Word2vec is pre-trained on the annotated corpus, and the resulting word vectors are converted into a word vector sequence through the word embedding layer.
In the second step, the word vector sequence is fed into the BLSTM layer to learn long-term dependency information. The final step is to input the sentence representation into the L-CRF layer

Received: 2019-09-17
Funding: National Key Research and Development Program of China (2018YFB1700200); Young Scientists Fund of the National Natural Science Foundation of China (61702177); Open Platform Innovation Fund of the Education Department of Hunan Province (17K029); Open Project of the Hunan Key Laboratory of Intelligent Information Perception and Processing Technology (2017KF07); Key Research and Development Program of Hunan Province (2019GK2133); Outstanding Youth Scientific Research Project of the Education Department of Hunan Province (19B147)