7. References [1] L. Schmidt, M. Shar_中国高校课件下载中心

点击下载：《人工智能、机器学习与大数据》课程教学资源（参考文献）Deep Hashing for Speaker Identification and Retrieval

正在加载图片...

7.References [1]L.Schmidt,M.Sharifi,and I.Lopez-Moreno,"Large-scale speaker identification,"in IEEE International Conference on Acoustics,Speech and Signal Processing.2014,pp.1650-1654. [2]J.S.Chung,A.Nagrani,and A.Zisserman,"Voxceleb2:Deep speaker recognition,"in Annual Conference of the Internationa Speech Communication Association,2018,pp.1086-1090. [3]L.Li,C.Xing.D.Wang,K.Yu,and T.F.Zheng."Binary speaker embedding."in International Symposium on Chinese Spoken Lan- guage Processing,2016,pp.1-4. [4]N.Dehak,P.Kenny,R.Dehak,P.Dumouchel,and P.Ouellet, "Front-end factor analysis for speaker verification,"IEEE Trans- actions on Audio,Speech Language Processing,vol.19,no.4. Pp.788-798.2011. [5]E.Variani,X.Lei,E.McDermott,I.Lopez-Moreno,and J.Gonzalez-Dominguez,"Deep neural networks for small foot- print text-dependent speaker verification,"in IEEE International Conference on Acoustics,Speech and Signal Processing,2014 Pp.40524056. [6]D.Snyder,D.Garcia-Romero,G.Sell,D.Povey,and S.Khudan- pur,"X-vectors:Robust DNN embeddings for speaker recogni- tion."in IEEE International Conference on Acoustics.Speech and Signal Processing.2018.pp.5329-5333. [7]O.Russakovsky,J.Deng,H.Su,J.Krause,S.Satheesh,S.Ma Z.Huang.A.Karpathy,A.Khosla,M.S.Bernstein,A.C.Berg. and F.Li,"Imagenet large scale visual recognition challenge,"In- ternational Journal of Computer Vision,vol.115,no.3,pp.211- 252.2015 [8]Y.-Q.Yu,L.Fan,and W.-J.Li,"Ensemble additive margin soft- max for speaker verification,"in IEEE International Conference on Acoustics,Speech and Signal Processing,2019,pp.6046- 6050. [9]M.Datar,N.Immorlica,P.Indyk,and V.S.Mirrokni,"Locality- sensitive hashing scheme based on p-stable distributions,"in ACM Symposium on Computational Geometry,2004,pp.253-262. [10]Y.Gong and S.Lazebnik,"Iterative quantization:A procrustean approach to learning binary codes"in IEEE Conference on Com- puter Vision and Pattern Recognition.2011.pp.817-824. [11]W.-J.Li,S.Wang.and W.-C.Kang,"Feature learning based deep supervised hashing with pairwise labels,"in International Joint Conference on Artificial Intelligence,2016.pp.1711-1717. [12]Q.-Y.Jiang.X.Cui.and W.-J.Li,"Deep discrete supervised hash- ing,"IEEE Transaction Image Processing,vol.27.no.12,pp. 5996-6009.2018. [13]M.Norouzi.D.J.Fleet,and R.Salakhutdinov,"Hamming dis- tance metric learning,"in Annual Conference on Neural Informa- tion Processing Systems,2012,pp.1070-1078. [14]K.He,X.Zhang,S.Ren,and J.Sun,"Deep residual learning for image recognition,"in IEEE Conference on Computer Vision and Pattern Recognition,2016,pp.770-778. [15]F.Wang.J.Cheng,W.Liu,and H.Liu,"Additive margin soft- max for face verification,"IEEE Signal Processing Letter,vol.25 no.7,Pp.926-930,2018 [16]W.Kong and W.-J.Li,"Isotropic hashing,"in Annual Conference on Neural Information Processing Systems,2012,pp.1655-1663. [17]D.Gutman and Y.Bistritz,"Speaker verification using phoneme- adapted gaussian mixture models,"in European Signal Processing Conference.2002,pp.1-4. [18]A.O.Hatch.S.S.Kajarekar,and A.Stolcke,"Within-class co- variance normalization for svm-based speaker recognition,"inIn- ternational Conference on Spoken Language Processing,2006. Pp.1471-1474. 29127. References [1] L. Schmidt, M. Sharifi, and I. Lopez-Moreno, “Large-scale speaker identification,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2014, pp. 1650–1654. [2] J. S. Chung, A. Nagrani, and A. Zisserman, “Voxceleb2: Deep speaker recognition,” in Annual Conference of the International Speech Communication Association, 2018, pp. 1086–1090. [3] L. Li, C. Xing, D. Wang, K. Yu, and T. F. Zheng, “Binary speaker embedding,” in International Symposium on Chinese Spoken Language Processing, 2016, pp. 1–4. [4] N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, “Front-end factor analysis for speaker verification,” IEEE Transactions on Audio, Speech & Language Processing, vol. 19, no. 4, pp. 788–798, 2011. [5] E. Variani, X. Lei, E. McDermott, I. Lopez-Moreno, and J. Gonzalez-Dominguez, “Deep neural networks for small footprint text-dependent speaker verification,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2014, pp. 4052–4056. [6] D. Snyder, D. Garcia-Romero, G. Sell, D. Povey, and S. Khudanpur, “X-vectors: Robust DNN embeddings for speaker recognition,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2018, pp. 5329–5333. [7] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. S. Bernstein, A. C. Berg, and F. Li, “Imagenet large scale visual recognition challenge,” International Journal of Computer Vision, vol. 115, no. 3, pp. 211– 252, 2015. [8] Y.-Q. Yu, L. Fan, and W.-J. Li, “Ensemble additive margin softmax for speaker verification,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2019, pp. 6046– 6050. [9] M. Datar, N. Immorlica, P. Indyk, and V. S. Mirrokni, “Localitysensitive hashing scheme based on p-stable distributions,” in ACM Symposium on Computational Geometry, 2004, pp. 253–262. [10] Y. Gong and S. Lazebnik, “Iterative quantization: A procrustean approach to learning binary codes,” in IEEE Conference on Computer Vision and Pattern Recognition, 2011, pp. 817–824. [11] W.-J. Li, S. Wang, and W.-C. Kang, “Feature learning based deep supervised hashing with pairwise labels,” in International Joint Conference on Artificial Intelligence, 2016, pp. 1711–1717. [12] Q.-Y. Jiang, X. Cui, and W.-J. Li, “Deep discrete supervised hashing,” IEEE Transaction Image Processing, vol. 27, no. 12, pp. 5996–6009, 2018. [13] M. Norouzi, D. J. Fleet, and R. Salakhutdinov, “Hamming distance metric learning,” in Annual Conference on Neural Information Processing Systems, 2012, pp. 1070–1078. [14] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 770–778. [15] F. Wang, J. Cheng, W. Liu, and H. Liu, “Additive margin softmax for face verification,” IEEE Signal Processing Letter, vol. 25, no. 7, pp. 926–930, 2018. [16] W. Kong and W.-J. Li, “Isotropic hashing,” in Annual Conference on Neural Information Processing Systems, 2012, pp. 1655–1663. [17] D. Gutman and Y. Bistritz, “Speaker verification using phonemeadapted gaussian mixture models,” in European Signal Processing Conference, 2002, pp. 1–4. [18] A. O. Hatch, S. S. Kajarekar, and A. Stolcke, “Within-class covariance normalization for svm-based speaker recognition,” in International Conference on Spoken Language Processing, 2006, pp. 1471–1474. 2912

<<向上翻页

点击下载：《人工智能、机器学习与大数据》课程教学资源（参考文献）Deep Hashing for Speaker Identification and Retrieval