MFCC-based perceptual hashing for compressed domain of speech content identification
Zhang, Qiu-Yu (1); Liu, Yang-Wei (1); Di, Yan-Jun (1); Zhang, Qian-Yun (2); Xing, Peng-Fei (1)
2014
Journal: Journal of Chemical and Pharmaceutical Research
Volume: 6; Issue: 7; Pages: 379-386
Abstract: Current research on speech content identification focuses primarily on raw wideband speech signals, whereas speech is generally transmitted in a compressed format. Existing methods therefore cannot meet the demand for speech content identification in the compressed domain. This paper proposes a new speech perceptual hashing algorithm for compressed-domain speech content identification based on MFCC (Mel Frequency Cepstral Coefficients), to address real-time speech content identification and the large volume of voice messages on the mobile Internet. The algorithm extracts MFCC features following the raw wideband method. The process begins by extracting the MDCT coefficients, which are intermediate decoding results of speech compressed in MP3 format. These coefficients are converted to MFCC parameters, and binary hash values are then generated from these parameters in combination with human auditory features. By operating on highly compressed data, the algorithm achieves fast speech content identification. Experimental results show that the proposed algorithm can localize tampering and improves efficiency by 5% compared with raw wideband algorithms, while maintaining robustness and discrimination. © 2014, Journal of Chemical and Pharmaceutical Research. All rights reserved.
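The pipeline described in the abstract (MDCT coefficients, then MFCC parameters, then a binary hash) can be illustrated with a minimal Python sketch. It is not the authors' implementation. Everything below is an assumption made for illustration: the MDCT frames are taken as already extracted by an MP3 decoder (576 coefficients per frame), the sample rate is 44.1 kHz, 24 mel filters and 12 cepstral coefficients are used, and the binarization rule (one bit per frame, thresholded at the median) is a stand-in, since the abstract does not state the rule actually used. The function names `mel_filterbank`, `mfcc_from_mdct`, `perceptual_hash`, and `bit_error_rate` are hypothetical.

```python
import numpy as np

def mel_filterbank(n_filters, n_bins, sample_rate):
    """Triangular mel filters over n_bins MDCT bins spanning 0 Hz to Nyquist."""
    hz_to_mel = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
    mel_to_hz = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    nyquist = sample_rate / 2.0
    # n_filters + 2 edge frequencies, equally spaced on the mel scale
    edges_hz = mel_to_hz(np.linspace(0.0, hz_to_mel(nyquist), n_filters + 2))
    # Centre frequency of MDCT bin k is (k + 0.5) * nyquist / n_bins
    bin_hz = (np.arange(n_bins) + 0.5) * nyquist / n_bins
    fb = np.zeros((n_filters, n_bins))
    for i in range(n_filters):
        lo, mid, hi = edges_hz[i], edges_hz[i + 1], edges_hz[i + 2]
        rising = (bin_hz - lo) / (mid - lo)
        falling = (hi - bin_hz) / (hi - mid)
        fb[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

def mfcc_from_mdct(mdct_frames, sample_rate=44100, n_filters=24, n_ceps=12):
    """Map MDCT frames (n_frames x n_bins) to MFCC-like features (n_frames x n_ceps)."""
    mdct_frames = np.asarray(mdct_frames, dtype=float)
    fb = mel_filterbank(n_filters, mdct_frames.shape[1], sample_rate)
    # Squared MDCT coefficients stand in for the power spectrum (assumption)
    log_mel = np.log((mdct_frames ** 2) @ fb.T + 1e-10)
    # Unnormalized DCT-II over the mel bands
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), n + 0.5) / n_filters)
    return log_mel @ basis.T

def perceptual_hash(mfcc):
    """One bit per frame: 1 if the frame's summed coefficients exceed the median.

    Illustrative rule only; the paper's binarization is not given in the abstract.
    """
    frame_feature = mfcc.sum(axis=1)
    return (frame_feature > np.median(frame_feature)).astype(np.uint8)

def bit_error_rate(h1, h2):
    """Fraction of differing bits between two equal-length hashes."""
    return float(np.mean(h1 != h2))

# Usage with random stand-in data (real input would come from an MP3 decoder)
rng = np.random.default_rng(0)
original = rng.standard_normal((50, 576))
tampered = original.copy()
tampered[20:25] = rng.standard_normal((5, 576))  # replace five frames
h_orig = perceptual_hash(mfcc_from_mdct(original))
h_tamp = perceptual_hash(mfcc_from_mdct(tampered))
print("BER:", bit_error_rate(h_orig, h_tamp))
print("differing frames:", np.flatnonzero(h_orig != h_tamp))
```

In a scheme of this kind, a low bit error rate between two hashes suggests the same speech content, and the positions of differing bits give a rough indication of where tampering occurred. With the median threshold used here, replacing frames can shift the median and flip bits outside the tampered region, so the localization in this sketch is only approximate.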
Keywords: Algorithms; Internet; Robustness (control systems); Speech; Speech recognition; Auditory feature; Compressed domain; MDCT coefficients; Mel frequency cepstral coefficient; MFCC features; Perceptual hashing; Speech content; Tampering localization
Indexed by: EI
Language: English
Publisher: Journal of Chemical and Pharmaceutical Research, 3/668 Malviya Nagar, Jaipur, Rajasthan, India
EI accession number: 20143900076369
EI main heading: Identification (control systems)
EI classification codes: 716 Telecommunication; Radar, Radio and Television - 717 Optical Communication - 718 Telephone Systems and Related Technologies; Line Communications - 723 Computer Software, Data Handling and Applications - 731.1 Control Systems - 751.5 Speech - 921 Mathematics
Source database: Compendex
Classification codes: 716 Telecommunication; Radar, Radio and Television - 717 Optical Communication - 718 Telephone Systems and Related Technologies; Line Communications - 723 Computer Software, Data Handling and Applications - 731.1 Control Systems - 751.5 Speech - 921 Mathematics
Document type: Journal article
Item identifier: https://ir.lut.edu.cn/handle/2XXMBERH/113617
Collection: School of Computer and Communication
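Author affiliations: 1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China;
2. School of Communication and Information Engineering, Shanghai University, Shanghai, China
First author affiliation: Lanzhou University of Technology
First affiliation of the first author: Lanzhou University of Technology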
Recommended citation
GB/T 7714
Zhang, Qiu-Yu,Liu, Yang-Wei,Di, Yan-Jun,et al. MFCC-based perceptual hashing for compressed domain of speech content identification[J]. Journal of Chemical and Pharmaceutical Research,2014,6(7):379-386.
APA Zhang, Qiu-Yu,Liu, Yang-Wei,Di, Yan-Jun,Zhang, Qian-Yun,&Xing, Peng-Fei.(2014).MFCC-based perceptual hashing for compressed domain of speech content identification.Journal of Chemical and Pharmaceutical Research,6(7),379-386.
MLA Zhang, Qiu-Yu,et al."MFCC-based perceptual hashing for compressed domain of speech content identification".Journal of Chemical and Pharmaceutical Research 6.7(2014):379-386.