Audio fingerprint retrieval method based on feature dimension reduction and feature combination
Zhang, Qiu-Yu; Xu, Fu-Jiu; Bai, Jian
2021-02-28
Source PublicationKSII Transactions on Internet and Information Systems
ISSN1976-7277
Volume15Issue:2Pages:522-539
AbstractIn order to solve the problems of the existing audio fingerprint method when extracting audio fingerprints from long speech segments, such as too large fingerprint dimension, poor robustness, and low retrieval accuracy and efficiency, a robust audio fingerprint retrieval method based on feature dimension reduction and feature combination is proposed. Firstly, the Mel-frequency cepstral coefficient (MFCC) and linear prediction cepstrum coefficient (LPCC) of the original speech are extracted respectively, and the MFCC feature matrix and LPCC feature matrix are combined. Secondly, the feature dimension reduction method based on information entropy is used for column dimension reduction, and the feature matrix after dimension reduction is used for row dimension reduction based on energy feature dimension reduction method. Finally, the audio fingerprint is constructed by using the feature combination matrix after dimension reduction. When speech’s user retrieval, the normalized Hamming distance algorithm is used for matching retrieval. Experiment results show that the proposed method has smaller audio fingerprint dimension and better robustness for long speech segments, and has higher retrieval efficiency while maintaining a higher recall rate and precision rate. Copyright © 2021 KSII
KeywordHamming distance Information retrieval Speech analysis Dimension reduction Distance algorithm Feature combination Feature dimensions Information entropy Mel-frequency cepstral coefficients Retrieval accuracy Retrieval efficiency
DOI10.3837/tiis.2021.02.008
Indexed ByEI
Language英语
PublisherKorean Society for Internet Information
EI Accession Number20211310139757
EI KeywordsDimensionality reduction
EI Classification Number751.5 Speech ; 903.3 Information Retrieval and Use
Citation statistics
Cited Times [WOS]:0   [WOS Record]     [Related Records in WOS]
Document Type期刊论文
Identifierhttp://ir.lut.edu.cn/handle/2XXMBERH/148465
Collection计算机与通信学院
AffiliationSchool of computer and communication, Lanzhou University of Technology, Lanzhou; 730050, China
First Author AffilicationLanzhou University of Technology
First Signature AffilicationLanzhou University of Technology
Recommended Citation
GB/T 7714
Zhang, Qiu-Yu,Xu, Fu-Jiu,Bai, Jian. Audio fingerprint retrieval method based on feature dimension reduction and feature combination[J]. KSII Transactions on Internet and Information Systems,2021,15(2):522-539.
APA Zhang, Qiu-Yu,Xu, Fu-Jiu,&Bai, Jian.(2021).Audio fingerprint retrieval method based on feature dimension reduction and feature combination.KSII Transactions on Internet and Information Systems,15(2),522-539.
MLA Zhang, Qiu-Yu,et al."Audio fingerprint retrieval method based on feature dimension reduction and feature combination".KSII Transactions on Internet and Information Systems 15.2(2021):522-539.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Zhang, Qiu-Yu]'s Articles
[Xu, Fu-Jiu]'s Articles
[Bai, Jian]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zhang, Qiu-Yu]'s Articles
[Xu, Fu-Jiu]'s Articles
[Bai, Jian]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhang, Qiu-Yu]'s Articles
[Xu, Fu-Jiu]'s Articles
[Bai, Jian]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.