Fuzzy clustering based on semantic body and its application in Chinese spam filtering
Zhang, Qiu-yu1,2; Yang, Hui-juan1; Wang, Peng1; Ma, Wei1
2011-04-01
发表期刊International Journal of Digital Content Technology and its Applications
ISSN19759339
卷号5期号:4页码:1-11
摘要E-mail's text is the main body of an E-mail. Its content is reflected by semantic body formed by a large number of semantic elements, so it is the most authoritative and effective to study semantic body information of spam when analyzing its text. Firstly, this paper takes the advantage of HowNet in analysis of semantic element and analyze semantic bodies in email text, then proposes the method of constructing semantic body and calculation ways of similarity between semantic bodies based on sentence similarity. Secondly, for the problem of Imprecision and Fuzziness existing in current spam filtering technology, we use fuzzy clustering method to solve it. Combining fuzzy clustering with the semantic body, the paper proposes the method of fuzzy clustering based on semantic body. It is different from the traditional methods that semantic body is used as the object to be classified and the similarity between semantic bodies used as similarity coefficient in the proposed method. The method reduces the dimension when we use fuzzy clustering method to deal with text clustering problem. Finally, we apply the new method of fuzzy clustering based on semantic body to spam filtering. The result of the experiment shows that this method is more objective in determining email content when comparing with the method of traditional email filtering in semantic unit. The proposed method reflects much better in recall rate of discernment of email for spam whose meaning is expressed unclearly.
关键词Cluster analysis Electronic mail Fuzzy clustering Fuzzy systems Chinese spam Equivalence relations Hownet Semantic bodies Similarity analysis
DOI10.4156/jdcta.vol5.issue4.22
收录类别EI
语种英语
出版者Advanced Institute of Convergence Information Technology
EI入藏号20111913969065
EI主题词Semantics
EI分类号723 Computer Software, Data Handling and Applications - 961 Systems Science
来源库Compendex
分类代码723 Computer Software, Data Handling and Applications - 961 Systems Science
引用统计
文献类型期刊论文
条目标识符https://ir.lut.edu.cn/handle/2XXMBERH/111607
专题计算机与通信学院
作者单位1.School of Computer and Communication, Lanzhou University of Technology, Lanzhou Gansu 730050, China;
2.Key Laboratory of Gansu Advanced Control for Industrial Processes, Lanzhou Gansu 730050, China
第一作者单位兰州理工大学
第一作者的第一单位兰州理工大学
推荐引用方式
GB/T 7714
Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,et al. Fuzzy clustering based on semantic body and its application in Chinese spam filtering[J]. International Journal of Digital Content Technology and its Applications,2011,5(4):1-11.
APA Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,&Ma, Wei.(2011).Fuzzy clustering based on semantic body and its application in Chinese spam filtering.International Journal of Digital Content Technology and its Applications,5(4),1-11.
MLA Zhang, Qiu-yu,et al."Fuzzy clustering based on semantic body and its application in Chinese spam filtering".International Journal of Digital Content Technology and its Applications 5.4(2011):1-11.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Zhang, Qiu-yu]的文章
[Yang, Hui-juan]的文章
[Wang, Peng]的文章
百度学术
百度学术中相似的文章
[Zhang, Qiu-yu]的文章
[Yang, Hui-juan]的文章
[Wang, Peng]的文章
必应学术
必应学术中相似的文章
[Zhang, Qiu-yu]的文章
[Yang, Hui-juan]的文章
[Wang, Peng]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。