Institutional Repository of Coll Comp & Commun
Fuzzy clustering based on semantic body and its application in Chinese spam filtering | |
Zhang, Qiu-yu1,2![]() | |
2011-04-01 | |
发表期刊 | International Journal of Digital Content Technology and its Applications
![]() |
ISSN | 19759339 |
卷号 | 5期号:4页码:1-11 |
摘要 | E-mail's text is the main body of an E-mail. Its content is reflected by semantic body formed by a large number of semantic elements, so it is the most authoritative and effective to study semantic body information of spam when analyzing its text. Firstly, this paper takes the advantage of HowNet in analysis of semantic element and analyze semantic bodies in email text, then proposes the method of constructing semantic body and calculation ways of similarity between semantic bodies based on sentence similarity. Secondly, for the problem of Imprecision and Fuzziness existing in current spam filtering technology, we use fuzzy clustering method to solve it. Combining fuzzy clustering with the semantic body, the paper proposes the method of fuzzy clustering based on semantic body. It is different from the traditional methods that semantic body is used as the object to be classified and the similarity between semantic bodies used as similarity coefficient in the proposed method. The method reduces the dimension when we use fuzzy clustering method to deal with text clustering problem. Finally, we apply the new method of fuzzy clustering based on semantic body to spam filtering. The result of the experiment shows that this method is more objective in determining email content when comparing with the method of traditional email filtering in semantic unit. The proposed method reflects much better in recall rate of discernment of email for spam whose meaning is expressed unclearly. |
关键词 | Cluster analysis Electronic mail Fuzzy clustering Fuzzy systems Chinese spam Equivalence relations Hownet Semantic bodies Similarity analysis |
DOI | 10.4156/jdcta.vol5.issue4.22 |
收录类别 | EI |
语种 | 英语 |
出版者 | Advanced Institute of Convergence Information Technology |
EI入藏号 | 20111913969065 |
EI主题词 | Semantics |
EI分类号 | 723 Computer Software, Data Handling and Applications - 961 Systems Science |
来源库 | Compendex |
分类代码 | 723 Computer Software, Data Handling and Applications - 961 Systems Science |
引用统计 | 无
|
文献类型 | 期刊论文 |
条目标识符 | https://ir.lut.edu.cn/handle/2XXMBERH/111607 |
专题 | 计算机与通信学院 |
作者单位 | 1.School of Computer and Communication, Lanzhou University of Technology, Lanzhou Gansu 730050, China; 2.Key Laboratory of Gansu Advanced Control for Industrial Processes, Lanzhou Gansu 730050, China |
第一作者单位 | 兰州理工大学 |
第一作者的第一单位 | 兰州理工大学 |
推荐引用方式 GB/T 7714 | Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,et al. Fuzzy clustering based on semantic body and its application in Chinese spam filtering[J]. International Journal of Digital Content Technology and its Applications,2011,5(4):1-11. |
APA | Zhang, Qiu-yu,Yang, Hui-juan,Wang, Peng,&Ma, Wei.(2011).Fuzzy clustering based on semantic body and its application in Chinese spam filtering.International Journal of Digital Content Technology and its Applications,5(4),1-11. |
MLA | Zhang, Qiu-yu,et al."Fuzzy clustering based on semantic body and its application in Chinese spam filtering".International Journal of Digital Content Technology and its Applications 5.4(2011):1-11. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论