Vol.40 No.12

Journal of Xi'an Jiaotong University

Jan.2006

retue.gif (1614 ×Ö½Ú)

zwb.gif (1647 ×Ö½Ú)

¡¡

Categorization Method of Realª²Time Web Page in Network Information Audit Systems
Liao Mingtao1£¬Zhang Deyun1£¬Hou Lin2£¬Li Jinku1
(1.Institute of Network£¬Xi'an¡¡Jiaotong¡¡University£¬Xi'an 710049£¬China£»2.Institute of Information and Control Engineering£¬Xi'an University of Architecture £¦ Technology£¬Xi'an 710055£¬China)

Abstract£ºTo accomplish the categorization of realª²time web page in network information audit systems£¬a new method based on Dempsterª²Shafer evidence theory is proposed£® The main idea of it is as follows£ºthe web page address and fragments are regarded as the evidence of categorization without reassembling IP fragments£¬ and then the support degree that each of evidence stands up for a category is calculated£®The Dempster combination rule is used to continuously fuse and adjudge the information provided by various evidences online£¬ and finally the categorized result is obtained£® When the existing evidences can efficiently categorize the web page£¬ the subsequence fragments are no need to be handled further£® The experiment shows that the precision rate and recall rate of the proposed method are larger than 83% and 90% respectively£¬ and it is superior to the fuzzy ª«Kª« nearest neighbor algorithm based on fragments in the categorization performance and running time£®
Keywords£ºnetwork information audit£»web page categorization£»evidence theory