The task of clustering Web sessions is to group Web sessions based on similarity and consists of maximizing the intra-group similarity while minimizing the inter-group similarity.The first and foremost question needed to be considered in clusteringW b sessions is how to measure the similarity between Websessions.However.there are many shortcomings in traditiona1measurements.This paper introduces a new method for measuringsimilarities between Web pages that takes into account not only theURL but also the viewing time of the visited web page.Yhen wegive a new method to measure the similarity of Web sessions usingsequence alignment and the similarity of W eb page access in detailExperiments have proved that our method is valid and e币cient.
猜您喜欢
评论