바로가기메뉴

본문 바로가기 주메뉴 바로가기
 

Journal of Korean Library and Information Science Society

  • P-ISSN2466-2542
  • KCI

Clustering of Web Document Exploiting with the Union of Term Frequency and Co-link in Hypertext

Journal of Korean Library and Information Science Society / Journal of Korean Library and Information Science Society, (P)2466-2542;
2003, v.34 no.3, pp.211-229





Abstract

In this paper, we have focused that the number of word in the web document affects definite clustering performance. Our experimental results have clearly shown the relationship between the amounts of word and its impact on clustering performance. We also have presented an algorithm that can be supplemented of the contrast portion through co-links frequency of web documents. Testing bench of this research is 1,449 web documents included on 'Natural science' category among the Naver Directory. We have clustered these objects by term-based clustering, link-based clustering, and hybrid clustering method, and compared the output results with originally allocated category of Naver directory.

keywords
단어기반 클러스터링, 링크기반 클러스터링, 단어-링크 혼합 클러스터링, 동시링크, 텀 벡터

Journal of Korean Library and Information Science Society