Publication Details

Jalali, A., Oroumchian, F. & Hejazi, M. R. 2004, 'Comparison of different distance measures on hierarchical document clustering in 2-pass retrieval', WSEAS Transactions on Computers, vol. 3, no. 3, pp. 725-731.


Hierarchic document clustering has been applied to search results (query-specific clustering) on the grounds of its potential improved effectiveness compared both to that of static clustering and of conventional inverted file search (IFS).

In this paper we review and compare the effects of seven different measures of similarity among docwnents in hierarchic query specific clustering. We have conducted a number of experiments using OHSUMED document collection. The Experiments seems to indicate that the choice of similarity measure effects positively or negatively the quality of clustering.