Scopus Harvesting Series

SKIFF: Spherical K-means with iterative feature filtering for text document clustering

Iti Sharma, Govt. College Kota
Abhay Sharma, Manipal University Jaipur
Rekha Chaturvedi, Manipal University Jaipur
Jitendra Rajpurohit, Manipal University Jaipur
Manoj Kumar, University of Wollongong in Dubai

Publication Name

Journal of Information Science

Abstract

Text clustering has been an overlooked field of text mining that requires more attention. Several applications require automatic text organisation which relies on an information retrieval system based on organised search results. Spherical k-means is a successful adaptation of the classic k-means algorithm for text clustering. However, conventional methods to accelerate k-means may not apply to spherical k-means due to the different nature of text document data. The proposed work introduces an iterative feature filtering technique that reduces the data size during the process of clustering which further produces more feature-relevant clusters in less time compared to classic spherical k-means. The novelty of the proposed method is that feature assessment is distinct from the objective function of clustering and derived from the cluster structure. Experimental results show that the proposed scheme achieves computation speed without sacrificing cluster quality over popular text corpora. The demonstrated results are satisfactory and outperform compared to recent works in this domain.

Open Access Status

This publication may be available as open access

Link to Full Text

COinS

Link to publisher version (DOI)

http://dx.doi.org/10.1177/01655515231165230

Scopus Harvesting Series

SKIFF: Spherical K-means with iterative feature filtering for text document clustering

Publication Name

Abstract

Open Access Status

Link to publisher version (DOI)

Search

Browse

Links

Scopus Harvesting Series

SKIFF: Spherical K-means with iterative feature filtering for text document clustering

Authors

Publication Name

Abstract

Open Access Status

Share

Link to publisher version (DOI)

Search

Browse

Links