University of Wollongong

Quality information retrieval for the World Wide Web

conference contribution
posted on 2024-11-14, 11:01 authored by Milly Kc, Markus Hagenbuchner, Ah Chung Tsoi
The World Wide Web is an unregulated communication medium with very limited means of quality control. Quality assurance has become a key issue for many information retrieval services on the Internet, e.g. web search engines. This paper introduces evaluation and assessment methods for the quality of web pages. The proposed quality evaluation mechanisms are based on a set of quality criteria extracted from a targeted user survey. A weighted algorithmic interpretation of the most significant user-quoted quality criteria is proposed. In addition, the paper uses machine learning methods to predict the quality of web pages before they are downloaded. The set of quality criteria allows us to implement a web search engine with quality ranking schemes, leading to web crawlers that can crawl quality web pages directly. The proposed approaches produce promising results on a sizable web repository.
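
To make the idea of a weighted interpretation of quality criteria concrete, the following is a minimal sketch only, not the authors' method. The criterion names (`accuracy`, `currency`, `presentation`), the weights, and the per-page scores are all hypothetical; the abstract does not list the surveyed criteria or their weights. The sketch assumes each criterion has already been scored in the range 0 to 1 and combines the scores as a weighted mean, which is then used to rank pages.

```python
from typing import Mapping


def weighted_quality_score(
    criteria_scores: Mapping[str, float],
    weights: Mapping[str, float],
) -> float:
    """Return the weighted mean of per-criterion scores.

    A criterion missing from `criteria_scores` is treated as 0.0.
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(
        weight * criteria_scores.get(name, 0.0)
        for name, weight in weights.items()
    )
    return weighted_sum / total_weight


# Hypothetical weights, standing in for importance derived from a user survey.
weights = {"accuracy": 0.5, "currency": 0.3, "presentation": 0.2}

# Hypothetical per-criterion scores for two pages.
pages = {
    "page_a": {"accuracy": 0.9, "currency": 0.4, "presentation": 0.7},
    "page_b": {"accuracy": 0.4, "currency": 0.9, "presentation": 0.9},
}

# Rank pages from highest to lowest combined quality score.
ranked = sorted(
    pages,
    key=lambda name: weighted_quality_score(pages[name], weights),
    reverse=True,
)
```

With these made-up numbers, `page_a` ranks above `page_b` because the heavily weighted `accuracy` criterion dominates. A quality ranking scheme of the kind the abstract mentions could plausibly order results this way, though the paper's actual formulation may differ.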

History

Citation

Kc, M. W., Hagenbuchner, M. & Tsoi, A. (2008). Quality information retrieval for the World Wide Web. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (pp. 655-661). Australia: IEEE.

Parent title

Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008

Pagination

655-661

Language

English

RIS ID

25475
