Sampling Frequency Tuning Tool

General Information

Download Sampling Frequency Tuning Tool as PDF (544 KB).

Title: Sampling Frequency Tuning Tool.
Author(s): Wu Yang, Jin Qi and Sun Wei.
Published date: December 2005.
Published at: Web-Mining and Data Analysis 2005

Abstract


The goal of this project is to find ways to optimise the crawl frequency for individual web sites. The idea is to avoid re-crawling a site when its accessibility has not changed since the last crawl. Focusing resources on actual changes is a challenge shared by all search engines. Another relevant aspect of sampling is selecting a significant and representative set of sites.
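
To make the idea of tuning crawl frequency concrete, the following is a minimal sketch of one common heuristic: lengthen the interval between crawls when a site is found unchanged, and shorten it when a change is detected. It is not taken from the paper; the function name `next_interval`, the doubling factor, and the 1 to 90 day bounds are all assumptions chosen for illustration.

```python
def next_interval(current_days, changed, min_days=1.0, max_days=90.0, factor=2.0):
    """Return the next recrawl interval in days (hypothetical heuristic).

    If the last crawl found a change, visit sooner (divide by factor).
    If nothing changed, back off (multiply by factor).
    The result is clamped to the range [min_days, max_days].
    """
    if changed:
        proposed = current_days / factor
    else:
        proposed = current_days * factor
    return max(min_days, min(max_days, proposed))


if __name__ == "__main__":
    # Simulated crawl history for one site: True means a change was detected.
    interval = 7.0
    for changed in [False, False, True, False]:
        interval = next_interval(interval, changed)
        print(f"changed={changed} -> next crawl in {interval} days")
```

Under this scheme a site that rarely changes drifts towards the maximum interval, so crawl resources shift to sites where changes are actually observed. The clamping keeps a single noisy observation from pushing a site to an extreme schedule.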

The author of this document is:
Morten Goodwin
E-mail address is:
morten.goodwin circle-a tingtun.no
Phone is:
+47 95 24 86 79
