Checker

Component description

The MKSearch checker component is essentially an integration layer between the data acquisition system and the repository, ensuring the currency of the data store. The checker component does not exist as a distinct component in the beta 1 version of MKSearch, a new repository is created for each crawl session.

Completed task information has been moved to the beta 1 checker plans archive.

Beta 2 development plans

Check un-linked documents
At present, the crawler component pushes the whole data aquisition process by following published hyperlinks and creates a new repository for each session. However, with an incremental indexing scheme, previously indexed documents may be removed between sessions and un-linked. In this case, the crawler will not discover resources are obsolete. The checker component therefore needs periodically to check whether "old" source documents still exist.

Document Links

beta 1 checker plans
Summary task and progress notes for the beta 1 release of the MKSearch checker component
http://mksearch.mkdoc.org.archived.website/plans/beta-1-release-tasks/beta-1-checker-plans/
This document was last modified on 2005-08-04 08:16:22.
Copyright MKDoc Ltd. and others.
The Free Documentation License http://www.gnu.org/copyleft/fdl.html