Skip Navigation

Spiders

J-Spider

JoBo

Arachnid

Spindle

Acme Spider

Metis

Heritrix

HouseSpider

WebLech

Excluded spiders

Link mappers

Content parsers

RDF Crawlers

Sign up

If you sign up for an account on this web site you can customise elements of this site and subscribe to an email newsletter.

If you have an account on this web site you may login.

If you have an account on this site but have forgotten your user name and / or your password then you can request an account reminder email.

HouseSpider

HouseSpider is designed for use in applets, so probably is not suitable. However, it is released under the GPL licence. It has no dependencies beyond the standard Java class library, the package only has 14 classes.

Initial review notes

HouseSpider is a relatively advanced site search applet in so far as it generates and searches index files, but it is limited by its intended usage. Much of the HTML parsing and metadata extraction uses fairly rigid string matching. There is some food for thought, such as handling the HTML base element for relative URLs, but overall HouseSpider will not be suitable for the MKSearch project.

<< | Up | >>

This document was last modified by Philip Shaw on 2004-11-04 04:01:57
Copyright MKDoc Ltd. and others.
The Free Documentation License http://www.gnu.org/copyleft/fdl.html