JSpider start-up process
The JSpider engine has quite a complex start-up process that uses a range of property files, static factory methods, abstractions and reflective invocation. These notes are to clarify the origin of runtime properties, configuration and storage facilities. Most class references refer to JSpider library classes.
JSpider main entry point
The primary JSpider class takes two arguments: a base URL to start spidering, and a configuration directory reference. If no configuration is specified, a directory called "default" is checked for configuration property files.
The JSpider class completes the following steps:
- Loads the relevant configuration using the static
ConfigurationFactory.getConfigurationmethod. - Creates a new
JSpiderinstance. - Calls the
startmethod on theJSpiderinstance.
ConfigurationFactory
The two configuration factory methods that may be called by JSpider create a singleton JSpiderConfiguration instance from a PropertiesConfiguration. Methods also exist to assign a JSpiderConfiguration directly and "clean" the configuration by assigning a null reference.
Copyright MKDoc Ltd. and others.
The Free Documentation License http://www.gnu.org/copyleft/fdl.html