Package org.archive.crawler.admin

Contains classes that the web UI uses to monitor and control crawls.

See:
          Description

Class Summary
CrawlJob A CrawlJob encapsulates a 'crawl order' with any and all information and methods needed by a CrawlJobHandler to accept and execute them.
CrawlJobErrorHandler An implementation of the ValueErrorHandler for the UI.
CrawlJobHandler This class manages CrawlJobs.
SeedRecord Record of all interesting info about the most-recent processing of a specific seed.
StatisticsSummary This class provides descriptive statistics of a finished crawl job by using the crawl report files generated by StatisticsTracker.
StatisticsTracker This is an implementation of the AbstractTracker.
 

Exception Summary
InvalidJobFileException An exception that is thrown when a program encounters a jobfile that is corrupt or otherwise incomplete or invalid.
 

Package org.archive.crawler.admin Description

Contains classes that the web UI uses to monitor and control crawls. Some utilities classes used exclusively or primarily for the UI are also included.

Most of the heavy duty work is done by the CrawlJobHandler that manages most of the interaction between the UI and the the CrawlController. The CrawlJob class serves to encapsulate the settings needed to launch one crawl.

This package also provides an implementation of the Statistics Tracking interface that contains useful methods to access progress data. This is used for monitoring crawls. While it is technically possible to launch jobs without this statistics tracker, it would render the UI inoperable as far as monitoring the progress of that crawl.



Copyright © 2003-2011 Internet Archive. All Rights Reserved.