org.archive.crawler.admin (Heritrix 1.15.5-201106092337)


Package org.archive.crawler.admin

Contains classes that the web UI uses to monitor and control crawls.


Class Summary
CrawlJob	A CrawlJob encapsulates a 'crawl order' with any and all information and methods needed by a CrawlJobHandler to accept and execute it.
CrawlJobErrorHandler	An implementation of the ValueErrorHandler for the UI.
CrawlJobHandler	This class manages CrawlJobs.
SeedRecord	Record of all interesting info about the most-recent processing of a specific seed.
StatisticsSummary	This class provides descriptive statistics of a finished crawl job by using the crawl report files generated by StatisticsTracker.
StatisticsTracker	This is an implementation of the AbstractTracker.

Exception Summary
InvalidJobFileException	An exception that is thrown when a program encounters a jobfile that is corrupt or otherwise incomplete or invalid.

Package org.archive.crawler.admin Description

Contains classes that the web UI uses to monitor and control crawls. Some utility classes used exclusively or primarily by the UI are also included.

Most of the heavy-duty work is done by the CrawlJobHandler, which manages most of the interaction between the UI and the CrawlController. The CrawlJob class encapsulates the settings needed to launch one crawl.
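
The division of labor described above can be illustrated with a minimal, self-contained sketch. It does not use the Heritrix API: SketchJob, SketchController and SketchJobHandler are hypothetical stand-ins for CrawlJob, CrawlController and CrawlJobHandler, and every method name is an assumption made for illustration only.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for a crawl job: a name plus the settings for one crawl.
final class SketchJob {
    final String name;
    final Map<String, String> settings = new LinkedHashMap<>();
    String status = "PENDING";

    SketchJob(String name) {
        this.name = name;
    }
}

// Hypothetical stand-in for the component that actually runs a crawl.
interface SketchController {
    void run(Map<String, String> settings);
}

// Hypothetical handler: the UI talks only to this class, never to the controller.
final class SketchJobHandler {
    private final Deque<SketchJob> pending = new ArrayDeque<>();
    private final SketchController controller;

    SketchJobHandler(SketchController controller) {
        this.controller = controller;
    }

    // Queues a job; jobs are started in the order they were added.
    void addJob(SketchJob job) {
        pending.addLast(job);
    }

    int pendingCount() {
        return pending.size();
    }

    // Takes the oldest pending job, runs it to completion, and returns it.
    // Returns null when no job is pending.
    SketchJob startNext() {
        SketchJob job = pending.pollFirst();
        if (job == null) {
            return null;
        }
        job.status = "RUNNING";
        controller.run(job.settings);
        job.status = "FINISHED";
        return job;
    }
}
```

In this sketch the handler runs a job synchronously for brevity; a real crawl runs for a long time, so an actual handler would start it in the background and report state changes back to the UI.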

This package also provides an implementation of the statistics tracking interface, with methods for accessing progress data that the UI uses to monitor crawls. While it is technically possible to launch jobs without this statistics tracker, the UI would then be unable to monitor the progress of that crawl.
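
To show why the UI depends on a tracker for monitoring, here is a minimal sketch of the idea: the crawl reports events to a tracker, and the UI only ever reads derived progress figures from it. SketchProgressTracker and all of its methods are hypothetical and are not the StatisticsTracker API.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical progress tracker: the crawl side calls the uri* methods,
// the monitoring side calls the read-only accessors.
final class SketchProgressTracker {
    private final AtomicLong discovered = new AtomicLong();
    private final AtomicLong finished = new AtomicLong();

    // Called by the crawl when a new URI is found.
    void uriDiscovered() {
        discovered.incrementAndGet();
    }

    // Called by the crawl when processing of a URI completes.
    void uriFinished() {
        finished.incrementAndGet();
    }

    long discoveredCount() {
        return discovered.get();
    }

    long finishedCount() {
        return finished.get();
    }

    // URIs found but not yet processed.
    long queuedCount() {
        return discovered.get() - finished.get();
    }

    // Share of discovered URIs already processed, as a percentage;
    // 0.0 when nothing has been discovered yet.
    double percentDone() {
        long d = discovered.get();
        return d == 0 ? 0.0 : 100.0 * finished.get() / d;
    }
}
```

Without such an object there is nothing for a monitoring page to query, which is the situation the paragraph above describes for jobs launched without the statistics tracker.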