Class Summary |
FetchHistoryProcessor |
Maintain a history of fetch information inside the CrawlURI's attributes. |
PersistLoadProcessor |
Store CrawlURI attributes from latest fetch to persistent storage for
consultation by a later recrawl. |
PersistLogProcessor |
Log CrawlURI attributes from latest fetch for consultation by a later
recrawl. |
PersistOnlineProcessor |
Common superclass for persisting Processors which directly store/load
to persistence (as opposed to logging for batch load later). |
PersistProcessor |
Superclass for Processors which utilize BDB-JE for URI state
(including most notably history) persistence. |
PersistStoreProcessor |
Store CrawlURI attributes from latest fetch to persistent storage for
consultation by a later recrawl. |