Package org.archive.crawler.processor.recrawl

Class Summary
FetchHistoryProcessor Maintain a history of fetch information inside the CrawlURI's attributes.
PersistLoadProcessor Store CrawlURI attributes from latest fetch to persistent storage for consultation by a later recrawl.
PersistLogProcessor Log CrawlURI attributes from latest fetch for consultation by a later recrawl.
PersistOnlineProcessor Common superclass for persisting Processors which directly store/load to persistence (as opposed to logging for batch load later).
PersistProcessor Superclass for Processors which utilize BDB-JE for URI state (including most notably history) persistence.
PersistStoreProcessor Store CrawlURI attributes from latest fetch to persistent storage for consultation by a later recrawl.
 



Copyright © 2003-2011 Internet Archive. All Rights Reserved.