org.archive.crawler.processor.recrawl
Class PersistLoadProcessor
java.lang.Object
  javax.management.Attribute
    org.archive.crawler.settings.Type
      org.archive.crawler.settings.ComplexType
        org.archive.crawler.settings.ModuleType
          org.archive.crawler.framework.Processor
            org.archive.crawler.processor.recrawl.PersistProcessor
              org.archive.crawler.processor.recrawl.PersistOnlineProcessor
                org.archive.crawler.processor.recrawl.PersistLoadProcessor
- All Implemented Interfaces:
- java.io.Serializable, javax.management.DynamicMBean
public class PersistLoadProcessor
- extends PersistOnlineProcessor
Load CrawlURI attributes persisted by an earlier fetch from persistent
storage so that the current recrawl can consult them.
- Version:
- $Date: 2006-09-25 20:19:54 +0000 (Mon, 25 Sep 2006) $, $Revision: 4654 $
- Author:
- gojomo
- See Also:
- Serialized Form
Field Summary
static java.lang.String ATTR_PRELOAD_SOURCE
- file (log) or directory (state/env) from which to preload history
Method Summary
protected com.sleepycat.collections.StoredSortedMap<java.lang.String,st.ata.util.AList> initStore()
protected void innerProcess(CrawlURI curi)
- Classes subclassing this one should override this method to perform their custom actions on the CrawlURI.
Methods inherited from class org.archive.crawler.framework.Processor
checkForInterrupt, getController, getDecideRule, getDefaultNextProcessor, innerRejectProcess, isContentToProcess, isEnabled, isExpectedMimeType, isHttpTransactionContentToProcess, kickUpdate, process, report, rulesAccept, rulesAccept, setDefaultNextProcessor, spawn
Methods inherited from class org.archive.crawler.settings.ComplexType
addElementToDefinition, checkValue, earlyInitialize, getAbsoluteName, getAttribute, getAttribute, getAttribute, getAttributeInfo, getAttributeInfo, getAttributeInfoIterator, getAttributes, getDataContainerRecursive, getDataContainerRecursive, getDefaultValue, getDescription, getElementFromDefinition, getLegalValues, getLocalAttribute, getMBeanInfo, getMBeanInfo, getParent, getPreservedFields, getSettingsHandler, getUncheckedAttribute, getValue, globalSettings, invoke, isInitialized, isOverridden, iterator, removeElementFromDefinition, setAsOrder, setAttribute, setAttribute, setAttributes, setDescription, setPreservedFields, toString, unsetAttribute
Methods inherited from class org.archive.crawler.settings.Type
addConstraint, equals, getConstraints, getLegalValueType, isExpertSetting, isOverrideable, isTransient, setExpertSetting, setLegalValueType, setOverrideable, setTransient
Methods inherited from class javax.management.Attribute
getName, hashCode
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
ATTR_PRELOAD_SOURCE
public static final java.lang.String ATTR_PRELOAD_SOURCE
- file (log) or directory (state/env) from which to preload history
- See Also:
- Constant Field Values
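The field description distinguishes two kinds of preload source: a file (a log) or a directory (a state/env). As a rough illustration only, the sketch below shows one way such a path could be classified before any history is read. The class `PreloadSourceKind`, its `Kind` enum, and the `classify` method are hypothetical names made up for this example and are not part of the Heritrix API; the processor's actual handling may differ.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of Heritrix: classifies a preload source path.
public class PreloadSourceKind {

    // The two source kinds named in the field description, plus a fallback.
    enum Kind { LOG_FILE, ENVIRONMENT_DIRECTORY, MISSING }

    // A directory is assumed to be a state/env directory; a regular file is
    // assumed to be a log. Anything else is reported as missing.
    static Kind classify(Path source) {
        if (Files.isDirectory(source)) {
            return Kind.ENVIRONMENT_DIRECTORY;
        }
        if (Files.isRegularFile(source)) {
            return Kind.LOG_FILE;
        }
        return Kind.MISSING;
    }

    public static void main(String[] args) throws IOException {
        // Temporary paths stand in for a real crawl's state directory and log.
        Path dir = Files.createTempDirectory("state");
        Path log = Files.createTempFile("persist", ".log");
        System.out.println(dir + " -> " + classify(dir));
        System.out.println(log + " -> " + classify(log));
        Files.delete(log);
        Files.delete(dir);
    }
}
```

Checking the path type up front keeps the "log" and "state/env" cases separate, so each can be opened with the appropriate reader.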
PersistLoadProcessor
public PersistLoadProcessor(java.lang.String name)
- Usual constructor
- Parameters:
name
- the name of this processor module
initStore
protected com.sleepycat.collections.StoredSortedMap<java.lang.String,st.ata.util.AList> initStore()
- Overrides:
initStore
in class PersistOnlineProcessor
innerProcess
protected void innerProcess(CrawlURI curi)
throws java.lang.InterruptedException
- Description copied from class:
Processor
- Classes subclassing this one should override this method to perform
their custom actions on the CrawlURI.
- Overrides:
innerProcess
in class Processor
- Parameters:
curi
- The CrawlURI being processed.
- Throws:
java.lang.InterruptedException
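To make the load step more concrete, here is a minimal sketch of merging previously stored attributes into the attributes of the URI currently being processed. It is an illustration under stated assumptions, not the Heritrix implementation: a `java.util.TreeMap` stands in for the `StoredSortedMap` returned by `initStore()`, a plain `Map<String, Object>` stands in for `st.ata.util.AList`, and the class `HistoryLoadSketch`, the method `loadHistory`, and the attribute names are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical stand-in for the load step; none of these names are Heritrix API.
public class HistoryLoadSketch {

    // Looks up the history stored under the given key and, if present, copies
    // its entries into the current URI's attributes. Stored entries overwrite
    // current entries that share the same attribute name.
    // Returns true when a stored history was found and merged.
    static boolean loadHistory(SortedMap<String, Map<String, Object>> store,
                               String key,
                               Map<String, Object> uriAttributes) {
        Map<String, Object> prior = store.get(key);
        if (prior == null) {
            return false; // nothing recorded for this URI by an earlier crawl
        }
        uriAttributes.putAll(prior);
        return true;
    }

    public static void main(String[] args) {
        // Assumed store contents: one URI with a digest from an earlier fetch.
        SortedMap<String, Map<String, Object>> store = new TreeMap<>();
        Map<String, Object> prior = new HashMap<>();
        prior.put("content-digest", "sha1:ABC");
        store.put("http://example.com/", prior);

        // Attributes of the URI in the current crawl.
        Map<String, Object> attrs = new HashMap<>();
        attrs.put("fetch-status", 200);

        boolean merged = loadHistory(store, "http://example.com/", attrs);
        System.out.println("merged=" + merged + " attrs=" + attrs);
    }
}
```

A sorted map keyed by a per-URI string mirrors the `StoredSortedMap<String, AList>` signature above; how the real processor derives the key and resolves conflicting attributes is not stated on this page.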
Copyright © 2003-2011 Internet Archive. All Rights Reserved.