|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object javax.management.Attribute org.archive.crawler.settings.Type org.archive.crawler.settings.ComplexType org.archive.crawler.settings.ModuleType org.archive.crawler.datamodel.CrawlOrder
public class CrawlOrder
Represents the 'root' of the settings hierarchy. Contains those settings that do not belong to any specific module, but rather relate to the crawl as a whole (much of this is used by the CrawlController directly or indirectly).
ModuleType
,
Serialized FormNested Class Summary |
---|
Nested classes/interfaces inherited from class org.archive.crawler.settings.ComplexType |
---|
ComplexType.MBeanAttributeInfoIterator |
Field Summary | |
---|---|
static java.lang.String |
ATTR_BDB_CACHE_PERCENT
Percentage of heap to allocate to bdb cache |
static java.lang.String |
ATTR_CHECKPOINT_COPY_BDBJE_LOGS
When checkpointing, copy the bdb logs. |
static java.lang.String |
ATTR_CHECKPOINTS_PATH
|
static java.lang.String |
ATTR_DISK_PATH
|
static java.lang.String |
ATTR_EXTRACT_PROCESSORS
|
static java.lang.String |
ATTR_FETCH_PROCESSORS
|
static java.lang.String |
ATTR_FROM
|
static java.lang.String |
ATTR_HTTP_HEADERS
|
static java.lang.String |
ATTR_INDEPENDENT_EXTRACTORS
|
static java.lang.String |
ATTR_LOGGERS
|
static java.lang.String |
ATTR_LOGS_PATH
|
static java.lang.String |
ATTR_MAX_BYTES_DOWNLOAD
|
static java.lang.String |
ATTR_MAX_DOCUMENT_DOWNLOAD
|
static java.lang.String |
ATTR_MAX_TIME_SEC
|
static java.lang.String |
ATTR_MAX_TOE_THREADS
|
static java.lang.String |
ATTR_NAME
|
static java.lang.String |
ATTR_POST_PROCESSORS
|
static java.lang.String |
ATTR_PRE_FETCH_PROCESSORS
|
static java.lang.String |
ATTR_RECORDER_IN_BUFFER
|
static java.lang.String |
ATTR_RECORDER_OUT_BUFFER
|
static java.lang.String |
ATTR_RECOVER_PATH
|
static java.lang.String |
ATTR_RECOVER_RETAIN_FAILURES
|
static java.lang.String |
ATTR_RECOVER_SCOPE_ENQUEUES
|
static java.lang.String |
ATTR_RECOVER_SCOPE_INCLUDES
|
static java.lang.String |
ATTR_RULES
|
static java.lang.String |
ATTR_SCRATCH_PATH
|
static java.lang.String |
ATTR_SETTINGS_DIRECTORY
|
static java.lang.String |
ATTR_STATE_PATH
|
static java.lang.String |
ATTR_USER_AGENT
|
static java.lang.String |
ATTR_WRITE_PROCESSORS
|
static java.lang.Boolean |
DEFAULT_CHECKPOINT_COPY_BDBJE_LOGS
|
Fields inherited from class org.archive.crawler.settings.ComplexType |
---|
definition, definitionMap |
Constructor Summary | |
---|---|
CrawlOrder()
Construct a CrawlOrder. |
Method Summary | |
---|---|
void |
checkUserAgentAndFrom()
Checks if the User Agent and From field are set 'correctly' in the specified Crawl Order. |
java.io.File |
getCheckpointsDirectory()
|
CrawlController |
getController()
|
java.lang.String |
getCrawlOrderName()
Get the name of the order file. |
java.lang.String |
getFrom(CrawlURI curi)
|
MapType |
getLoggers()
Returns the Map of the StatisticsTracking modules that are included in the configuration that the current instance of this class is representing. |
int |
getMaxToes()
Returns the set number of maximum toe threads. |
RobotsHonoringPolicy |
getRobotsHonoringPolicy()
This method gets the RobotsHonoringPolicy object from the orders file. |
java.io.File |
getSettingsDir(java.lang.String key)
Return fullpath to the directory named by key
in settings. |
java.lang.String |
getUserAgent(CrawlURI curi)
|
void |
setController(CrawlController controller)
|
Methods inherited from class org.archive.crawler.settings.ModuleType |
---|
addElement, listUsedFiles |
Methods inherited from class org.archive.crawler.settings.Type |
---|
addConstraint, equals, getConstraints, getLegalValueType, isExpertSetting, isOverrideable, isTransient, setExpertSetting, setLegalValueType, setOverrideable, setTransient |
Methods inherited from class javax.management.Attribute |
---|
getName, hashCode |
Methods inherited from class java.lang.Object |
---|
clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
public static final java.lang.String ATTR_NAME
public static final java.lang.String ATTR_SETTINGS_DIRECTORY
public static final java.lang.String ATTR_DISK_PATH
public static final java.lang.String ATTR_LOGS_PATH
public static final java.lang.String ATTR_CHECKPOINTS_PATH
public static final java.lang.String ATTR_STATE_PATH
public static final java.lang.String ATTR_SCRATCH_PATH
public static final java.lang.String ATTR_RECOVER_PATH
public static final java.lang.String ATTR_RECOVER_RETAIN_FAILURES
public static final java.lang.String ATTR_RECOVER_SCOPE_INCLUDES
public static final java.lang.String ATTR_RECOVER_SCOPE_ENQUEUES
public static final java.lang.String ATTR_MAX_BYTES_DOWNLOAD
public static final java.lang.String ATTR_MAX_DOCUMENT_DOWNLOAD
public static final java.lang.String ATTR_MAX_TIME_SEC
public static final java.lang.String ATTR_MAX_TOE_THREADS
public static final java.lang.String ATTR_HTTP_HEADERS
public static final java.lang.String ATTR_USER_AGENT
public static final java.lang.String ATTR_FROM
public static final java.lang.String ATTR_PRE_FETCH_PROCESSORS
public static final java.lang.String ATTR_FETCH_PROCESSORS
public static final java.lang.String ATTR_EXTRACT_PROCESSORS
public static final java.lang.String ATTR_WRITE_PROCESSORS
public static final java.lang.String ATTR_POST_PROCESSORS
public static final java.lang.String ATTR_LOGGERS
public static final java.lang.String ATTR_RULES
public static final java.lang.String ATTR_RECORDER_OUT_BUFFER
public static final java.lang.String ATTR_RECORDER_IN_BUFFER
public static final java.lang.String ATTR_INDEPENDENT_EXTRACTORS
public static final java.lang.String ATTR_BDB_CACHE_PERCENT
public static final java.lang.String ATTR_CHECKPOINT_COPY_BDBJE_LOGS
public static final java.lang.Boolean DEFAULT_CHECKPOINT_COPY_BDBJE_LOGS
Constructor Detail |
---|
public CrawlOrder()
Method Detail |
---|
public java.lang.String getUserAgent(CrawlURI curi)
curi
-
public java.lang.String getFrom(CrawlURI curi)
curi
-
public int getMaxToes()
public RobotsHonoringPolicy getRobotsHonoringPolicy()
public java.lang.String getCrawlOrderName()
public CrawlController getController()
public void setController(CrawlController controller)
controller
- public MapType getLoggers()
public void checkUserAgentAndFrom() throws FatalConfigurationException
FatalConfigurationException
public java.io.File getCheckpointsDirectory()
public java.io.File getSettingsDir(java.lang.String key) throws javax.management.AttributeNotFoundException
key
in settings.
If directory does not exist, it and all intermediary dirs
will be created.
key
- Key to use going to settings.
key
.
javax.management.AttributeNotFoundException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |