org.archive.crawler.scope
Class SeedCachingScope
java.lang.Object
javax.management.Attribute
org.archive.crawler.settings.Type
org.archive.crawler.settings.ComplexType
org.archive.crawler.settings.ModuleType
org.archive.crawler.framework.Filter
org.archive.crawler.framework.CrawlScope
org.archive.crawler.scope.ClassicScope
org.archive.crawler.scope.SeedCachingScope
- All Implemented Interfaces:
- java.io.Serializable, javax.management.DynamicMBean
- Direct Known Subclasses:
- DomainScope, HostScope, PathScope
public class SeedCachingScope
- extends ClassicScope
A CrawlScope that caches its seed list for the
convenience of scope-tests that are based on the
seeds.
- Author:
- gojomo
- See Also:
- Serialized Form
Field Summary |
(package private) java.util.List<UURI> |
seeds
|
Methods inherited from class org.archive.crawler.settings.ComplexType |
addElementToDefinition, checkValue, earlyInitialize, getAbsoluteName, getAttribute, getAttribute, getAttribute, getAttributeInfo, getAttributeInfo, getAttributeInfoIterator, getAttributes, getDataContainerRecursive, getDataContainerRecursive, getDefaultValue, getDescription, getElementFromDefinition, getLegalValues, getLocalAttribute, getMBeanInfo, getMBeanInfo, getParent, getPreservedFields, getSettingsHandler, getUncheckedAttribute, getValue, globalSettings, invoke, isInitialized, isOverridden, iterator, removeElementFromDefinition, setAsOrder, setAttribute, setAttribute, setAttributes, setDescription, setPreservedFields, unsetAttribute |
Methods inherited from class org.archive.crawler.settings.Type |
addConstraint, equals, getConstraints, getLegalValueType, isExpertSetting, isOverrideable, isTransient, setExpertSetting, setLegalValueType, setOverrideable, setTransient |
Methods inherited from class javax.management.Attribute |
getName, hashCode |
Methods inherited from class java.lang.Object |
clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
seeds
java.util.List<UURI> seeds
SeedCachingScope
public SeedCachingScope(java.lang.String name)
addSeed
public boolean addSeed(CrawlURI curi)
refreshSeeds
public void refreshSeeds()
- Description copied from class:
CrawlScope
- Refresh seeds.
- Overrides:
refreshSeeds
in class CrawlScope
seedsIterator
public java.util.Iterator<UURI> seedsIterator()
- Description copied from class:
CrawlScope
- Gets an iterator over all configured seeds. Subclasses
which cache seeds in memory can override with more
efficient implementation.
- Overrides:
seedsIterator
in class CrawlScope
- Returns:
- Iterator, perhaps over a disk file, of seeds
fillSeedsCache
protected void fillSeedsCache()
- Ensure seeds cache is created/filled
Copyright © 2003-2011 Internet Archive. All Rights Reserved.