Uses of Package
org.archive.crawler.framework

Packages that use org.archive.crawler.framework
org.archive.crawler.admin Contains classes that the web UI uses to monitor and control crawls. 
org.archive.crawler.datamodel   
org.archive.crawler.datamodel.credential Contains html form login and basic and digest credentials used by Heritrix logging into sites. 
org.archive.crawler.deciderules Provides classes for a simple decision rules framework. 
org.archive.crawler.extractor   
org.archive.crawler.fetcher   
org.archive.crawler.filter   
org.archive.crawler.framework   
org.archive.crawler.frontier   
org.archive.crawler.postprocessor   
org.archive.crawler.prefetch   
org.archive.crawler.processor   
org.archive.crawler.processor.recrawl   
org.archive.crawler.scope   
org.archive.crawler.writer   
 

Classes in org.archive.crawler.framework used by org.archive.crawler.admin
AbstractTracker
          A partial implementation of the StatisticsTracking interface.
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
FrontierMarker
          A marker is a pointer to a place somewhere inside a frontier's list of pending URIs.
StatisticsTracking
          An interface for objects that want to collect statistics on running crawls.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.datamodel
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
Processor
          Base class for URI processing classes.
ProcessorChain
          This class groups together a number of processors that logically fit together.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.datamodel.credential
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.deciderules
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
CrawlScope
          A CrawlScope instance defines which URIs are "in" a particular crawl.
Filter
          Base class for filter classes.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.extractor
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
Processor
          Base class for URI processing classes.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.fetcher
Processor
          Base class for URI processing classes.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.filter
Filter
          Base class for filter classes.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.framework
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
CrawlScope
          A CrawlScope instance defines which URIs are "in" a particular crawl.
Filter
          Base class for filter classes.
Frontier
          An interface for URI Frontiers.
Frontier.FrontierGroup
          Generic interface representing the internal groupings of a Frontier's URIs -- usually queues.
FrontierMarker
          A marker is a pointer to a place somewhere inside a frontier's list of pending URIs.
Processor
          Base class for URI processing classes.
ProcessorChain
          This class groups together a number of processors that logically fit together.
ProcessorChainList
          A list of all the ProcessorChains.
StatisticsTracking
          An interface for objects that want to collect statistics on running crawls.
ToePool
          A collection of ToeThreads.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.frontier
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
Frontier
          An interface for URI Frontiers.
Frontier.FrontierGroup
          Generic interface representing the internal groupings of a Frontier's URIs -- usually queues.
FrontierMarker
          A marker is a pointer to a place somewhere inside a frontier's list of pending URIs.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.postprocessor
Processor
          Base class for URI processing classes.
Scoper
          Base class for Scopers.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.prefetch
Processor
          Base class for URI processing classes.
Scoper
          Base class for Scopers.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.processor
Processor
          Base class for URI processing classes.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.processor.recrawl
Processor
          Base class for URI processing classes.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.scope
CrawlController
          CrawlController collects all the classes which cooperate to perform a crawl and provides a high-level interface to the running crawl.
CrawlScope
          A CrawlScope instance defines which URIs are "in" a particular crawl.
Filter
          Base class for filter classes.
 

Classes in org.archive.crawler.framework used by org.archive.crawler.writer
Processor
          Base class for URI processing classes.
WriterPoolProcessor
          Abstract implementation of a file pool processor.
 



Copyright © 2003-2011 Internet Archive. All Rights Reserved.