org.archive.hcc.client
Interface CompletedCrawlJob

All Superinterfaces:
CrawlJob
All Known Implementing Classes:
CompletedCrawlJobImpl

public interface CompletedCrawlJob
extends CrawlJob

Defines the interface for interacting with a completed crawl job.

Author:
Daniel Bernstein (dbernstein@archive.org)

Method Summary
 java.lang.String getCrawlReport()
          Returns a crawl report which is some statistics summarizing the end state of the crawl.
 java.lang.String getHostReport()
          A count of bytes and docs collected by host.
 java.lang.String getMimeTypeReport()
          A count of bytes and docs collected by mime type.
 java.lang.String getSeedReport()
          Returns a seed report which details final crawl status of all the seeds.
 java.lang.String getSourceReport()
          A count of docs per host per source seed.
 
Methods inherited from interface org.archive.hcc.client.CrawlJob
getJobName, getMother, getUid
 

Method Detail

getCrawlReport

java.lang.String getCrawlReport()
                                throws ClusterException
Returns a crawl report which is some statistics summarizing the end state of the crawl.

Returns:
Throws:
ClusterException

getSeedReport

java.lang.String getSeedReport()
                               throws ClusterException
Returns a seed report which details final crawl status of all the seeds.

Returns:
Throws:
ClusterException

getHostReport

java.lang.String getHostReport()
                               throws ClusterException
A count of bytes and docs collected by host.

Returns:
Throws:
ClusterException

getSourceReport

java.lang.String getSourceReport()
                                 throws ClusterException
A count of docs per host per source seed.

Returns:
Throws:
ClusterException

getMimeTypeReport

java.lang.String getMimeTypeReport()
                                   throws ClusterException
A count of bytes and docs collected by mime type.

Returns:
Throws:
ClusterException


Copyright © 2005-2008 The Internet Archive. All Rights Reserved.