org.archive.crawler.frontier
Interface FrontierJournal

All Known Implementing Classes:
RecoveryJournal

public interface FrontierJournal

Record of key Frontier happenings.

Version:
$Date: 2008-07-11 22:14:18 +0000 (Fri, 11 Jul 2008) $, $Revision: 5870 $
Author:
stack

Field Summary
static java.lang.String LOGNAME_RECOVER
           
 
Method Summary
 void added(CandidateURI curi)
           
 void checkpoint(java.io.File checkpointDir)
          Checkpoint.
 void close()
          Flush and close any held objects.
 void emitted(CandidateURI curi)
          Note that a CrawlURI was emitted for processing.
 void finishedDisregard(CandidateURI curi)
           
 void finishedFailure(CandidateURI curi)
           
 void finishedSuccess(CandidateURI curi)
           
 void rescheduled(CandidateURI curi)
           
 void seriousError(java.lang.String string)
          Add a line noting a serious crawl error.
 

Field Detail

LOGNAME_RECOVER

static final java.lang.String LOGNAME_RECOVER
See Also:
Constant Field Values
Method Detail

added

void added(CandidateURI curi)
Parameters:
curi - CrawlURI that has been scheduled to be added to the Frontier.

finishedSuccess

void finishedSuccess(CandidateURI curi)
Parameters:
curi - CrawlURI that finished successfully.

emitted

void emitted(CandidateURI curi)
Note that a CrawlURI was emitted for processing. If not followed by a finished or rescheduled notation in the journal, the CrawlURI was still in-process when the journal ended.

Parameters:
curi - CrawlURI emitted.

finishedFailure

void finishedFailure(CandidateURI curi)
Parameters:
curi - CrawlURI finished unsuccessfully.

finishedDisregard

void finishedDisregard(CandidateURI curi)
Parameters:
curi - CrawlURI finished disregarded (uncounted failure).

rescheduled

void rescheduled(CandidateURI curi)
Parameters:
curi - CrawlURI that was returned to the Frontier for another try.

close

void close()
Flush and close any held objects.


checkpoint

void checkpoint(java.io.File checkpointDir)
                throws java.io.IOException
Checkpoint.

Parameters:
checkpointDir - Directory we're checkpointing into.
Throws:
java.io.IOException

seriousError

void seriousError(java.lang.String string)
Add a line noting a serious crawl error.

Parameters:
string -


Copyright © 2003-2011 Internet Archive. All Rights Reserved.