org.archive.crawler.url
Interface CanonicalizationRule

All Known Implementing Classes:
BaseRule, FixupQueryStr, LowercaseRule, RegexRule, StripExtraSlashes, StripSessionCFIDs, StripSessionIDs, StripUserinfoRule, StripWWWNRule, StripWWWRule

public interface CanonicalizationRule

A rule to apply canonicalizing a url.

Version:
$Date: 2004-10-08 17:39:42 +0000 (Fri, 08 Oct 2004) $, $Revision: 2627 $
Author:
stack

Method Summary
 java.lang.String canonicalize(java.lang.String url, java.lang.Object context)
          Apply this canonicalization rule.
 java.lang.String getName()
           
 boolean isEnabled(java.lang.Object context)
           
 

Method Detail

canonicalize

java.lang.String canonicalize(java.lang.String url,
                              java.lang.Object context)
Apply this canonicalization rule.

Parameters:
url - Url string we apply this rule to.
context - An object that will provide context for the settings system. The UURI of the URL we're canonicalizing is an example of an object that provides context.
Returns:
Result of applying this rule to passed url.

getName

java.lang.String getName()
Returns:
Name of this rule.

isEnabled

boolean isEnabled(java.lang.Object context)
Parameters:
context - An object that will provide context for the settings system. The UURI of the URL we're canonicalizing is an example of an object that provides context.
Returns:
True if this rule is enabled and to be run.


Copyright © 2003-2011 Internet Archive. All Rights Reserved.