org.archive.crawler.url
Interface CanonicalizationRule
- All Known Implementing Classes:
- BaseRule, FixupQueryStr, LowercaseRule, RegexRule, StripExtraSlashes, StripSessionCFIDs, StripSessionIDs, StripUserinfoRule, StripWWWNRule, StripWWWRule
public interface CanonicalizationRule
A rule to apply canonicalizing a url.
- Version:
- $Date: 2004-10-08 17:39:42 +0000 (Fri, 08 Oct 2004) $, $Revision: 2627 $
- Author:
- stack
Method Summary |
java.lang.String |
canonicalize(java.lang.String url,
java.lang.Object context)
Apply this canonicalization rule. |
java.lang.String |
getName()
|
boolean |
isEnabled(java.lang.Object context)
|
canonicalize
java.lang.String canonicalize(java.lang.String url,
java.lang.Object context)
- Apply this canonicalization rule.
- Parameters:
url
- Url string we apply this rule to.context
- An object that will provide context for the settings
system. The UURI of the URL we're canonicalizing is an example of
an object that provides context.
- Returns:
- Result of applying this rule to passed
url
.
getName
java.lang.String getName()
- Returns:
- Name of this rule.
isEnabled
boolean isEnabled(java.lang.Object context)
- Parameters:
context
- An object that will provide context for the settings
system. The UURI of the URL we're canonicalizing is an example of
an object that provides context.
- Returns:
- True if this rule is enabled and to be run.
Copyright © 2003-2011 Internet Archive. All Rights Reserved.