|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
Class Summary | |
---|---|
BaseRule | Base of all rules applied canonicalizing a URL that are configurable via the Heritrix settings system. |
FixupQueryStr | Strip any trailing question mark. |
LowercaseRule | Lowercases the URL. |
RegexRule | General conversion rule. |
StripExtraSlashes | |
StripSessionCFIDs | Strip cold fusion session ids. |
StripSessionIDs | Strip known session ids. |
StripUserinfoRule | Strip any 'userinfo' found on http/https URLs. |
StripWWWNRule | Strip any 'www[0-9]*' found on http/https URLs IF they have some path/query component (content after third slash). |
StripWWWRule | Strip any 'www' found on http/https URLs, IF they have some path/query component (content after third slash). |
|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |