Uses of Class
org.archive.crawler.deciderules.MatchesRegExpDecideRule

Packages that use MatchesRegExpDecideRule
org.archive.crawler.deciderules Provides classes for a simple decision rules framework. 
 

Uses of MatchesRegExpDecideRule in org.archive.crawler.deciderules
 

Subclasses of MatchesRegExpDecideRule in org.archive.crawler.deciderules
 class ClassKeyMatchesRegExpDecideRule
          Rule applies configured decision to any CrawlURI class key -- i.e.
 class ContentTypeMatchesRegExpDecideRule
          DecideRule whose decision is applied if the URI's content-type is present and matches the supplied regular expression.
 class ContentTypeNotMatchesRegExpDecideRule
          DecideRule whose decision is applied if the URI's content-type is present and does not match the supplied regular expression.
 class MatchesFilePatternDecideRule
          Compares suffix of a passed CrawlURI, UURI, or String against a regular expression pattern, applying its configured decision to all matches.
 class NotMatchesFilePatternDecideRule
          Rule applies configured decision to any URIs which do *not* match the supplied (file-pattern) regexp.
 class NotMatchesRegExpDecideRule
          Rule applies configured decision to any URIs which do *not* match the supplied regexp.
 class PathologicalPathDecideRule
          Rule REJECTs any URI which contains an excessive number of identical, consecutive path-segments (eg http://example.com/a/a/a/boo.html == 3 '/a' segments)
 



Copyright © 2003-2011 Internet Archive. All Rights Reserved.