|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Packages that use UURI | |
---|---|
org.archive.crawler.datamodel | |
org.archive.crawler.extractor | |
org.archive.crawler.framework | |
org.archive.crawler.frontier | |
org.archive.crawler.scope | |
org.archive.crawler.settings | Provides classes for the settings framework. |
org.archive.crawler.settings.refinements | |
org.archive.crawler.url | |
org.archive.extractor | |
org.archive.net | |
org.archive.util |
Uses of UURI in org.archive.crawler.datamodel |
---|
Methods in org.archive.crawler.datamodel that return UURI | |
---|---|
UURI |
CrawlURI.getBaseURI()
Get the (HTML) Base URI used for derelativizing internal URIs. |
UURI |
CandidateURI.getUURI()
|
UURI |
CandidateURI.getVia()
|
protected UURI |
CandidateURI.readUuri(java.lang.String u)
Read a UURI from a String, handling a null or URIException |
Methods in org.archive.crawler.datamodel with parameters of type UURI | |
---|---|
CandidateURI |
CandidateURI.createCandidateURI(UURI baseUURI,
Link link)
Utility method for creation of CandidateURIs found extracting links from this CrawlURI. |
CandidateURI |
CandidateURI.createCandidateURI(UURI baseUURI,
Link link,
int scheduling,
boolean seed)
Utility method for creation of CandidateURIs found extracting links from this CrawlURI. |
static CandidateURI |
CandidateURI.createSeedCandidateURI(UURI uuri)
|
void |
CandidateURI.setVia(UURI via)
|
Constructors in org.archive.crawler.datamodel with parameters of type UURI | |
---|---|
CandidateURI(UURI u)
|
|
CandidateURI(UURI u,
java.lang.String pathFromSeed,
UURI via,
java.lang.CharSequence viaContext)
|
|
CrawlURI(UURI uuri)
Create a new instance of CrawlURI from a UURI . |
Uses of UURI in org.archive.crawler.extractor |
---|
Methods in org.archive.crawler.extractor with parameters of type UURI | |
---|---|
protected static java.util.List<java.lang.String> |
ExtractorURI.extractQueryStringLinks(UURI source)
Look for URIs inside the supplied UURI. |
Uses of UURI in org.archive.crawler.framework |
---|
Methods in org.archive.crawler.framework that return types with arguments of type UURI | |
---|---|
java.util.Iterator<UURI> |
CrawlScope.seedsIterator()
Gets an iterator over all configured seeds. |
java.util.Iterator<UURI> |
CrawlScope.seedsIterator(java.io.Writer ignoredItemWriter)
Gets an iterator over all configured seeds. |
Methods in org.archive.crawler.framework with parameters of type UURI | |
---|---|
void |
Frontier.considerIncluded(UURI u)
Notify Frontier that it should consider the given UURI as if already scheduled. |
protected boolean |
CrawlScope.isSameHost(UURI a,
UURI b)
|
void |
CrawlController.logUriError(org.apache.commons.httpclient.URIException e,
UURI u,
java.lang.CharSequence l)
Log a URIException from deep inside other components to the crawl's shared log. |
Uses of UURI in org.archive.crawler.frontier |
---|
Methods in org.archive.crawler.frontier that return UURI | |
---|---|
UURI |
WorkQueue.getContextUURI(WorkQueueFrontier wqf)
|
Methods in org.archive.crawler.frontier with parameters of type UURI | |
---|---|
protected java.lang.String |
AbstractFrontier.canonicalize(UURI uuri)
Canonicalize passed uuri. |
protected java.lang.String |
AdaptiveRevisitFrontier.canonicalize(UURI uuri)
Canonicalize passed uuri. |
void |
WorkQueueFrontier.considerIncluded(UURI u)
|
void |
AdaptiveRevisitFrontier.considerIncluded(UURI u)
|
Uses of UURI in org.archive.crawler.scope |
---|
Fields in org.archive.crawler.scope with type parameters of type UURI | |
---|---|
(package private) java.util.List<UURI> |
SeedCachingScope.seeds
|
Methods in org.archive.crawler.scope that return UURI | |
---|---|
protected UURI |
SeedFileIterator.transform(java.lang.String uri)
|
Methods in org.archive.crawler.scope that return types with arguments of type UURI | |
---|---|
java.util.Iterator<UURI> |
SeedCachingScope.seedsIterator()
|
Uses of UURI in org.archive.crawler.settings |
---|
Fields in org.archive.crawler.settings declared as UURI | |
---|---|
(package private) UURI |
ComplexType.Context.uri
|
Methods in org.archive.crawler.settings with parameters of type UURI | |
---|---|
CrawlerSettings |
CrawlerSettings.getParent(UURI uri)
Get the parent of this CrawlerSettings object. |
CrawlerSettings |
SettingsHandler.getSettings(java.lang.String host,
UURI uuri)
Get CrawlerSettings object in effect for a host or domain. |
Constructors in org.archive.crawler.settings with parameters of type UURI | |
---|---|
ComplexType.Context(CrawlerSettings settings,
UURI uri)
|
Uses of UURI in org.archive.crawler.settings.refinements |
---|
Methods in org.archive.crawler.settings.refinements with parameters of type UURI | |
---|---|
boolean |
Criteria.isWithinRefinementBounds(UURI uri)
Check if a uri is within the bounds of this criteria. |
boolean |
TimespanCriteria.isWithinRefinementBounds(UURI uri)
|
boolean |
PortnumberCriteria.isWithinRefinementBounds(UURI uri)
|
boolean |
Refinement.isWithinRefinementBounds(UURI uri)
Check if a URI is within the bounds of every criteria set for this refinement. |
boolean |
RegularExpressionCriteria.isWithinRefinementBounds(UURI uri)
|
Uses of UURI in org.archive.crawler.url |
---|
Methods in org.archive.crawler.url with parameters of type UURI | |
---|---|
static java.lang.String |
Canonicalizer.canonicalize(UURI uuri,
CrawlOrder order)
Convenience method that is passed a settings object instance pulling from it what it needs to canonicalize. |
static java.lang.String |
Canonicalizer.canonicalize(UURI uuri,
java.util.Iterator rules)
Run the passed uuri through the list of rules. |
Uses of UURI in org.archive.extractor |
---|
Fields in org.archive.extractor declared as UURI | |
---|---|
protected UURI |
CharSequenceLinkExtractor.base
|
protected UURI |
CharSequenceLinkExtractor.source
|
Methods in org.archive.extractor with parameters of type UURI | |
---|---|
static void |
CharSequenceLinkExtractor.extract(java.lang.CharSequence content,
UURI source,
UURI base,
java.util.List<Link> collector,
ExtractErrorListener extractErrorListener)
Convenience method to do default extraction. |
void |
ExtractErrorListener.noteExtractError(java.io.IOException ex,
UURI source,
java.lang.CharSequence context)
Callback to report an extraction error. |
void |
CharSequenceLinkExtractor.setup(UURI sourceandbase,
java.lang.CharSequence content,
ExtractErrorListener listener)
Convenience method for when source and base are same. |
void |
LinkExtractor.setup(UURI sourceandbase,
java.io.InputStream content,
java.nio.charset.Charset charset,
ExtractErrorListener listener)
Convenience version of above for common case where source and base are same. |
void |
CharSequenceLinkExtractor.setup(UURI sourceandbase,
java.io.InputStream content,
java.nio.charset.Charset charset,
ExtractErrorListener listener)
|
void |
CharSequenceLinkExtractor.setup(UURI source,
UURI base,
java.lang.CharSequence content,
ExtractErrorListener listener)
|
void |
LinkExtractor.setup(UURI source,
UURI base,
java.io.InputStream content,
java.nio.charset.Charset charset,
ExtractErrorListener listener)
Setup the LinkExtractor to operate on the given stream and charset, considering the given contextURI as the initial 'base' URI for resolving relative URIs. |
void |
CharSequenceLinkExtractor.setup(UURI source,
UURI base,
java.io.InputStream content,
java.nio.charset.Charset charset,
ExtractErrorListener listener)
|
Uses of UURI in org.archive.net |
---|
Methods in org.archive.net that return UURI | |
---|---|
static UURI |
UURI.from(java.lang.Object o)
Convenience method for finding the UURI inside an Object likely to have (or be/imply) one. |
static UURI |
UURIFactory.getInstance(java.lang.String uri)
|
static UURI |
UURIFactory.getInstance(java.lang.String uri,
java.lang.String charset)
|
static UURI |
UURIFactory.getInstance(UURI base,
java.lang.String relative)
|
UURI |
UURI.resolve(java.lang.String uri)
|
UURI |
UURI.resolve(java.lang.String uri,
boolean e)
|
UURI |
UURI.resolve(java.lang.String uri,
boolean e,
java.lang.String charset)
|
protected UURI |
UURIFactory.validityCheck(UURI uuri)
Check the generated UURI. |
Methods in org.archive.net with parameters of type UURI | |
---|---|
static UURI |
UURIFactory.getInstance(UURI base,
java.lang.String relative)
|
protected UURI |
UURIFactory.validityCheck(UURI uuri)
Check the generated UURI. |
Constructors in org.archive.net with parameters of type UURI | |
---|---|
UURI(UURI base,
UURI relative)
|
Uses of UURI in org.archive.util |
---|
Methods in org.archive.util with parameters of type UURI | |
---|---|
static java.lang.String |
UriUtils.speculativeFixup(java.lang.String candidate,
UURI base)
Perform additional fixup of likely-URI Strings |
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |