|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object javax.management.Attribute org.archive.crawler.settings.Type org.archive.crawler.settings.ComplexType org.archive.crawler.settings.ModuleType org.archive.crawler.framework.Processor org.archive.crawler.extractor.Extractor org.archive.crawler.extractor.ExtractorPDF
public class ExtractorPDF
Allows the caller to process a CrawlURI representing a PDF for the purpose of extracting URIs
Nested Class Summary |
---|
Nested classes/interfaces inherited from class org.archive.crawler.settings.ComplexType |
---|
ComplexType.MBeanAttributeInfoIterator |
Field Summary | |
---|---|
protected long |
numberOfCURIsHandled
|
protected long |
numberOfLinksExtracted
|
Fields inherited from class org.archive.crawler.framework.Processor |
---|
ATTR_DECIDE_RULES, ATTR_ENABLED, attrDecideRules |
Fields inherited from class org.archive.crawler.settings.ComplexType |
---|
definition, definitionMap |
Constructor Summary | |
---|---|
ExtractorPDF(java.lang.String name)
|
Method Summary | |
---|---|
protected void |
extract(CrawlURI curi)
|
java.lang.String |
report()
Provide a human-readable textual summary of this Processor's state. |
Methods inherited from class org.archive.crawler.extractor.Extractor |
---|
innerProcess, isHttpTransactionContentToProcess, isIndependentExtractors |
Methods inherited from class org.archive.crawler.framework.Processor |
---|
checkForInterrupt, finalTasks, getController, getDecideRule, getDefaultNextProcessor, initialTasks, innerRejectProcess, isContentToProcess, isEnabled, isExpectedMimeType, kickUpdate, process, rulesAccept, rulesAccept, setDefaultNextProcessor, spawn |
Methods inherited from class org.archive.crawler.settings.ModuleType |
---|
addElement, listUsedFiles |
Methods inherited from class org.archive.crawler.settings.Type |
---|
addConstraint, equals, getConstraints, getLegalValueType, isExpertSetting, isOverrideable, isTransient, setExpertSetting, setLegalValueType, setOverrideable, setTransient |
Methods inherited from class javax.management.Attribute |
---|
getName, hashCode |
Methods inherited from class java.lang.Object |
---|
clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
Field Detail |
---|
protected long numberOfCURIsHandled
protected long numberOfLinksExtracted
Constructor Detail |
---|
public ExtractorPDF(java.lang.String name)
name
- Method Detail |
---|
protected void extract(CrawlURI curi)
extract
in class Extractor
public java.lang.String report()
report
in class Processor
Processor.report()
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |