org.apache.slide.extractor
Class AbstractContentExtractor

java.lang.Object
  extended by org.apache.slide.extractor.AbstractContentExtractor
All Implemented Interfaces:
ContentExtractor, Extractor
Direct Known Subclasses:
MSExcelExtractor, MSPowerPointExtractor, MSWordExtractor, PDFExtractor

public abstract class AbstractContentExtractor
extends java.lang.Object
implements ContentExtractor

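Abstract base class for ContentExtractor implementations. It takes the URI, content type, and (optionally) namespace at construction, exposes them through the Extractor getters, and leaves extract to subclasses.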


Constructor Summary
AbstractContentExtractor(java.lang.String uri, java.lang.String contentType)
           
AbstractContentExtractor(java.lang.String uri, java.lang.String contentType, java.lang.String namespace)
           
 
Method Summary
abstract  java.io.Reader extract(java.io.InputStream content)
          Called before the content is stored.
 java.lang.String getContentType()
           
 java.lang.String getNamespace()
           
 java.lang.String getUri()
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

AbstractContentExtractor

public AbstractContentExtractor(java.lang.String uri,
                                java.lang.String contentType)

AbstractContentExtractor

public AbstractContentExtractor(java.lang.String uri,
                                java.lang.String contentType,
                                java.lang.String namespace)
Method Detail

extract

public abstract java.io.Reader extract(java.io.InputStream content)
                                throws ExtractorException
Description copied from interface: ContentExtractor
Called before the content is stored.

Specified by:
extract in interface ContentExtractor
Throws:
ExtractorException
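
As an illustration of the contract above, here is a minimal sketch of what a concrete subclass might look like. So that it compiles without Slide on the classpath, it declares local stand-ins for AbstractContentExtractor and ExtractorException that mirror only the signatures listed on this page. The stand-in bodies, the ExtractorException constructor, the null namespace in the two-argument constructor, and the PlainTextExtractor class itself are assumptions made for the example, not Slide's actual implementation.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;

// Stand-in for org.apache.slide.extractor.ExtractorException.
// The (message, cause) constructor is an assumption for this sketch.
class ExtractorException extends Exception {
    ExtractorException(String message, Throwable cause) {
        super(message, cause);
    }
}

// Stand-in mirroring the constructors and getters documented on this page.
abstract class AbstractContentExtractor {
    private final String uri;
    private final String contentType;
    private final String namespace;

    AbstractContentExtractor(String uri, String contentType) {
        // Assumption: the page does not say which namespace this constructor uses.
        this(uri, contentType, null);
    }

    AbstractContentExtractor(String uri, String contentType, String namespace) {
        this.uri = uri;
        this.contentType = contentType;
        this.namespace = namespace;
    }

    public abstract Reader extract(InputStream content) throws ExtractorException;

    public String getContentType() { return contentType; }
    public String getUri() { return uri; }
    public String getNamespace() { return namespace; }
}

// Hypothetical subclass: treats the content as UTF-8 text and returns it unchanged.
class PlainTextExtractor extends AbstractContentExtractor {
    PlainTextExtractor(String uri, String contentType) {
        super(uri, contentType);
    }

    @Override
    public Reader extract(InputStream content) throws ExtractorException {
        try {
            // Read the whole stream into memory before handing back a Reader.
            Reader in = new InputStreamReader(content, StandardCharsets.UTF_8);
            StringBuilder text = new StringBuilder();
            char[] buffer = new char[4096];
            int read;
            while ((read = in.read(buffer)) != -1) {
                text.append(buffer, 0, read);
            }
            return new StringReader(text.toString());
        } catch (IOException e) {
            // Wrap I/O failures in the checked exception declared by extract.
            throw new ExtractorException("Could not read content", e);
        }
    }
}
```

A real subclass such as PDFExtractor would replace the body of extract with format-specific parsing; the part that carries over from this sketch is the shape of the method: take the raw InputStream, return a Reader over the extracted text, and report failures as ExtractorException.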

getContentType

public java.lang.String getContentType()
Specified by:
getContentType in interface Extractor

getUri

public java.lang.String getUri()
Specified by:
getUri in interface Extractor

getNamespace

public java.lang.String getNamespace()
Specified by:
getNamespace in interface Extractor