org.mmbase.module.lucene.extraction.impl
Class PDFBoxExtractor

java.lang.Object
  extended by org.mmbase.module.lucene.extraction.impl.PDFBoxExtractor
All Implemented Interfaces:
Extractor

public class PDFBoxExtractor
extends Object
implements Extractor

Version:
$Id: PDFBoxExtractor.java 35593 2009-06-02 23:58:55Z michiel $
Author:
Wouter Heijke

Constructor Summary
PDFBoxExtractor()
           
 
Method Summary
 String extract(InputStream input)
          Extract text from a source
 String getMimeType()
          Returns the MIME type this Extractor handles
static void main(String[] args)
           
 void setMimeType(String mimetype)
          Sets the MIME type this Extractor handles
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PDFBoxExtractor

public PDFBoxExtractor()
Method Detail

setMimeType

public void setMimeType(String mimetype)
Description copied from interface: Extractor
Sets the MIME type this Extractor handles

Specified by:
setMimeType in interface Extractor
Parameters:
mimetype - String representing the MIME Type

getMimeType

public String getMimeType()
Description copied from interface: Extractor
Returns the MIME type this Extractor handles

Specified by:
getMimeType in interface Extractor
Returns:
String representing the MIME Type
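
Example (illustrative sketch, not MMBase code): one plausible use of getMimeType() is to index several extractors by the MIME type they report, so a caller can look up the right one for a document. The nested Extractor interface below is a local mirror of the three methods documented on this page, and StubExtractor, index, and MimeRegistryDemo are hypothetical names introduced only for this sketch. The value "application/pdf" is set explicitly here; this page does not state what a PDFBoxExtractor reports by default.

```java
import java.io.InputStream;
import java.util.HashMap;
import java.util.Map;

public class MimeRegistryDemo {

    // Local mirror of the methods documented for the Extractor interface (assumption).
    interface Extractor {
        void setMimeType(String mimetype);
        String getMimeType();
        String extract(InputStream input) throws Exception;
    }

    // Hypothetical stand-in used instead of PDFBoxExtractor so the sketch runs alone.
    static class StubExtractor implements Extractor {
        private String mimeType;
        public void setMimeType(String mimetype) { this.mimeType = mimetype; }
        public String getMimeType() { return mimeType; }
        public String extract(InputStream input) { return ""; }
    }

    // Builds a lookup table keyed by the MIME type each extractor reports.
    static Map<String, Extractor> index(Extractor... extractors) {
        Map<String, Extractor> byMimeType = new HashMap<>();
        for (Extractor extractor : extractors) {
            byMimeType.put(extractor.getMimeType(), extractor);
        }
        return byMimeType;
    }

    public static void main(String[] args) {
        Extractor pdf = new StubExtractor();
        pdf.setMimeType("application/pdf");

        Map<String, Extractor> registry = index(pdf);
        System.out.println(registry.containsKey("application/pdf"));
        System.out.println(registry.containsKey("text/html"));
    }
}
```

With the real class, new PDFBoxExtractor() would take the place of new StubExtractor().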

extract

public String extract(InputStream input)
               throws Exception
Description copied from interface: Extractor
Extract text from a source

Specified by:
extract in interface Extractor
Parameters:
input - InputStream where the data comes from
Returns:
String representing the extracted text
Throws:
Exception
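
Example (illustrative sketch, not MMBase code): the call pattern for extract(InputStream) might look like the following. The nested Extractor interface is a local mirror of the methods documented on this page; PlainTextExtractor, extractIfSupported, and ExtractorDemo are hypothetical names, and the stand-in simply decodes the stream as UTF-8 rather than parsing a PDF. The sketch assumes the caller is responsible for closing the stream, which this page does not specify.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

public class ExtractorDemo {

    // Local mirror of the methods documented for the Extractor interface (assumption).
    interface Extractor {
        void setMimeType(String mimetype);
        String getMimeType();
        String extract(InputStream input) throws Exception;
    }

    // Hypothetical stand-in for PDFBoxExtractor: returns the stream content as UTF-8 text.
    static class PlainTextExtractor implements Extractor {
        private String mimeType = "text/plain";
        public void setMimeType(String mimetype) { this.mimeType = mimetype; }
        public String getMimeType() { return mimeType; }
        public String extract(InputStream input) throws Exception {
            return new String(input.readAllBytes(), StandardCharsets.UTF_8);
        }
    }

    // Runs the extractor only when it handles the given MIME type; returns null otherwise.
    static String extractIfSupported(Extractor extractor, String mimeType, InputStream input)
            throws Exception {
        try (InputStream in = input) { // assumption: the caller closes the stream
            if (!extractor.getMimeType().equals(mimeType)) {
                return null;
            }
            return extractor.extract(in);
        }
    }

    public static void main(String[] args) throws Exception {
        Extractor extractor = new PlainTextExtractor();
        InputStream input =
                new ByteArrayInputStream("hello".getBytes(StandardCharsets.UTF_8));
        System.out.println(extractIfSupported(extractor, "text/plain", input));
    }
}
```

Because extract is declared with throws Exception, callers have to either propagate the exception, as here, or catch it and decide whether a failed extraction should skip the document.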

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception


MMBase 2.0-SNAPSHOT