org.mmbase.module.lucene.extraction.impl
Class POIWordExtractor

java.lang.Object
  extended by org.mmbase.module.lucene.extraction.impl.POIWordExtractor
All Implemented Interfaces:
Extractor

public class POIWordExtractor
extends Object
implements Extractor

Use POI to extract text from a MS Word document

Version:
$Id $
Author:
Wouter Heijke

Constructor Summary
POIWordExtractor()
           
 
Method Summary
 String extract(InputStream input)
          Extract text from a source
 String getMimeType()
          Mimetype this Extractor handles
static void main(String[] args)
           
 void setMimeType(String mimetype)
          Mimetype this Extractor handles
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

POIWordExtractor

public POIWordExtractor()
Method Detail

setMimeType

public void setMimeType(String mimetype)
Description copied from interface: Extractor
Mimetype this Extractor handles

Specified by:
setMimeType in interface Extractor
Parameters:
mimetype - String representing the MIME Type

getMimeType

public String getMimeType()
Description copied from interface: Extractor
Mimetype this Extractor handles

Specified by:
getMimeType in interface Extractor
Returns:
String representing the MIME Type

extract

public String extract(InputStream input)
Description copied from interface: Extractor
Extract text from a source

Specified by:
extract in interface Extractor
Parameters:
input - InputStream where the data comes from
Returns:
String representing the extracted text

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception


MMBase 2.0-SNAPSHOT - null