org.sejda.impl.pdfbox.component
Class PdfTextExtractor

java.lang.Object
  extended by org.sejda.impl.pdfbox.component.PdfTextExtractor
All Implemented Interfaces:
Closeable

public class PdfTextExtractor
extends Object
implements Closeable

Component responsible for extracting text from an input pdf document.

Author:
Andrea Vacondio

Constructor Summary
PdfTextExtractor(String encoding)
           
 
Method Summary
 void close()
           
 void extract(org.apache.pdfbox.pdmodel.PDDocument document, File output)
          Extract text from the input document writing it to the given output file.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PdfTextExtractor

public PdfTextExtractor(String encoding)
                 throws TaskException
Throws:
TaskException
Method Detail

extract

public void extract(org.apache.pdfbox.pdmodel.PDDocument document,
                    File output)
             throws TaskException
Extract text from the input document writing it to the given output file.

Parameters:
document -
output -
Throws:
TaskException

close

public void close()
Specified by:
close in interface Closeable


Copyright © 2012. All Rights Reserved.