Package org.jpedal.examples.text
Class FindTextInRectangle
java.lang.Object
org.jpedal.examples.BaseExample
org.jpedal.examples.text.FindTextInRectangle
public class FindTextInRectangle
extends org.jpedal.examples.BaseExample
Find text in PDF files
This class provides a simple Java API to find text in a PDF file and also a static convenience method if you want to search a PDF file or directory containing PDF files
See our Support Pages for more information on Text Searching.
- 
Constructor SummaryConstructorsConstructorDescriptionFindTextInRectangle(byte[] byteArray) Sets up an FindTextInRectangle instance to open a PDF file contained as a BLOB within a byte[] streamFindTextInRectangle(String fileName) Sets up an FindTextInRectangle instance to open a PDF File
- 
Method SummaryModifier and TypeMethodDescriptionvoiddecodeFile(String file_name) routine to decode a filestatic List<float[]> findTextOnAllPages(String inputDir, String textToFind) Convenience method to find text in a PDF filefloat[]findTextOnPage(int page, int x1, int y1, int x2, int y2, String textToFind, int searchType) Return the coords for the page specified.The origin of the coords is the bottom left hand corner (on unrotated page)float[]findTextOnPage(int page, String textToFind, int searchType) Return the coords for the page specified.The origin of the coords is the bottom left hand corner (on unrotated page)intnumber of pages in PDF file (starting at 1)voidsetPassword(String password) Methods inherited from class org.jpedal.examples.BaseExampleclosePDFfile, openPDFFile
- 
Constructor Details- 
FindTextInRectangleSets up an FindTextInRectangle instance to open a PDF File- Parameters:
- fileName- full path to a single PDF file
 
- 
FindTextInRectanglepublic FindTextInRectangle(byte[] byteArray) Sets up an FindTextInRectangle instance to open a PDF file contained as a BLOB within a byte[] stream- Parameters:
- byteArray- Array that will hold the BLOB
 
 
- 
- 
Method Details- 
decodeFileroutine to decode a file- Throws:
- PdfException
 
- 
findTextOnPageReturn the coords for the page specified.The origin of the coords is the bottom left hand corner (on unrotated page)- Parameters:
- page- :: Page number to check for results
- textToFind- test to look for
- searchType- A static int from org.jpedal.grouping.SearchType class
- Returns:
- float[] containing all coords for the page, or empty array is no results found
         
 [0]=result x1 coord
 [1]=result y1 coord
 [2]=result x2 coord
 [3]=result y2 coord
 [4]=either -101 to show that the next text area is the remainder of this word on another line else any other value is ignored.
- Throws:
- PdfException- PdfException
 
- 
findTextOnPagepublic float[] findTextOnPage(int page, int x1, int y1, int x2, int y2, String textToFind, int searchType) throws PdfException Return the coords for the page specified.The origin of the coords is the bottom left hand corner (on unrotated page)- Parameters:
- page- page to search
- x1- x1
- y1- y1
- x2- x2
- y2- y2
- textToFind- text to look for
- searchType- A static int from org.jpedal.grouping.SearchType class
- Returns:
- float[] containing all coords for the page, or empty array is no results found
         
 [0]=result x1 coord
 [1]=result y1 coord
 [2]=result x2 coord
 [3]=result y2 coord
 [4]=either -101 to show that the next text area is the remainder of this word on another line else any other value is ignored.
- Throws:
- PdfException- pdfException
 
- 
findTextOnAllPagespublic static List<float[]> findTextOnAllPages(String inputDir, String textToFind) throws PdfException Convenience method to find text in a PDF file- Parameters:
- inputDir- a PDF file
- textToFind- text to look for
- Returns:
- ArrayList containing set of float[] values for all pages (-1 for actual page) * The origin of the coords
         is the bottom left hand corner (on unrotated page) organised in the following order.
         
 [0]=result x1 coord
 [1]=result y1 coord
 [2]=result x2 coord
 [3]=result y2 coord
 [4]=either -101 to show that the next text area is the remainder of this word on another line else any other value is ignored. s
- Throws:
- PdfException- PdfException
 
- 
setPassword- Parameters:
- password- the USER or OWNER password for the PDF file
 
- 
getPageCountpublic int getPageCount()number of pages in PDF file (starting at 1)- Returns:
- page count
 
 
-