You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "John Hewson (JIRA)" <ji...@apache.org> on 2014/06/17 21:41:09 UTC
[jira] [Closed] (PDFBOX-12) text from box
[ https://issues.apache.org/jira/browse/PDFBOX-12?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Hewson closed PDFBOX-12.
-----------------------------
Resolution: Fixed
Fix Version/s: 1.7.0
Looks like PDFTextStripper#setSortByPosition() has supported this for some time.
> text from box
> -------------
>
> Key: PDFBOX-12
> URL: https://issues.apache.org/jira/browse/PDFBOX-12
> Project: PDFBox
> Issue Type: New Feature
> Components: Text extraction
> Fix For: 1.7.0
>
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552835&aid=939868
> Originally submitted by ramyamd on 2004-04-22 01:19.
> I am attaching a file with this message.
> problem:
> The text from the rectangles are not read sequentially
> i.e not extract from a single rectangle at a time. it is
> extracting randomly from different rectangles. I want to
> get the text rectangle wise.
> for example - PDF page no - 89
> The text is to be extracted in this way
> A
> 767
> FAULT ISOLATION/MAINT MANUAL
> PASSENGER ADDRESS
> AMPLIFIER BITE
> PROCEDURE
> PREREQUISITES
> MAKE SURE THIS CIRCUIT BREAKER IS CLOSED:
> 11C22
> MAKE SURE THE AIRPLANE IS IN THIS CONFIGURATION:
> ELECTRICAL POWER IS ON (AMM 24-22-00/201)
> 1 SET THE FUNCTION SELECTOR NO
> SWITCH TO THE "LEVEL" POSITION
> ON THE PA AMPLIFIER FRONT
> PANEL AT E2-5.
> DOES THE PA AMPLIFIER
> FRONT PANEL SHOW 69 TO 71
> VRMS?
> 10 ADJUST THE "MASTER GAIN"
> FOR 69 TO 71 VRMS.
> DOES THE PA AMPLIFIER
> FRONT PANEL SHOW 69 TO 71
> VRMS?
> YES
> NO
> 20 REPLACE THE PA AMPLIFIER,
> M177 (AMM 23-31-01/401).
> YES
> 2 SET THE FUNCTION SELECTOR
> SWITCH TO THE "LOAD" POSITION.
> DOES THE PA AMPLIFIER NO
> FRONT PANEL SHOW 30 OHMS OR
> MORE?
> YES
> 21 EXAMINE THE SPEAKER WIRING
> FOR SHORT CIRCUITS FROM
> PIN A13 TO B13 (IF USED) AND
> PIN A15 TO B15, OF CONNECTOR
> D455B, AT E2-5 (WDM 23-31-14
> THRU 23-31-17).
> REPAIR THE PROBLEMS THAT
> YOU FIND.
> 3 SET THE FUNCTION SELECTOR NO
> SWITCH TO THE "TONE" POSITION.
> DO YOU HEAR SOUND FROM ALL
> THE PA SPEAKERS?
> YES
> 4 SET THE FUNCTION SELECTOR
> SWITCH TO THE "OPERATE" POSI-
> TION.
> THE SYSTEM IS OK.
> 11 DO YOU HEAR NO SOUND AT
> ONE OF THE SPEAKERS?
> NO
> 12 DO YOU HEAR NO SOUND FROM
> ALL OF THE SPEAKERS?
> NO
> YES
> YES
> 22 REPLACE THE BAD SPEAKER.
> REFER TO TABLE 101.
> 23 REPLACE THE PA AMPLIFIER,
> M177 (AMM 23-31-01/401).
> 24 EXAMINE THE SPEAKER WIRING
> FOR OPEN CIRCUITS FROM A
> SPEAKER WITH THE SOUND TO A
> SPEAKER WITHOUT (WDM 23-31-14
> THRU 23-31-17).
> REPAIR THE PROBLEMS THAT
> YOU FIND.
> NOTE:
> BITE DOES A TEST OF THESE SYSTEM COMPONENTS:
> PA AMPLIFIER
> SPEAKERS
> SPEAKER WIRING.
> BITE DOES NOT DO A TEST OF THESE SYSTEM
> COMPONENTS:
> AUDIO ACCESSORY UNIT
> ZONE MULTIPLEXER.
> SPEAKER
> LOCATION
> PSU
> GALLEY
> LAVATORY
> CEILING
> AMM
> REFERENCE
> 23-31-02/401
> 23-31-04/401
> 23-31-05/401
> 23-31-08/401
> TABLE 101
> Passenger Address Amplifier BITE Procedure
> Figure 103
>
> its just the outline of how i need the information. The
> text should be read completely from one rectanble and
> then switched to next rectangle etc.,
> Similar pages are also in this PDF document. pls test it
> with that also.
> it will be useful for me if u give the details of the images
> in this file(how it is stored and which format)
> pls give importance to this message.
> Thanks in advance. Waiting for ur reply.
> [attachment on SourceForge]
> http://sourceforge.net/tracker/download.php?group_id=78314&atid=552835&aid=939868&file_id=84628
> task23.zip (), 138214 bytes
--
This message was sent by Atlassian JIRA
(v6.2#6252)