You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/09/23 03:04:35 UTC

[jira] [Created] (PDFBOX-2376) Small regression in text extraction with PDFBox 1.8.7 vs. 1.8.6

Tim Allison created PDFBOX-2376:
-----------------------------------

             Summary: Small regression in text extraction with PDFBox 1.8.7 vs. 1.8.6
                 Key: PDFBOX-2376
                 URL: https://issues.apache.org/jira/browse/PDFBOX-2376
             Project: PDFBox
          Issue Type: Bug
            Reporter: Tim Allison
            Priority: Minor


On at least one file in govdocs1, less text is being extracted with PDFBox 1.8.7 than was extracted with 1.8.6.  When running the app.jar with ExtractText, 1.8.7 is not extracting:
{noformat}
Designated Counties
No Designation
Individual Assistance
All counties are eligible
ITS Mapping & Analysis CenterWashington, DC
05/09/08 -- 09:36 AM EDT
Source: Disaster Federal Registry Notice05/08/2008
Location Map
MapID 196d109cd27
for Hazard Mitigation

{noformat}

from govdocs1's 894770.pdf.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)