Posted to dev@pdfbox.apache.org by "Phil Varner (JIRA)" <ji...@apache.org> on 2010/01/05 21:09:54 UTC

[jira] Commented: (PDFBOX-533) PDFTextStripper.writeCharacters is called no where in the class

    [ https://issues.apache.org/jira/browse/PDFBOX-533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12796828#action_12796828 ] 

Phil Varner commented on PDFBOX-533:
------------------------------------

Mel, 

I think it should be 

            while(pdfSource.available() > 0 && objIter.hasNext())

instead, so that next() returns the correct Integer when it is called later on.

This worked for me on a doc that threw the same exception.

I didn't see a separate JIRA issue for this. I'll gladly file one and fix it if someone can provide a doc that triggers the error (mine is confidential, from a customer).
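
To illustrate why the extra hasNext() check matters, here is a minimal, self-contained sketch. It is not the PDFBox code: the class LoopGuardSketch and the method pairUp are hypothetical, and only the names pdfSource and objIter are borrowed from the loop condition above. It assumes the loop body calls next() on the iterator once per pass, so checking only available() could call next() on an exhausted iterator.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for a parser loop; not taken from PDFBox.
public class LoopGuardSketch {

    // Pairs each byte read from the stream with the next object number.
    // The loop stops as soon as either the stream or the iterator runs out.
    public static List<String> pairUp(InputStream pdfSource, Iterator<Integer> objIter)
            throws IOException {
        List<String> pairs = new ArrayList<>();
        while (pdfSource.available() > 0 && objIter.hasNext()) {
            int objectNumber = objIter.next(); // safe: hasNext() was just checked
            int data = pdfSource.read();
            pairs.add(objectNumber + ":" + data);
        }
        return pairs;
    }

    public static void main(String[] args) throws IOException {
        // Three bytes of input, but only two object numbers.
        InputStream source = new ByteArrayInputStream(new byte[] {10, 20, 30});
        Iterator<Integer> numbers = Arrays.asList(7, 8).iterator();
        System.out.println(pairUp(source, numbers)); // prints [7:10, 8:20]
    }
}
```

With only the available() check, the third pass of this sketch would call next() on an empty iterator and throw a NoSuchElementException. Whether that is the exact failure in the parser depends on the real loop body, which is not shown here.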

 

> PDFTextStripper.writeCharacters is called no where in the class
> ---------------------------------------------------------------
>
>                 Key: PDFBOX-533
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-533
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 0.8.0-incubator
>            Reporter: Navendu Garg
>         Attachments: TestPDFTextStripperPerf.java
>
>
> It seems the writeCharacters method is not called anywhere in the PDFTextStripper class. This makes it impossible to handle character TextPosition objects as well as line separators, because the processLineSeparator method is no longer there and writeLineSeparator is called when the actual writing happens.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.