Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2011/09/24 14:53:26 UTC

[jira] [Commented] (PDFBOX-1123) Not able to read field values from a PDF File if the field contains special characters.

    [ https://issues.apache.org/jira/browse/PDFBOX-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13113966#comment-13113966 ] 

Andreas Lehmkühler commented on PDFBOX-1123:
--------------------------------------------

Can you provide us with a sample PDF that shows the problem?

> Not able to read field values from a PDF File if the field contains special characters.
> ---------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1123
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1123
>             Project: PDFBox
>          Issue Type: Bug
>            Reporter: Rubesh MX
>            Priority: Critical
>              Labels: Bug
>   Original Estimate: 12h
>  Remaining Estimate: 12h
>
> Hi, I am trying to read the field names in a PDF file. This works for most files, but for some files we are not able to read the field ID/name. Those files have field names such as:
> topmostSubform[0].Page1[0].c1_04_0_[0]
> topmostSubform[0].Page1[0].c1_09_0_
> topmostSubform[0].Page2[0].Table_Line4a[0].#subform[1].p2-t69[0]
> All of these field names start with topmostSubform[0]. When we read a field name with PDField.getPartialName(), the name is truncated at the first '.' and we get only topmostSubform[0]. Because every field name starts with the same prefix, the total field count comes out as 1. We suspect that special characters such as '.', '_' and '#' are causing the issue. Could you please advise? This is very critical for us.
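
For illustration only: the behaviour described above would be consistent with a hierarchical form, where '.' separates the levels of a field tree rather than being a character inside one name. Whether the reporter's file is actually built this way is an assumption; no sample PDF is available yet. The sketch below does not use PDFBox. Node, sampleTree() and collectTerminals() are hypothetical names that only model the idea: a partial name is one segment, a fully qualified name joins the segments from the root down, and a form with a single top-level field can still contain several terminal fields.

```java
import java.util.ArrayList;
import java.util.List;

class FieldNameSketch {

    // Hypothetical stand-in for a form field; this is NOT the PDFBox PDField class.
    static final class Node {
        final String partialName;              // one name segment only
        final Node parent;                     // null for a top-level field
        final List<Node> kids = new ArrayList<>();

        Node(String partialName, Node parent) {
            this.partialName = partialName;
            this.parent = parent;
            if (parent != null) {
                parent.kids.add(this);
            }
        }

        // Join the partial names from the root down to this node with '.'.
        String fullyQualifiedName() {
            return parent == null
                    ? partialName
                    : parent.fullyQualifiedName() + "." + partialName;
        }
    }

    // Assumed tree layout, reconstructed from the three names in the report.
    static Node sampleTree() {
        Node root = new Node("topmostSubform[0]", null);
        Node page1 = new Node("Page1[0]", root);
        new Node("c1_04_0_[0]", page1);
        new Node("c1_09_0_", page1);
        Node page2 = new Node("Page2[0]", root);
        Node table = new Node("Table_Line4a[0]", page2);
        Node subform = new Node("#subform[1]", table);
        new Node("p2-t69[0]", subform);
        return root;
    }

    // Collect every node that has no kids, i.e. the terminal fields.
    static void collectTerminals(Node node, List<Node> out) {
        if (node.kids.isEmpty()) {
            out.add(node);
        } else {
            for (Node kid : node.kids) {
                collectTerminals(kid, out);
            }
        }
    }

    public static void main(String[] args) {
        Node root = sampleTree();
        List<Node> terminals = new ArrayList<>();
        collectTerminals(root, terminals);

        // Only one top-level field exists, and its partial name is a single segment.
        System.out.println("top-level partial name: " + root.partialName);
        System.out.println("terminal fields: " + terminals.size());
        for (Node field : terminals) {
            System.out.println(field.fullyQualifiedName());
        }
    }
}
```

If the file is structured like this, the PDFBox counterparts would be PDField.getFullyQualifiedName() for the complete name and PDField.getKids() for descending into child fields, instead of looking only at the partial name of each top-level field.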

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira