You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Thomas Gawehn (Jira)" <ji...@apache.org> on 2020/06/12 14:22:00 UTC
[jira] [Updated] (PDFBOX-4878) Call to Dictionary.encoding throws NullPointerException for some PDF's

     [ https://issues.apache.org/jira/browse/PDFBOX-4878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Gawehn updated PDFBOX-4878:
----------------------------------
    Description: 
Wenn iterating all fonts in a PDF a call to PDSimpleFont.getEncoding().getEncodingName () throws a NullPointerException inside the library code for the attached sample-PDF.

The situation can be reproduced, when running the following loop for the attached PDF (test.pdf):

 

 
{code:java}
        document = PDDocument.load (file);        int pages = document.getNumberOfPages ();        for (int i = 0; i < pages; ++i)
        {
            PDPage page = document.getPage (i);            PDResources resources = page.getResources ();
            if (resources == null)
                continue;            Iterator<COSName> iter = resources.getFontNames ().iterator ();            while (iter.hasNext ())
            {
                COSName cos = iter.next ();
                try
                {
                    PDFont font = resources.getFont (cos);
                    System.out.println (font.getName ());                    if (font instanceof PDSimpleFont)
                    {
                        PDSimpleFont simpleFont = (PDSimpleFont) font;
                        Encoding encoding = simpleFont.getEncoding ();
                        if (encoding != null)
                        {
                            try
                            {
                                System.out.println ("* encoding=" + encoding.getEncodingName ());
                            }
                            catch (Exception e)
                            {
                                e.printStackTrace ();
                            }
                        }
                    }
                }
                catch (IOException e)
                {
                    e.printStackTrace ();
                }
            }
        }        document.close ();

{code}
 

  was:
Wenn iterating all fonts in a PDF a call to PDSimpleFont.getEncoding().getEncodingName () throws a NullPointerException inside the library code for the attached sample-PDF.

The situation can be reproduced, when running the following loop for the attached PDF (test.pdf):

{{{{}}}}document = PDDocument.load (file);

int pages = document.getNumberOfPages ();

for (int i = 0; i < pages; ++i)
 {
 PDPage page = document.getPage (i);

PDResources resources = page.getResources ();
 if (resources == null)
 continue;

Iterator<COSName> iter = resources.getFontNames ().iterator ();

while (iter.hasNext ())
 {
 COSName cos = iter.next ();
 try
 {
 PDFont font = resources.getFont (cos);
 if (font instanceof PDSimpleFont)
 {
 PDSimpleFont simpleFont = (PDSimpleFont) font;
 Encoding encoding = simpleFont.getEncoding ();
 if (encoding != null)
 {
 try
 {
 System.out.println ("* encoding=" + encoding.getEncodingName ());
}
 catch (Exception e)
 {
 e.printStackTrace ();
}
 }

}
 }
 catch (IOException e)
 {
 e.printStackTrace ();
 }
 }
 }

document.close ();

{{}}{{ }}


> Call to Dictionary.encoding throws NullPointerException for some PDF's
> ----------------------------------------------------------------------
>
>                 Key: PDFBOX-4878
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4878
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 2.0.20
>            Reporter: Thomas Gawehn
>            Priority: Critical
>         Attachments: test.pdf
>
>
> Wenn iterating all fonts in a PDF a call to PDSimpleFont.getEncoding().getEncodingName () throws a NullPointerException inside the library code for the attached sample-PDF.
> The situation can be reproduced, when running the following loop for the attached PDF (test.pdf):
>  
>  
> {code:java}
>         document = PDDocument.load (file);        int pages = document.getNumberOfPages ();        for (int i = 0; i < pages; ++i)
>         {
>             PDPage page = document.getPage (i);            PDResources resources = page.getResources ();
>             if (resources == null)
>                 continue;            Iterator<COSName> iter = resources.getFontNames ().iterator ();            while (iter.hasNext ())
>             {
>                 COSName cos = iter.next ();
>                 try
>                 {
>                     PDFont font = resources.getFont (cos);
>                     System.out.println (font.getName ());                    if (font instanceof PDSimpleFont)
>                     {
>                         PDSimpleFont simpleFont = (PDSimpleFont) font;
>                         Encoding encoding = simpleFont.getEncoding ();
>                         if (encoding != null)
>                         {
>                             try
>                             {
>                                 System.out.println ("* encoding=" + encoding.getEncodingName ());
>                             }
>                             catch (Exception e)
>                             {
>                                 e.printStackTrace ();
>                             }
>                         }
>                     }
>                 }
>                 catch (IOException e)
>                 {
>                     e.printStackTrace ();
>                 }
>             }
>         }        document.close ();
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org