You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by "Shannon, Kevin" <Sh...@principal.com> on 2013/08/01 14:46:56 UTC

RE: pdf to image conversion -- font problem?

Andreas,

Thanks for the note and enjoy your vacation!

I saw those commits in the trunk, so I'll give them a try this week.

Thanks again for the quick note.  Sorry to interrupt the fun!

Kevin

_____________________________________________
From: Andreas Lehmkühler
Sent: Tuesday, July 31, 2013 5:43 AM
To: users@pdfbox.apache.org
Subject: Re: pdf to image conversion -- font problem?


Hi,

There are some well known font issues. All released versions rely on the jdk for font rendering.
All jdk versions are known to habe problems with subsets of fonts. Newer jdks link yours are
showing some new issues. We are working on our own rendering routines so that pdfbox won't
rely on the jdk anymore. This will be part of the next major release and some oft the improvements
are already part oft the current trunk.
I didn't have a look at your pdf as I am on a vacation.

BR
Andreas Lehmkühler



"Shannon, Kevin" <Sh...@principal.com> schrieb:
Hello all,

Thanks for building the PDFBox library.  I really appreciate the hard work and complex algorithms you've been writing over the past few years.

I have been spending the better part of last week and this week researching an issue I am seeing with the latest version of Java and PDFBox.  I am unable to see some text within my PDF documents.  The text displayed in an older version of java.  When we made Java more secure in version 7u21, the text disappeared.  I am also able to replicate the issue with other PDFs from other companies, which is why I am looking for help from this list.  Unfortunately, I don't know enough about the parsers or font & file system interactions to provide a solution yet.  I am using PDFBox 1.7.1, but see the same results in newer versions of PDFBox.

I have looked into the difference between Windows 7 & Windows XP, CMYK v. RGB, FontMapping.properties, DCTDecode, and believe none of those are causing my issue.


My PDF renders correctly in PDFBox 1.7.1 running on java 1.7.09.  http://www.flickr.com/photos/8135824@N02/9394176499/
When I switch to java 1.7.21, the text disappears from the pdf.  Again, I am using PDFBox 1.7.1.  http://www.flickr.com/photos/8135824@N02/9396944366/in/photostream/
The PDF can be downloaded from Dropbox: https://www.dropbox.com/s/q3ky5mdqotb28ru/JJ1777-0.original.pdf


This pdf renders correctly in Adobe Reader 10.  http://www.flickr.com/photos/8135824@N02/9396837216/in/set-72157634846008111
The pdf renders mostly correct in PDFBox 1.7.1 running on java 1.7.09.  I think the colors might be slightly off.  http://www.flickr.com/photos/8135824@N02/9396837274/in/set-72157634846008111/
The pdf renders without text in the main area in PDFBox 1.7.1 running on java 1.7.21.  http://www.flickr.com/photos/8135824@N02/9396837232/in/set-72157634846008111/
PDF can be downloaded from the Adobe Acrobat Engineering test site: http://acroeng.adobe.com/Test_Files/pdf_standards/standards_panel//FedExForm-AUDIT-FAILURE.pdf


Any help would be appreciated.  I can work with my PDF artists to re-do the pdf, but I would need to know what settings to tell them to use.  I also have some capacity (time allocation) to make bug fixes in the code.  If you can tell me where to start, I'll do my best to provide a solution.

Thanks again,
Kevin


Command line output below>>>>
C:\Users\Kevin Shannon\Downloads\Adobe Viewer Tests>"C:\Program Files (x86)\Java\jre1.7.0_09\bin\java.exe" -jar C:\sandbox\DIIS\cleanCode13\DIISDoc\PDFBoxDoc\pdfbox-app-1.7.1.jar PDFReader
Jul 29, 2013 3:38:53 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Jul 29, 2013 3:38:54 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <«> from <XSLXGO+Arial> to the default font
Jul 29, 2013 3:38:54 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <1> from <SQWOOY+Arial,Bold> to the default font
Jul 29, 2013 3:38:54 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Jul 29, 2013 3:38:54 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <«> from <XSLXGO+Arial> to the default font
Jul 29, 2013 3:38:54 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <1> from <SQWOOY+Arial,Bold> to the default font
Jul 29, 2013 3:38:58 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <«> from <XSLXGO+Arial> to the default font
Jul 29, 2013 3:38:58 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <1> from <SQWOOY+Arial,Bold> to the default font

C:\Users\Kevin Shannon\Downloads\Adobe Viewer Tests>java -jar C:\sandbox\DIIS\cleanCode13\DIISDoc\PDFBoxDoc\pdfbox-app-1.7.1.jar PDFReader
Jul 29, 2013 3:48:18 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Jul 29, 2013 3:48:19 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <«> from <XSLXGO+Arial> to the default font
Jul 29, 2013 3:48:19 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <1> from <SQWOOY+Arial,Bold> to the default font
Jul 29, 2013 3:48:19 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Jul 29, 2013 3:48:20 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <«> from <XSLXGO+Arial> to the default font
Jul 29, 2013 3:48:20 PM org.apache.pdfbox.pdmodel.font.PDSimpleFont drawString
WARNING: Changing font on <1> from <SQWOOY+Arial,Bold> to the default font


Kevin Shannon | * Principal Financial Group | IT Application Analyst | USIS IT-IDI Design Center | +1 515 235-1801 | shannon.kevin@principal.com<ma...@principal.com> | My Profile<http://mysites.principal.com/Person.aspx?preferredname=shannon,%20kevin>
The Principal has joined the Council for Disability Awareness (CDA) in the fight to help you understand the impact of a disability - and how you can defend your income<http://go.ke-v.in/QiPKB6>.



-----Message Disclaimer-----

This e-mail message is intended only for the use of the individual or
entity to which it is addressed, and may contain information that is
privileged, confidential and exempt from disclosure under applicable law.
If you are not the intended recipient, any dissemination, distribution or
copying of this communication is strictly prohibited. If you have
received this communication in error, please notify us immediately by
reply email to Connect@principal.com and delete or destroy all copies of
the original message and attachments thereto. Email sent to or from the
Principal Financial Group or any of its member companies may be retained
as required by law or regulation.

Nothing in this message is intended to constitute an Electronic signature
for purposes of the Uniform Electronic Transactions Act (UETA) or the
Electronic Signatures in Global and National Commerce Act ("E-Sign")
unless a specific statement to the contrary is included in this message.

While this communication may be used to promote or market a transaction
or an idea that is discussed in the publication, it is intended to provide
general information about the subject matter covered and is provided with
the understanding that The Principal is not rendering legal, accounting,
or tax advice. It is not a marketed opinion and may not be used to avoid
penalties under the Internal Revenue Code. You should consult with
appropriate counsel or other advisors on all matters pertaining to legal,
tax, or accounting obligations and requirements