You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2015/11/03 08:23:27 UTC

[jira] [Comment Edited] (PDFBOX-3083) PDF to Image Conversion - Some text layers are missing from conversion

    [ https://issues.apache.org/jira/browse/PDFBOX-3083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14986827#comment-14986827 ] 

Tilman Hausherr edited comment on PDFBOX-3083 at 11/3/15 7:22 AM:
------------------------------------------------------------------

For me, it works by changing one line in PDFStreamEngine.java:
{code}
Matrix aa = Matrix.concatenate(matrix, a);
{code}
to
{code}
Matrix aa = Matrix.concatenate(a, matrix);
{code}
This change has been sitting around locally for months, so long that I can't even remember why I had it in the first place. I need to test whether it produces any regressions.


was (Author: tilman):
For me, it works by changing one line in PageDrawer.java:
{code}
Matrix aa = Matrix.concatenate(matrix, a);
{code}
to
{code}
Matrix aa = Matrix.concatenate(a, matrix);
{code}
This change has been sitting around locally for months, so long that I can't even remember why I had it in the first place. I need to test whether it produces any regressions.

> PDF to Image Conversion - Some text layers are missing from conversion
> ----------------------------------------------------------------------
>
>                 Key: PDFBOX-3083
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3083
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 1.6.0, 1.8.10, 2.0.0
>         Environment: JDK 1.7, Tomcat 7, Maven 3
> Eclipse IDE (Mars)
>            Reporter: Atthapon Junpun-eak
>            Priority: Critical
>         Attachments: myout_image_1.jpg, mypdf.pdf, out_bufferedimage_1.png
>
>
> Hi,
> I was trying to convert my pdf file to image (jpg or png) but only the grey text (see my attached pdf) were successfully converted to an image file but the blue text cannot.
> I've try to extract this pdf with several PdfBox versions : 2.0, 1.8.10, and 1.6 but it doesn't work. 
> However, with 1.6, the generated image contains both grey and blue text but in an incorrect layout/alignment.
> Can you please verify this issue?
> Please also let me know if you have any quick fix for this issue.
> -------------------------------
> Attached Sources : 
> - mypdf.pdf : original pdf to be converted
> - out_bufferedimage_1.png : converted with PdfBox 2.0 and 1.8.10
> - myout_image_1.jpg :  converted with PdfBox 2.0 and 1.6
> -------------------------------
> Thanks a lot.
> Best,



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org