You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2020/01/21 16:24:00 UTC

[jira] [Commented] (PDFBOX-4747) ImageType.BINARY not extracting all charecters

    [ https://issues.apache.org/jira/browse/PDFBOX-4747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17020346#comment-17020346 ] 

Tilman Hausherr commented on PDFBOX-4747:
-----------------------------------------

I assume you mean the disclaimer. We have only black and white, so something below 0.5 will be black and something above will be white. The text is in light gray on lighter gray so you lose. This is a terrible PDF, architectural plans should have vector graphics, this PDF is a poor quality low resolution scan. It even looks bad in color at 300dpi.

What you could try is to render it in color or in grey and then use filters (e.g. from jhlabs) to convert to b/w. Another thing to try would be to pass RenderingHints to PDFRenderer but I'm pessimistic on that one.

> ImageType.BINARY not extracting all charecters
> ----------------------------------------------
>
>                 Key: PDFBOX-4747
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4747
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 2.0.18
>         Environment: java 1.8 update 201
>            Reporter: krishna prasad
>            Priority: Major
>         Attachments: sample.pdf, sample.png
>
>
> I am trying to convert pdfpage to image in binary mode. In the generated Png file one block of complete text not visible. Please find the attachments.
>  
> Code used as below
> PDFRenderer renderer = new PDFRenderer(document);
> BufferedImage image = renderer.renderImageWithDPI(0, 300, ImageType.BINARY);
> ImageIO.write(image, "PNG", new File("<filePath>/myimage.png"));



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org