You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@pdfbox.apache.org by Tim Allison <ta...@apache.org> on 2021/04/29 13:24:14 UTC

Detecting black (or any?) highlighting?

All,

  Tilman recently tweeted about a failed redaction with black
highlighter [0].  I realize there are probably 18 different ways of
failing with redaction, but is there a fairly straightforward way to
look for black highlighting with PDFBox?

  In the particular file referenced from Tilman's tweet, if I
understand correctly, I see a bunch of 'm' (start a new subpath) and l
(lowercase L) (append a straight line) in the particular file that the
article focused on and then 'f' (fill the path).

         Cheers,

              Tim

[0] https://twitter.com/Passwort12345/status/1387066421130244098

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org


Re: Detecting black (or any?) highlighting?

Posted by Tilman Hausherr <TH...@t-online.de>.
The first highlight is this

   q
     541.73 639.07 m
     70.765 639.07 l
     68.163 641.672 68.163 647.506 70.765 650.108 c
     70.765 639.12 l
     99.952 639.12 l
     99.952 639.11 l
     541.586 639.11 l
     541.586 639.12 l
     541.779 639.12 l
     541.763 639.102 541.746 639.085 541.73 639.07 c
     541.955 639.31 m
     541.955 649.869 l
     544.257 647.245 544.257 641.932 541.955 639.31 c
     W
     n
     /Cs6 cs
     0 0 0 scn
     /GS1 gs
     68.163 639.07 476.094 11.039 re
     f
   Q

The part before W n sets a clipping, the "0 0 0 scn" is black, and then 
it fills a rectangle. The properties of /GS1 decide whether it's 
transparent or not.

Tilman


Am 29.04.2021 um 15:24 schrieb Tim Allison:
> All,
>
>    Tilman recently tweeted about a failed redaction with black
> highlighter [0].  I realize there are probably 18 different ways of
> failing with redaction, but is there a fairly straightforward way to
> look for black highlighting with PDFBox?
>
>    In the particular file referenced from Tilman's tweet, if I
> understand correctly, I see a bunch of 'm' (start a new subpath) and l
> (lowercase L) (append a straight line) in the particular file that the
> article focused on and then 'f' (fill the path).
>
>           Cheers,
>
>                Tim
>
> [0] https://twitter.com/Passwort12345/status/1387066421130244098
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org