Posted to dev@spamassassin.apache.org by bu...@spamassassin.apache.org on 2022/03/03 04:21:55 UTC

[Bug 7960] New: PDFInfo misses valid metadata

https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7960

            Bug ID: 7960
           Summary: PDFInfo misses valid metadata
           Product: Spamassassin
           Version: SVN Trunk (Latest Devel Version)
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Plugins
          Assignee: dev@spamassassin.apache.org
          Reporter: billcole@apache.org
  Target Milestone: Undefined

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 7960] PDFInfo misses valid metadata

Posted by bu...@spamassassin.apache.org.
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7960

Bill Cole <bi...@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |FIXED
             Status|NEW                         |RESOLVED
                 CC|                            |billcole@apache.org

--- Comment #1 from Bill Cole <bi...@apache.org> ---
As reported on the Users' mailing list by Ricky Boone on 2022-03-02, the
PDFInfo plugin fails to extract much of the metadata from a sizable fraction
of today's PDFs.

I've fixed this in r1898546 by removing the optimization, no longer valid, of
skipping any line in the PDF that contains high-bit-set characters.
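
For readers unfamiliar with the failure mode, here is a minimal sketch of why
such a shortcut can lose metadata. It is written in Python for illustration
and is not the plugin's Perl code; the sample bytes, the field regex, and the
scan() helper are all assumptions made up for this example. The point it
shows: a PDF text string encoded as UTF-16BE starts with the byte-order mark
FE FF, so a filter that discards every line containing a byte >= 0x80 also
discards that metadata line.

```python
import re

# Hypothetical sample: two lines such as might appear in a PDF Info dictionary.
# The Title is a UTF-16BE string, which begins with the byte-order mark FE FF.
SAMPLE = (
    b"/Producer (ExampleWriter 1.0)\n"
    b"/Title (\xfe\xff\x00H\x00i)\n"
)

HIGH_BIT = re.compile(rb"[\x80-\xff]")               # any byte with the high bit set
FIELD = re.compile(rb"/(Title|Producer)\s*\((.*?)\)")  # deliberately naive matcher


def scan(data: bytes, skip_high_bit: bool) -> dict:
    """Collect /Title and /Producer values line by line (illustrative helper)."""
    found = {}
    for line in data.splitlines():
        if skip_high_bit and HIGH_BIT.search(line):
            # The shortcut under discussion: treat the line as binary stream
            # data and move on without looking for metadata in it.
            continue
        m = FIELD.search(line)
        if m:
            found[m.group(1).decode("ascii")] = m.group(2)
    return found


with_skip = scan(SAMPLE, skip_high_bit=True)      # Title line is dropped
without_skip = scan(SAMPLE, skip_high_bit=False)  # both fields are found
```

With the shortcut enabled only the ASCII Producer field survives; with it
disabled the Title is found as well, still in its raw UTF-16BE form.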


[Bug 7960] PDFInfo misses valid metadata

Posted by bu...@spamassassin.apache.org.
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7960

Henrik Krohns <ap...@hege.li> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |apache@hege.li

--- Comment #2 from Henrik Krohns <ap...@hege.li> ---
Committed some more cleanups. The PDF "parsing" is unbelievably naive, so I did
the same for UTF-16 decoding. I guess it does the job for now ¯\_(ツ)_/¯

Sending        trunk/lib/Mail/SpamAssassin/Plugin/PDFInfo.pm
Transmitting file data .done
Committing transaction...
Committed revision 1898557.
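
As a rough illustration of what a deliberately simple UTF-16 decode of a PDF
text string can look like, here is a Python sketch. It is not the committed
Perl code, and decode_pdf_text() is a hypothetical name. It assumes only what
the PDF format defines: a text string that starts with the bytes FE FF is
UTF-16BE, and anything else is treated here as Latin-1, which only
approximates PDFDocEncoding.

```python
def decode_pdf_text(raw: bytes) -> str:
    """Naively decode a PDF text string (illustrative helper, not plugin code)."""
    if raw.startswith(b"\xfe\xff"):
        # UTF-16BE with a leading byte-order mark: drop the BOM and decode,
        # replacing malformed sequences instead of raising an error.
        return raw[2:].decode("utf-16-be", errors="replace")
    # Assumption: Latin-1 as a stand-in for PDFDocEncoding; the two differ
    # for some code points, which a naive decoder simply ignores.
    return raw.decode("latin-1")


title = decode_pdf_text(b"\xfe\xff\x00H\x00i")
producer = decode_pdf_text(b"ExampleWriter 1.0")
```

A decoder like this ignores escape sequences, hex strings, and encrypted
documents, which may be acceptable when the goal is only to match metadata
against rules.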
