
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2017/11/06 16:50:00 UTC

[jira] [Resolved] (TIKA-2493) Allow Extraction of Javascript from PDFs

     [ https://issues.apache.org/jira/browse/TIKA-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison resolved TIKA-2493.
-------------------------------
    Resolution: Not A Problem

> Allow Extraction of Javascript from PDFs
> ----------------------------------------
>
>                 Key: TIKA-2493
>                 URL: https://issues.apache.org/jira/browse/TIKA-2493
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Rahul Veeramalla
>            Priority: Blocker
>
> I have a use case in which I need to accept PDF uploads as part of a file upload feature that I am building for my application. Based on our security team's recommendation, I need to scan each PDF for embedded JavaScript, attachments, and links, and block any PDF that contains them.
> I was able to figure out how to extract hyperlinks and attachments from a PDF using Tika.
> However, I cannot find a way to extract JavaScript from PDFs.
> I need help determining whether a PDF contains JavaScript elements or code.
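
As a rough illustration of the yes/no check asked for above, here is a minimal sketch that uses only the Java standard library. It is not Tika's mechanism and makes no claim about what Tika offers; the class name PdfJavaScriptSniffer and the method containsJavaScriptMarker are hypothetical. It assumes that JavaScript actions show up as the PDF name tokens /JS or /JavaScript in the raw bytes, and it decodes #xx hex escapes in names first (for example /J#61vaScript). Because it never decompresses anything, it will miss names stored inside compressed object streams or encrypted documents, and it can report false positives when the same bytes happen to occur in binary stream data. Treat a hit as a reason to reject or inspect further, not as proof either way.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: a byte-level heuristic, not a PDF parser.
final class PdfJavaScriptSniffer {

    // "#xx" hex escapes are allowed inside PDF name tokens, e.g. /J#61vaScript.
    private static final Pattern HEX_ESCAPE = Pattern.compile("#([0-9A-Fa-f]{2})");

    // /JS or /JavaScript followed by whitespace, NUL, a PDF delimiter, or end of input,
    // so that longer names such as /JSFoo do not match.
    private static final Pattern JS_NAME =
            Pattern.compile("/(?:JavaScript|JS)(?=[\\s\\x00()<>\\[\\]{}/%]|\\z)");

    private PdfJavaScriptSniffer() {}

    // Returns true if the raw bytes contain a /JS or /JavaScript name token.
    static boolean containsJavaScriptMarker(byte[] pdfBytes) {
        // ISO-8859-1 maps each byte to exactly one char, so nothing is lost.
        String raw = new String(pdfBytes, StandardCharsets.ISO_8859_1);
        return JS_NAME.matcher(decodeHexEscapes(raw)).find();
    }

    // Replaces every "#xx" sequence with the character it encodes.
    private static String decodeHexEscapes(String s) {
        Matcher m = HEX_ESCAPE.matcher(s);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            char decoded = (char) Integer.parseInt(m.group(1), 16);
            m.appendReplacement(out, Matcher.quoteReplacement(String.valueOf(decoded)));
        }
        m.appendTail(out);
        return out.toString();
    }
}
```

In an upload handler, the uploaded bytes (for example from java.nio.file.Files.readAllBytes) could be passed to containsJavaScriptMarker before any further processing. A stricter policy would pair this with a real PDF parser so that compressed objects are also examined.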


