You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2017/02/02 20:53:51 UTC

[jira] [Commented] (TIKA-2259) Include hyperlinks from widget annotations

    [ https://issues.apache.org/jira/browse/TIKA-2259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15850470#comment-15850470 ] 

Hudson commented on TIKA-2259:
------------------------------

SUCCESS: Integrated in Jenkins build Tika-trunk #1193 (See [https://builds.apache.org/job/Tika-trunk/1193/])
TIKA-2259 -- improve url extraction from PDFs = copy Tilman Hausherr's (tallison: rev 7555b136d9ba046e2007d1f305f707948fcbcbc3)
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/pdf/AbstractPDF2XHTML.java


> Include hyperlinks from widget annotations
> ------------------------------------------
>
>                 Key: TIKA-2259
>                 URL: https://issues.apache.org/jira/browse/TIKA-2259
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Minor
>
> On PDFBOX-3644, [~tilman] recently improved PDFBox's {{printURLs}} to extract urls embedded in widget annotations.  We should follow his lead.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)