You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Robert Kaulbach (Jira)" <ji...@apache.org> on 2020/08/14 21:45:00 UTC

[jira] [Created] (TIKA-3167) Link content not extracted from textbox in .xlsx file

Robert Kaulbach created TIKA-3167:
-------------------------------------

             Summary: Link content not extracted from textbox in .xlsx file
                 Key: TIKA-3167
                 URL: https://issues.apache.org/jira/browse/TIKA-3167
             Project: Tika
          Issue Type: Bug
    Affects Versions: 1.24.1
            Reporter: Robert Kaulbach
         Attachments: linked-textbox.xlsx

Attached .xlsx file was created in LibreOffice, I inserted a textbox and then put a hyperlink inside. After parsing with Tika using LinkContentHandler, the link is not in the output. I would expect to see "http://example.test" parsed out of the file.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)