You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Robert Kaulbach (Jira)" <ji...@apache.org> on 2020/08/14 21:45:00 UTC
[jira] [Created] (TIKA-3167) Link content not extracted from
textbox in .xlsx file
Robert Kaulbach created TIKA-3167:
-------------------------------------
Summary: Link content not extracted from textbox in .xlsx file
Key: TIKA-3167
URL: https://issues.apache.org/jira/browse/TIKA-3167
Project: Tika
Issue Type: Bug
Affects Versions: 1.24.1
Reporter: Robert Kaulbach
Attachments: linked-textbox.xlsx
Attached .xlsx file was created in LibreOffice, I inserted a textbox and then put a hyperlink inside. After parsing with Tika using LinkContentHandler, the link is not in the output. I would expect to see "http://example.test" parsed out of the file.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)