You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Robert Kaulbach (Jira)" <ji...@apache.org> on 2020/08/14 01:01:09 UTC
[jira] [Created] (TIKA-3165) Missing link after parsing odp/otp
file with hyperlinked shape
Robert Kaulbach created TIKA-3165:
-------------------------------------
Summary: Missing link after parsing odp/otp file with hyperlinked shape
Key: TIKA-3165
URL: https://issues.apache.org/jira/browse/TIKA-3165
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 1.24.1
Reporter: Robert Kaulbach
Attachments: linked-shape.odp, linked-shape.otp
I created this file in LibreOffice by drawing a shape and then adding a hyperlink to it. After parsing with Tika using LinkContentHandler, the link text isn't extracted. I would expect to see "http://example.test" in the links output.
It looks like the tag with the link inside gets dropped in ElementMappingContentHandler because there isn't a mapping for it in OpenDocumentContentParser.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)