You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Robert Kaulbach (Jira)" <ji...@apache.org> on 2020/08/14 01:01:09 UTC

[jira] [Created] (TIKA-3165) Missing link after parsing odp/otp file with hyperlinked shape

Robert Kaulbach created TIKA-3165:
-------------------------------------

             Summary: Missing link after parsing odp/otp file with hyperlinked shape
                 Key: TIKA-3165
                 URL: https://issues.apache.org/jira/browse/TIKA-3165
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.24.1
            Reporter: Robert Kaulbach
         Attachments: linked-shape.odp, linked-shape.otp

I created this file in LibreOffice by drawing a shape and then adding a hyperlink to it. After parsing with Tika using LinkContentHandler, the link text isn't extracted. I would expect to see "http://example.test" in the links output.

It looks like the tag with the link inside gets dropped in ElementMappingContentHandler because there isn't a mapping for it in OpenDocumentContentParser.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)