You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2020/08/12 10:42:00 UTC

[jira] [Comment Edited] (TIKA-3159) Macros not extracted from OpenDocument format Office files (flatXML format)

    [ https://issues.apache.org/jira/browse/TIKA-3159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17176245#comment-17176245 ] 

Tim Allison edited comment on TIKA-3159 at 8/12/20, 10:41 AM:
--------------------------------------------------------------

[~nick], any recs for the mime for the flat files?

{{application/vnd.oasis.opendocument.flat-text}}

ref: https://en.wikipedia.org/wiki/OpenDocument_technical_specification

The file self-identifies mime type in the root element:
{noformat}
office:mimetype="application/vnd.oasis.opendocument.text"
{noformat}

But, I think we'll want to distinguish between compressed/traditional and flat?


was (Author: tallison@mitre.org):
[~nick], any recs for the mime for the flat files?

{{application/vnd.oasis.opendocument.flat-text}}

ref: https://en.wikipedia.org/wiki/OpenDocument_technical_specification

> Macros not extracted from OpenDocument format Office files (flatXML format)
> ---------------------------------------------------------------------------
>
>                 Key: TIKA-3159
>                 URL: https://issues.apache.org/jira/browse/TIKA-3159
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.24.1
>            Reporter: Robert Kaulbach
>            Assignee: Tim Allison
>            Priority: Minor
>         Attachments: libre-calc-macro.fods, libre-launch-calc.fodp, libre-launch-calc.fodt
>
>
> Tika is not extracting the VB macros from each of the attached OpenDocument files. These files were created in LibreOffice, I added a shape to each document and then set a simple macro to run on click (which tries to launch calc.exe).
> I have enabled options in the OOXMLParser and OfficeConfig to extract macros, but it has not made a difference.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)