You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2017/11/27 18:57:04 UTC

[jira] [Commented] (TIKA-2510) Embedded MP3 file in PPTX document no longer identified

    [ https://issues.apache.org/jira/browse/TIKA-2510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16267255#comment-16267255 ] 

Hudson commented on TIKA-2510:
------------------------------

SUCCESS: Integrated in Jenkins build Tika-trunk #1397 (See [https://builds.apache.org/job/Tika-trunk/1397/])
TIKA-2510 -- Extract media files from ooxml (tallison: [https://github.com/apache/tika/commit/d4fd659ac5c3070104a85df4a535afe570b08a0e])
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/AbstractOOXMLExtractor.java
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/SXSLFExtractorTest.java
* (edit) CHANGES.txt
* (add) tika-parsers/src/test/resources/test-documents/testPPT_embeddedMP3.pptx
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java


> Embedded MP3 file in PPTX document no longer identified
> -------------------------------------------------------
>
>                 Key: TIKA-2510
>                 URL: https://issues.apache.org/jira/browse/TIKA-2510
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.15
>            Reporter: Eamonn Saunders
>            Assignee: Tim Allison
>            Priority: Minor
>             Fix For: 1.17
>
>         Attachments: Windows Audio File.pptx, tika-1.14-output.json, tika-1.15-output.json
>
>
> I'm attaching a sample PPTX file with an embedded MP3 file along with JSON files produced by Tika App (versions 1.14 and 1.15).
> Notice that the 1.14 output identifies the embedded MP3 file while the 1.15 version does not.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)