You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Gerard van der Hoorn (JIRA)" <ji...@apache.org> on 2016/03/14 08:56:33 UTC

[jira] [Created] (TIKA-1901) tika detect consumes stream when streams contains msoffice file

Gerard van der Hoorn created TIKA-1901:
------------------------------------------

             Summary: tika detect consumes stream when streams contains msoffice file
                 Key: TIKA-1901
                 URL: https://issues.apache.org/jira/browse/TIKA-1901
             Project: Tika
          Issue Type: Bug
          Components: detector
    Affects Versions: 1.12
            Reporter: Gerard van der Hoorn


When tika.detect is used to on ms-office file (word or excel 2003) the stream is consumed which is not as expected. According to the documentation when  the stream supports marking the position in the file will be returned to the original position.

Added is a testcase.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)