You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Boris Petrov (JIRA)" <ji...@apache.org> on 2018/09/19 10:13:00 UTC

[jira] [Created] (TIKA-2730) parseToString fails for a simple mp3

Boris Petrov created TIKA-2730:
----------------------------------

             Summary: parseToString fails for a simple mp3
                 Key: TIKA-2730
                 URL: https://issues.apache.org/jira/browse/TIKA-2730
             Project: Tika
          Issue Type: Bug
    Affects Versions: 1.19
            Reporter: Boris Petrov
         Attachments: demo.mp3

This is a regression from 1.18. I've attached the mp3 that fails. The exception I get is:
{noformat}
org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.mp3.Mp3Parser@cefe6c6
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.tika.Tika.parseToString(Tika.java:527)
    at com.company.TextExtractor.getText(TextExtractor.java:39)

    Caused by:
    java.io.EOFException: EOF: tried to skip 361 but could only skip 247
        at org.apache.tika.parser.mp3.MpegStream.skipFrame(MpegStream.java:166)
        at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:204)
        at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:71)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        ... 5 more{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Re: [jira] [Created] (TIKA-2730) parseToString fails for a simple mp3

Posted by Oleg Tikhonov <ol...@apache.org>.
Hi,
It would be great, if you could attach such a file. Or does it fails on any?


On Wed, Sep 19, 2018, 13:13 Boris Petrov (JIRA) <ji...@apache.org> wrote:

> Boris Petrov created TIKA-2730:
> ----------------------------------
>
>              Summary: parseToString fails for a simple mp3
>                  Key: TIKA-2730
>                  URL: https://issues.apache.org/jira/browse/TIKA-2730
>              Project: Tika
>           Issue Type: Bug
>     Affects Versions: 1.19
>             Reporter: Boris Petrov
>          Attachments: demo.mp3
>
> This is a regression from 1.18. I've attached the mp3 that fails. The
> exception I get is:
> {noformat}
> org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException
> from org.apache.tika.parser.mp3.Mp3Parser@cefe6c6
>     at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286)
>     at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>     at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
>     at org.apache.tika.Tika.parseToString(Tika.java:527)
>     at com.company.TextExtractor.getText(TextExtractor.java:39)
>
>     Caused by:
>     java.io.EOFException: EOF: tried to skip 361 but could only skip 247
>         at
> org.apache.tika.parser.mp3.MpegStream.skipFrame(MpegStream.java:166)
>         at
> org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:204)
>         at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:71)
>         at
> org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>         ... 5 more{noformat}
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v7.6.3#76005)
>