You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Oliver Heger (JIRA)" <ji...@apache.org> on 2013/02/15 21:41:13 UTC

[jira] [Updated] (TIKA-991) Mp3Parser cannot extract the duration of an audio file

     [ https://issues.apache.org/jira/browse/TIKA-991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Oliver Heger updated TIKA-991:
------------------------------

    Attachment: tika-991_3.patch

As promised, this is an additional patch fixing a corner case in TailStream. It prevents that the MP3 parser can enter an endless loop when a skip operation is done at the end of the stream.
                
> Mp3Parser cannot extract the duration of an audio file
> ------------------------------------------------------
>
>                 Key: TIKA-991
>                 URL: https://issues.apache.org/jira/browse/TIKA-991
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.2
>            Reporter: Oliver Heger
>              Labels: patch
>         Attachments: tika-991_2.patch, tika-991_3.patch
>
>
> The duration of an MP3 file is an important information. Currently, {{Mp3Parser}} is not able to extract this data from a file.
> Unfortunately, it is not easy to implement this functionality with the current design of the {{Mp3Parser}} class. In order to obtain the duration, all audio frames have to be read, and their durations have to be summed up. Currently, only the beginning (the ID3v2 tags) and the end (lyrics and ID3v1) are evaluated. Processing the whole file would have additional benefits, e.g. MP3s with variable bit rates could be detected.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira