You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2014/12/04 02:59:13 UTC

[jira] [Commented] (TIKA-1218) Unable to parse a mp3 file on 1.5 getting a exception

    [ https://issues.apache.org/jira/browse/TIKA-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14233807#comment-14233807 ] 

Tyler Palsulich commented on TIKA-1218:
---------------------------------------

A simple fix is to not let the size of the block go below 0. After applying that change, most tags are still extracted and there are still some body elements:

{code}
java -jar tika-app.jar Save-the-World-Knife-Party-Remix.mp3
<?xml version="1.0" encoding="UTF-8"?><html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="xmpDM:releaseDate" content=""/>
<meta name="xmpDM:duration" content="316757.78125"/>
<meta name="Content-Length" content="8129449"/>
<meta name="xmpDM:album" content="nashvillenightsblog.com"/>
<meta name="dc:creator" content="Swedish House Mafia"/>
<meta name="Author" content="Swedish House Mafia"/>
<meta name="xmpDM:artist" content="Swedish House Mafia"/>
<meta name="xmpDM:logComment" content=""/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.mp3.Mp3Parser"/>
<meta name="creator" content="Swedish House Mafia"/>
<meta name="xmpDM:audioCompressor" content="MP3"/>
<meta name="meta:author" content="Swedish House Mafia"/>
<meta name="xmpDM:genre" content=""/>
<meta name="Content-Type" content="audio/mpeg"/>
<meta name="resourceName" content="Save-the-World-Knife-Party-Remix.mp3"/>
<meta name="dc:title" content="Save the World (Knife Party Remix) "/>
<title>Save the World (Knife Party Remix) </title>
</head>
<body><h1>Save the World (Knife Party Remix) </h1>
<p>Swedish House Mafia</p>
<p>nashvillenightsblog.com</p>
<p>316757.78</p>
</body></html>
{code}

I'll apply this change once the SVN master is up and running.

> Unable to parse a mp3 file on 1.5 getting a exception
> -----------------------------------------------------
>
>                 Key: TIKA-1218
>                 URL: https://issues.apache.org/jira/browse/TIKA-1218
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.5
>         Environment: Win 7, Java 1.7
>            Reporter: Sumeet Gorab
>            Priority: Blocker
>         Attachments: Save-the-World-Knife-Party-Remix.mp3
>
>
> Unable to parse a mp3 file on 1.5 getting following exception:
> Exception in thread "main" java.lang.NegativeArraySizeException
> 	at org.apache.tika.parser.mp3.ID3v2Frame$RawTag.<init>(ID3v2Frame.java:417)
> 	at org.apache.tika.parser.mp3.ID3v2Frame$RawTag.<init>(ID3v2Frame.java:382)
> 	at org.apache.tika.parser.mp3.ID3v2Frame$RawTagIterator.next(ID3v2Frame.java:371)
> 	at org.apache.tika.parser.mp3.ID3v24Handler.<init>(ID3v24Handler.java:49)
> 	at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:174)
> 	at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:71)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)