Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2014/12/04 02:59:13 UTC
[jira] [Commented] (TIKA-1218) Unable to parse a mp3 file on 1.5 getting a exception
[ https://issues.apache.org/jira/browse/TIKA-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14233807#comment-14233807 ]
Tyler Palsulich commented on TIKA-1218:
---------------------------------------
A simple fix is to clamp the block size so it never goes below 0. With that change applied, most tags are still extracted and the body still contains some elements:
{code}
java -jar tika-app.jar Save-the-World-Knife-Party-Remix.mp3
<?xml version="1.0" encoding="UTF-8"?><html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="xmpDM:releaseDate" content=""/>
<meta name="xmpDM:duration" content="316757.78125"/>
<meta name="Content-Length" content="8129449"/>
<meta name="xmpDM:album" content="nashvillenightsblog.com"/>
<meta name="dc:creator" content="Swedish House Mafia"/>
<meta name="Author" content="Swedish House Mafia"/>
<meta name="xmpDM:artist" content="Swedish House Mafia"/>
<meta name="xmpDM:logComment" content=""/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"/>
<meta name="X-Parsed-By" content="org.apache.tika.parser.mp3.Mp3Parser"/>
<meta name="creator" content="Swedish House Mafia"/>
<meta name="xmpDM:audioCompressor" content="MP3"/>
<meta name="meta:author" content="Swedish House Mafia"/>
<meta name="xmpDM:genre" content=""/>
<meta name="Content-Type" content="audio/mpeg"/>
<meta name="resourceName" content="Save-the-World-Knife-Party-Remix.mp3"/>
<meta name="dc:title" content="Save the World (Knife Party Remix) "/>
<title>Save the World (Knife Party Remix) </title>
</head>
<body><h1>Save the World (Knife Party Remix) </h1>
<p>Swedish House Mafia</p>
<p>nashvillenightsblog.com</p>
<p>316757.78</p>
</body></html>
{code}
I'll apply this change once the SVN master is up and running.
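
For illustration, the clamp described above might look roughly like the sketch below. This is not the actual patch: FrameSizeGuard, clampSize and readBlock are hypothetical names made up for this example, and the real change would go wherever ID3v2Frame allocates the tag's byte array. The idea is only that a negative declared size is treated as 0 (and an oversized one is capped at the bytes that remain), so the array allocation can no longer throw NegativeArraySizeException.

```java
import java.util.Arrays;

/** Hypothetical helper for illustration only; not part of Tika's ID3v2Frame. */
final class FrameSizeGuard {
    private FrameSizeGuard() {}

    /** Returns the declared size, or 0 if the declared size is negative. */
    static int clampSize(int declaredSize) {
        return Math.max(0, declaredSize);
    }

    /**
     * Copies at most declaredSize bytes from source, starting at offset.
     * A negative declared size yields an empty block, and a declared size
     * that runs past the end of source is cut down to the bytes available.
     */
    static byte[] readBlock(byte[] source, int offset, int declaredSize) {
        int start = Math.min(Math.max(0, offset), source.length);
        int size = Math.min(clampSize(declaredSize), source.length - start);
        return Arrays.copyOfRange(source, start, start + size);
    }
}
```

With this guard, a corrupt frame that declares a negative size produces an empty block instead of an exception, which would be consistent with the output above where a few fields (releaseDate, genre) come back empty while the rest of the tags are still read.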
> Unable to parse a mp3 file on 1.5 getting a exception
> -----------------------------------------------------
>
> Key: TIKA-1218
> URL: https://issues.apache.org/jira/browse/TIKA-1218
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.5
> Environment: Win 7, Java 1.7
> Reporter: Sumeet Gorab
> Priority: Blocker
> Attachments: Save-the-World-Knife-Party-Remix.mp3
>
>
> Unable to parse an mp3 file on 1.5; getting the following exception:
> Exception in thread "main" java.lang.NegativeArraySizeException
> at org.apache.tika.parser.mp3.ID3v2Frame$RawTag.<init>(ID3v2Frame.java:417)
> at org.apache.tika.parser.mp3.ID3v2Frame$RawTag.<init>(ID3v2Frame.java:382)
> at org.apache.tika.parser.mp3.ID3v2Frame$RawTagIterator.next(ID3v2Frame.java:371)
> at org.apache.tika.parser.mp3.ID3v24Handler.<init>(ID3v24Handler.java:49)
> at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:174)
> at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:71)