Posted to dev@hive.apache.org by "Johndee Burks (JIRA)" <ji...@apache.org> on 2013/03/12 16:25:14 UTC
[jira] [Commented] (HIVE-3387) meta data file size exceeds limit
[ https://issues.apache.org/jira/browse/HIVE-3387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13600092#comment-13600092 ]
Johndee Burks commented on HIVE-3387:
-------------------------------------
Adding this comment to make this issue easier to find. The error message is below.
java.io.IOException: Split metadata size exceeded 10000000.
> meta data file size exceeds limit
> ---------------------------------
>
> Key: HIVE-3387
> URL: https://issues.apache.org/jira/browse/HIVE-3387
> Project: Hive
> Issue Type: Bug
> Affects Versions: 0.7.1
> Reporter: Alexander Alten-Lorenz
> Assignee: Navis
> Fix For: 0.10.0
>
> Attachments: HIVE-3387.1.patch.txt
>
>
> The cause is certainly that we use an array list instead of a set structure in the split locations API. Looks like a bug in Hive's CombineFileInputFormat.
> Reproduce:
> Set mapreduce.jobtracker.split.metainfo.maxsize=100000000 when submitting the Hive query. Run a large Hive query that writes data into a partitioned table. Because of the large number of splits, the job submitted to Hadoop fails with an exception that says:
> meta data size exceeds 100000000.
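To illustrate the cause suggested in the description, here is a minimal sketch of how a combined split's location list can grow when hosts are collected into an array list rather than a set. This is not Hive's or Hadoop's actual code: the class name, method names, and host names are all hypothetical, and the block counts are made up for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch; not the real CombineFileInputFormat implementation.
class SplitLocationsSketch {

    // Collects every block's hosts into a list, so repeated hosts are kept.
    static String[] locationsAsList(List<String[]> blockHosts) {
        List<String> hosts = new ArrayList<>();
        for (String[] h : blockHosts) {
            hosts.addAll(Arrays.asList(h));
        }
        return hosts.toArray(new String[0]);
    }

    // Collects every block's hosts into a set, so each host appears once.
    static String[] locationsAsSet(List<String[]> blockHosts) {
        Set<String> hosts = new LinkedHashSet<>();
        for (String[] h : blockHosts) {
            hosts.addAll(Arrays.asList(h));
        }
        return hosts.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Assumed input: 1000 blocks combined into one split, all on the same 3 hosts.
        List<String[]> blocks = new ArrayList<>();
        for (int i = 0; i < 1000; i++) {
            blocks.add(new String[] {"host1", "host2", "host3"});
        }
        System.out.println(locationsAsList(blocks).length); // prints 3000
        System.out.println(locationsAsSet(blocks).length);  // prints 3
    }
}
```

If the locations of each split are serialized into the split metadata, the list variant writes many redundant host entries per split, which would be consistent with the metadata size limit being exceeded on jobs with many splits.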
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira