You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Brock Noland (JIRA)" <ji...@apache.org> on 2014/08/25 22:52:59 UTC

[jira] [Commented] (HIVE-7842) load_dyn_part1.q fails with an assertion [Spark Branch]

    [ https://issues.apache.org/jira/browse/HIVE-7842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14109705#comment-14109705 ] 

Brock Noland commented on HIVE-7842:
------------------------------------

IIRC assertions are not enabled for MR when tasks are run so this might fail with MR as well.

> load_dyn_part1.q fails with an assertion [Spark Branch]
> -------------------------------------------------------
>
>                 Key: HIVE-7842
>                 URL: https://issues.apache.org/jira/browse/HIVE-7842
>             Project: Hive
>          Issue Type: Bug
>          Components: Spark
>    Affects Versions: spark-branch
>            Reporter: Venki Korukanti
>            Assignee: Venki Korukanti
>              Labels: Spark-M1
>             Fix For: spark-branch
>
>
> On spark branch, load_dyn_part1.q fails with following assertion. Looks like SerDe is receiving invalid ByteWritable buffer.
> {code}
> java.lang.AssertionError
> "org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:205)"
> "org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:187)"
> "org.apache.hadoop.hive.ql.exec.spark.SparkReduceRecordHandler.processRow(SparkReduceRecordHandler.java:186)"
> "org.apache.hadoop.hive.ql.exec.spark.HiveReduceFunctionResultList.processNextRecord(HiveReduceFunctionResultList.java:47)"
> "org.apache.hadoop.hive.ql.exec.spark.HiveReduceFunctionResultList.processNextRecord(HiveReduceFunctionResultList.java:27)"
> "org.apache.hadoop.hive.ql.exec.spark.HiveBaseFunctionResultList$ResultIterator.hasNext(HiveBaseFunctionResultList.java:98)"
> "scala.collection.convert.Wrappers$JIteratorWrapper.hasNext(Wrappers.scala:41)"
> "scala.collection.Iterator$class.foreach(Iterator.scala:727)"
> "scala.collection.AbstractIterator.foreach(Iterator.scala:1157)"
> "org.apache.spark.rdd.RDD$$anonfun$foreach$1.apply(RDD.scala:759)"
> "org.apache.spark.rdd.RDD$$anonfun$foreach$1.apply(RDD.scala:759)"
> "org.apache.spark.SparkContext$$anonfun$runJob$4.apply(SparkContext.scala:1121)"
> "org.apache.spark.SparkContext$$anonfun$runJob$4.apply(SparkContext.scala:1121)"
> "org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:62)"
> "org.apache.spark.scheduler.Task.run(Task.scala:54)"
> "org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:199)"
> "java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)"
> "java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)"
> "java.lang.Thread.run(Thread.java:744)"
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)