You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Etienne Chauchot (JIRA)" <ji...@apache.org> on 2017/04/24 14:16:04 UTC

[jira] [Updated] (BEAM-2029) NullPointerException while evaluating GroupByKey in Spark Runner in streaming mode

     [ https://issues.apache.org/jira/browse/BEAM-2029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Etienne Chauchot updated BEAM-2029:
-----------------------------------
    Description: 
Behavior:
{{context.borrowDataset(transform)}} returns null.

stackTrace
{code}
17/04/20 15:00:58 INFO org.apache.beam.runners.spark.SparkRunner$Evaluator: Evaluating GroupByKey
Exception in thread "main" java.lang.NullPointerException
	at org.apache.beam.runners.spark.translation.streaming.StreamingTransformTranslator$6.evaluate(StreamingTransformTranslator.java:272)
	at org.apache.beam.runners.spark.translation.streaming.StreamingTransformTranslator$6.evaluate(StreamingTransformTranslator.java:267)
	at org.apache.beam.runners.spark.SparkRunner$Evaluator.doVisitTransform(SparkRunner.java:409)
	at org.apache.beam.runners.spark.SparkRunner$Evaluator.visitPrimitiveTransform(SparkRunner.java:395)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:488)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.access$400(TransformHierarchy.java:232)
	at org.apache.beam.sdk.runners.TransformHierarchy.visit(TransformHierarchy.java:207)
	at org.apache.beam.sdk.Pipeline.traverseTopologically(Pipeline.java:384)
	at org.apache.beam.runners.spark.translation.streaming.SparkRunnerStreamingContextFactory.call(SparkRunnerStreamingContextFactory.java:88)
	at org.apache.beam.runners.spark.translation.streaming.SparkRunnerStreamingContextFactory.call(SparkRunnerStreamingContextFactory.java:47)
	at org.apache.spark.streaming.api.java.JavaStreamingContext$$anonfun$10.apply(JavaStreamingContext.scala:776)
	at org.apache.spark.streaming.api.java.JavaStreamingContext$$anonfun$10.apply(JavaStreamingContext.scala:775)
	at scala.Option.getOrElse(Option.scala:120)
	at org.apache.spark.streaming.StreamingContext$.getOrCreate(StreamingContext.scala:864)
	at org.apache.spark.streaming.api.java.JavaStreamingContext$.getOrCreate(JavaStreamingContext.scala:775)
	at org.apache.spark.streaming.api.java.JavaStreamingContext.getOrCreate(JavaStreamingContext.scala)
	at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:155)
	at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:85)
	at org.apache.beam.sdk.Pipeline.run(Pipeline.java:276)
	at org.apache.beam.integration.nexmark.NexmarkRunner.run(NexmarkRunner.java:1232)
	at org.apache.beam.integration.nexmark.NexmarkDriver.runAll(NexmarkDriver.java:69)
	at org.apache.beam.integration.nexmark.drivers.NexmarkSparkDriver.main(NexmarkSparkDriver.java:46)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:498)
	at com.intellij.rt.execution.application.AppMain.main(AppMain.java:147)
{code}

  was:
To reproduce the bug, run Nexmark query7 (https://github.com/iemejia/beam/tree/BEAM-160-nexmark) 
Run main in {{org.apache.beam.integration.nexmark.drivers.NexmarkSparkDriver}}
with VMOptions: {code} -Dspark.ui.enabled=false -DSPARK_LOCAL_IP=localhost -Dsun.io.serialization.extendedDebugInfo=true {code}
with Program arguments:
{code}
--query=7 --streamTimeout=1200 --streaming=true --numEventGenerators=4 --manageResources=false --monitorJobs=true --enforceEncodability=false --enforceImmutability=false
{code}
Behavior:
{{context.borrowDataset(transform)}} returns null.

stackTrace
{code}
17/04/20 15:00:58 INFO org.apache.beam.runners.spark.SparkRunner$Evaluator: Evaluating GroupByKey
Exception in thread "main" java.lang.NullPointerException
	at org.apache.beam.runners.spark.translation.streaming.StreamingTransformTranslator$6.evaluate(StreamingTransformTranslator.java:272)
	at org.apache.beam.runners.spark.translation.streaming.StreamingTransformTranslator$6.evaluate(StreamingTransformTranslator.java:267)
	at org.apache.beam.runners.spark.SparkRunner$Evaluator.doVisitTransform(SparkRunner.java:409)
	at org.apache.beam.runners.spark.SparkRunner$Evaluator.visitPrimitiveTransform(SparkRunner.java:395)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:488)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
	at org.apache.beam.sdk.runners.TransformHierarchy$Node.access$400(TransformHierarchy.java:232)
	at org.apache.beam.sdk.runners.TransformHierarchy.visit(TransformHierarchy.java:207)
	at org.apache.beam.sdk.Pipeline.traverseTopologically(Pipeline.java:384)
	at org.apache.beam.runners.spark.translation.streaming.SparkRunnerStreamingContextFactory.call(SparkRunnerStreamingContextFactory.java:88)
	at org.apache.beam.runners.spark.translation.streaming.SparkRunnerStreamingContextFactory.call(SparkRunnerStreamingContextFactory.java:47)
	at org.apache.spark.streaming.api.java.JavaStreamingContext$$anonfun$10.apply(JavaStreamingContext.scala:776)
	at org.apache.spark.streaming.api.java.JavaStreamingContext$$anonfun$10.apply(JavaStreamingContext.scala:775)
	at scala.Option.getOrElse(Option.scala:120)
	at org.apache.spark.streaming.StreamingContext$.getOrCreate(StreamingContext.scala:864)
	at org.apache.spark.streaming.api.java.JavaStreamingContext$.getOrCreate(JavaStreamingContext.scala:775)
	at org.apache.spark.streaming.api.java.JavaStreamingContext.getOrCreate(JavaStreamingContext.scala)
	at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:155)
	at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:85)
	at org.apache.beam.sdk.Pipeline.run(Pipeline.java:276)
	at org.apache.beam.integration.nexmark.NexmarkRunner.run(NexmarkRunner.java:1232)
	at org.apache.beam.integration.nexmark.NexmarkDriver.runAll(NexmarkDriver.java:69)
	at org.apache.beam.integration.nexmark.drivers.NexmarkSparkDriver.main(NexmarkSparkDriver.java:46)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:498)
	at com.intellij.rt.execution.application.AppMain.main(AppMain.java:147)
{code}


> NullPointerException while evaluating GroupByKey in Spark Runner in streaming mode
> ----------------------------------------------------------------------------------
>
>                 Key: BEAM-2029
>                 URL: https://issues.apache.org/jira/browse/BEAM-2029
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-spark
>            Reporter: Etienne Chauchot
>            Assignee: Aviem Zur
>
> Behavior:
> {{context.borrowDataset(transform)}} returns null.
> stackTrace
> {code}
> 17/04/20 15:00:58 INFO org.apache.beam.runners.spark.SparkRunner$Evaluator: Evaluating GroupByKey
> Exception in thread "main" java.lang.NullPointerException
> 	at org.apache.beam.runners.spark.translation.streaming.StreamingTransformTranslator$6.evaluate(StreamingTransformTranslator.java:272)
> 	at org.apache.beam.runners.spark.translation.streaming.StreamingTransformTranslator$6.evaluate(StreamingTransformTranslator.java:267)
> 	at org.apache.beam.runners.spark.SparkRunner$Evaluator.doVisitTransform(SparkRunner.java:409)
> 	at org.apache.beam.runners.spark.SparkRunner$Evaluator.visitPrimitiveTransform(SparkRunner.java:395)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:488)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.visit(TransformHierarchy.java:483)
> 	at org.apache.beam.sdk.runners.TransformHierarchy$Node.access$400(TransformHierarchy.java:232)
> 	at org.apache.beam.sdk.runners.TransformHierarchy.visit(TransformHierarchy.java:207)
> 	at org.apache.beam.sdk.Pipeline.traverseTopologically(Pipeline.java:384)
> 	at org.apache.beam.runners.spark.translation.streaming.SparkRunnerStreamingContextFactory.call(SparkRunnerStreamingContextFactory.java:88)
> 	at org.apache.beam.runners.spark.translation.streaming.SparkRunnerStreamingContextFactory.call(SparkRunnerStreamingContextFactory.java:47)
> 	at org.apache.spark.streaming.api.java.JavaStreamingContext$$anonfun$10.apply(JavaStreamingContext.scala:776)
> 	at org.apache.spark.streaming.api.java.JavaStreamingContext$$anonfun$10.apply(JavaStreamingContext.scala:775)
> 	at scala.Option.getOrElse(Option.scala:120)
> 	at org.apache.spark.streaming.StreamingContext$.getOrCreate(StreamingContext.scala:864)
> 	at org.apache.spark.streaming.api.java.JavaStreamingContext$.getOrCreate(JavaStreamingContext.scala:775)
> 	at org.apache.spark.streaming.api.java.JavaStreamingContext.getOrCreate(JavaStreamingContext.scala)
> 	at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:155)
> 	at org.apache.beam.runners.spark.SparkRunner.run(SparkRunner.java:85)
> 	at org.apache.beam.sdk.Pipeline.run(Pipeline.java:276)
> 	at org.apache.beam.integration.nexmark.NexmarkRunner.run(NexmarkRunner.java:1232)
> 	at org.apache.beam.integration.nexmark.NexmarkDriver.runAll(NexmarkDriver.java:69)
> 	at org.apache.beam.integration.nexmark.drivers.NexmarkSparkDriver.main(NexmarkSparkDriver.java:46)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:498)
> 	at com.intellij.rt.execution.application.AppMain.main(AppMain.java:147)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)