You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@atlas.apache.org by "Ayub Khan (JIRA)" <ji...@apache.org> on 2016/08/25 03:53:20 UTC

[jira] [Resolved] (ATLAS-1135) %spark.r interpreter is not working in Zeppelin

     [ https://issues.apache.org/jira/browse/ATLAS-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ayub Khan resolved ATLAS-1135.
------------------------------
    Resolution: Invalid

> %spark.r interpreter is not working in Zeppelin
> -----------------------------------------------
>
>                 Key: ATLAS-1135
>                 URL: https://issues.apache.org/jira/browse/ATLAS-1135
>             Project: Atlas
>          Issue Type: Bug
>         Environment: Spark 1.6.2, Zeppelin 0.6.1
>            Reporter: Pari Margu
>
> I am having Spark 1.6.2 cluster with Hadoop YARN, Oozie. I have installed Zeppelin 0.6.1(Binary package with all interpreters: zeppelin-0.6.1-bin-all.tgz). When I am trying to use SparkR script with %spark.r interpreter,
> %spark.r
> # Creating SparkConext and connecting to Cloudant DB
> sc1 <- sparkR.init(sparkEnv = list("cloudant.host"="host_name","cloudant.username"="user_name","cloudant.password"="password", "jsonstore.rdd.schemaSampleSize"="-1"))
> # Database to be connected to extract the data
> database <- "sensordata"
> # Creating Spark SQL Context
> sqlContext <- sparkRSQL.init(sc)
> # Creating DataFrame for the "sensordata" Cloudant DB
> sensorDataDF <- read.df(sqlContext, database, header='true', source = "com.cloudant.spark",inferSchema='true')
> # Get basic information about the DataFrame(sensorDataDF)
> printSchema(sensorDataDF)
> I am getting the following error(log):
> ERROR [2016-08-25 03:28:37,336] ({Thread-77} JobProgressPoller.java[run]:54) - Can not get or update progress
> org.apache.zeppelin.interpreter.InterpreterException: org.apache.thrift.transport.TTransportException
>         at org.apache.zeppelin.interpreter.remote.RemoteInterpreter.getProgress(RemoteInterpreter.java:373)
>         at org.apache.zeppelin.interpreter.LazyOpenInterpreter.getProgress(LazyOpenInterpreter.java:111)
>         at org.apache.zeppelin.notebook.Paragraph.progress(Paragraph.java:237)
>         at org.apache.zeppelin.scheduler.JobProgressPoller.run(JobProgressPoller.java:51)
> Caused by: org.apache.thrift.transport.TTransportException
>         at org.apache.thrift.transport.TIOStreamTransport.read(TIOStreamTransport.java:132)
>         at org.apache.thrift.transport.TTransport.readAll(TTransport.java:86)
>         at org.apache.thrift.protocol.TBinaryProtocol.readAll(TBinaryProtocol.java:429)
>         at org.apache.thrift.protocol.TBinaryProtocol.readI32(TBinaryProtocol.java:318)
>         at org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol.java:219)
>         at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)
>         at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.recv_getProgress(RemoteInterpreterService.java:296)
>         at org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService$Client.getProgress(RemoteInterpreterService.java:281)
>         at org.apache.zeppelin.interpreter.remote.RemoteInterpreter.getProgress(RemoteInterpreter.java:370)
>         ... 3 more
> Help would be much appreciated.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)