You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kudu.apache.org by "Chris George (Code Review)" <ge...@cloudera.org> on 2016/04/22 20:11:56 UTC

[kudu-CR] Adding kudu datasource to spark with predicate pushdowns and column projections. Uses scan tokens to setup partition. Example ussage: add kudu-spark-0.9.0-SNAPSHOT-withdeps.jar to your spark-shell import org.kududb.spark.kudu._ val temp = sqlContext.re

Chris George has uploaded a new change for review.

  http://gerrit.cloudera.org:8080/2847

Change subject: Adding kudu datasource to spark with predicate pushdowns and column projections. Uses scan tokens to setup partition. Example ussage: add kudu-spark-0.9.0-SNAPSHOT-withdeps.jar to your spark-shell  import org.kududb.spark.kudu._  val temp =  sqlContext.re
......................................................................

Adding kudu datasource to spark with predicate pushdowns and column projections. Uses scan tokens to setup partition. Example ussage: add kudu-spark-0.9.0-SNAPSHOT-withdeps.jar to your spark-shell
 import org.kududb.spark.kudu._
 val temp =  sqlContext.read.options(Map("kudu.master"-> "master-here","kudu.table"-> "address")).kudu
 then use temp as a normal dataframe
 temp.select("id").filter(col("id")===(10000000360L)).show
 temp.registerTempTable("temp")
 sqlContext.sql("select id, postalcode1pgfid from temp where id=10000000360").show

 I also made a jar with dependencies to simplify use with spark

Change-Id: I142411aa5a6a7f770ab9e582f102dd3eb5acd922
---
M java/kudu-spark/pom.xml
D java/kudu-spark/src/main/scala/org/kududb/spark/DefaultSource.scala
D java/kudu-spark/src/main/scala/org/kududb/spark/KuduRDD.scala
A java/kudu-spark/src/main/scala/org/kududb/spark/kudu/DefaultSource.scala
R java/kudu-spark/src/main/scala/org/kududb/spark/kudu/KuduContext.scala
A java/kudu-spark/src/main/scala/org/kududb/spark/kudu/KuduRDD.scala
A java/kudu-spark/src/main/scala/org/kududb/spark/kudu/package.scala
R java/kudu-spark/src/test/scala/org/kududb/spark/kudu/DefaultSourceTest.scala
R java/kudu-spark/src/test/scala/org/kududb/spark/kudu/KuduContextTest.scala
R java/kudu-spark/src/test/scala/org/kududb/spark/kudu/TestContext.scala
10 files changed, 445 insertions(+), 393 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/kudu refs/changes/47/2847/1
-- 
To view, visit http://gerrit.cloudera.org:8080/2847
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: I142411aa5a6a7f770ab9e582f102dd3eb5acd922
Gerrit-PatchSet: 1
Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-Owner: Chris George <ch...@rms.com>