You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@carbondata.apache.org by "liujun (JIRA)" <ji...@apache.org> on 2016/07/07 11:03:10 UTC

[jira] [Created] (CARBONDATA-44) Table can not many times load data

liujun created CARBONDATA-44:
--------------------------------

             Summary: Table can not many times load data
                 Key: CARBONDATA-44
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-44
             Project: CarbonData
          Issue Type: Bug
         Environment: carbondata branch master
osx 10.11
hadoop 2.6.4
spark 1.6.1
            Reporter: liujun


follow step the problem can be recreated
1.cc.sql("create table if not exists tt5 (id string, name string, city string, age Int) stored by 'org.apache.carbondata.format'")
2.cc.sql(s"load data inpath 'hdfs://127.0.0.1:9000/test/sample.csv' into table tt5")
3.cc.sql("select * from tt5").show
------- no problem with the above steps 
4.cc.sql(s"load data inpath 'hdfs://127.0.0.1:9000/test/sample.csv' into table tt5")
5.cc.sql("select * from tt5").show
------- display error message:
ERROR 07-07 17:56:52,995 - main
java.io.IOException: java.io.FileNotFoundException: File does not exist: hdfs://127.0.0.1:9000/carbondata/default/tt5/Fact/Part0/Segment_0
	at org.carbondata.hadoop.CarbonInputFormat.getSplits(CarbonInputFormat.java:286)
	at org.carbondata.spark.rdd.CarbonScanRDD.getPartitions(CarbonScanRDD.scala:84)
	at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:239)
	at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:237)
	at scala.Option.getOrElse(Option.scala:120)
	at org.apache.spark.rdd.RDD.partitions(RDD.scala:237)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)