Posted to issues@spark.apache.org by "Cheng Lian (JIRA)" <ji...@apache.org> on 2015/08/04 17:17:05 UTC

[jira] [Created] (SPARK-9600) DataFrameWriter.saveAsTable always writes data to "/user/hive/warehouse"

Cheng Lian created SPARK-9600:
---------------------------------

             Summary: DataFrameWriter.saveAsTable always writes data to "/user/hive/warehouse"
                 Key: SPARK-9600
                 URL: https://issues.apache.org/jira/browse/SPARK-9600
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 1.4.1, 1.5.0
            Reporter: Cheng Lian
            Priority: Critical


With a {{hive-site.xml}} that sets a non-default {{hive.metastore.warehouse.dir}}, Spark SQL still writes to the default warehouse location {{/user/hive/warehouse}} when saving data source tables with {{DataFrameWriter.saveAsTable()}}. For example, given this {{hive-site.xml}}:
{noformat}
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <value>jdbc:mysql://localhost/metastore_hive13_hadoop2</value>
  </property>

  <property>
    <name>javax.jdo.option.ConnectionDriverName</name>
    <value>com.mysql.jdbc.Driver</value>
  </property>

  <property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <value>hive</value>
  </property>

  <property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>password</value>
  </property>

  <property>
    <name>hive.metastore.warehouse.dir</name>
    <value>hdfs://localhost:9000/user/hive/warehouse_hive13</value>
  </property>
</configuration>
{noformat}
Spark shell snippet to reproduce:
{noformat}
sqlContext.range(10).write.saveAsTable("xxx")
{noformat}
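Running {{DESC EXTENDED xxx}} in Hive to check the SerDe properties shows that the table {{location}} honors the configured warehouse directory, while the {{path}} SerDe property still points to the default one: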
{noformat}
...
location:hdfs://localhost:9000/user/hive/warehouse_hive13/xxx
...
parameters:{path=hdfs://localhost:9000/user/hive/warehouse/xxx, serialization.format=1})
...
{noformat}
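The mismatch above can be checked mechanically. The following is only a sketch: {{DescExtendedCheck}} and its regular expressions are hypothetical helpers written for this report, not part of Spark or Hive, and they assume the {{location:}} and {{path=}} fields appear in the textual form shown above.

```scala
// Hypothetical helper (not part of Spark or Hive): extracts the table
// location and the `path` SerDe parameter from DESC EXTENDED output and
// reports them when they differ.
object DescExtendedCheck {
  // Values end at a comma, whitespace, ')' or '}'.
  private val Location  = """location:([^,\s)}]+)""".r
  private val PathParam = """[{\s,]path=([^,\s)}]+)""".r

  /** Returns Some((location, path)) when both fields are present and differ. */
  def mismatch(desc: String): Option[(String, String)] =
    for {
      loc  <- Location.findFirstMatchIn(desc).map(_.group(1))
      path <- PathParam.findFirstMatchIn(desc).map(_.group(1))
      if loc != path
    } yield (loc, path)
}

// The two relevant lines from the output in this report.
val sample =
  """location:hdfs://localhost:9000/user/hive/warehouse_hive13/xxx
    |parameters:{path=hdfs://localhost:9000/user/hive/warehouse/xxx, serialization.format=1})""".stripMargin

val result = DescExtendedCheck.mismatch(sample)
```

For the output in this report, {{result}} holds the two differing paths; it would be {{None}} if the {{path}} property matched the table location.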
We are probably using the execution Hive configuration when calling {{HiveMetastoreCatalog.hiveDefaultTableFilePath()}}, which would explain why only the {{path}} property ignores the configured warehouse directory.
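A minimal model of the suspected mechanism, for illustration only. {{WarehousePathSketch}} and {{defaultTablePath}} are hypothetical and are not Spark's code. The sketch assumes that the default table path is the warehouse directory followed by the table name, and that the execution configuration does not carry the {{hive-site.xml}} override. It also omits the filesystem qualification ({{hdfs://localhost:9000}}) seen in the real output.

```scala
// Hypothetical model of the suspected behavior; not Spark's actual code.
object WarehousePathSketch {
  // Default warehouse location named in this report.
  val DefaultWarehouseDir = "/user/hive/warehouse"

  /** Assumed rule: default table path = warehouse dir + "/" + table name. */
  def defaultTablePath(conf: Map[String, String], table: String): String = {
    val warehouse =
      conf.getOrElse("hive.metastore.warehouse.dir", DefaultWarehouseDir)
    warehouse.stripSuffix("/") + "/" + table
  }
}

// Configuration as loaded from the hive-site.xml in this report.
val metastoreConf = Map(
  "hive.metastore.warehouse.dir" ->
    "hdfs://localhost:9000/user/hive/warehouse_hive13")

// Assumption: the execution configuration lacks the override.
val executionConf = Map.empty[String, String]

val expectedPath = WarehousePathSketch.defaultTablePath(metastoreConf, "xxx")
val observedPath = WarehousePathSketch.defaultTablePath(executionConf, "xxx")
```

Under these assumptions, {{expectedPath}} ends in {{warehouse_hive13/xxx}} while {{observedPath}} ends in {{warehouse/xxx}}, matching the {{location}} and {{path}} values reported above.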


