Posted to issues@spark.apache.org by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2017/09/19 03:52:00 UTC

[jira] [Reopened] (SPARK-21428) CliSessionState never be recognized because of IsolatedClientLoader

     [ https://issues.apache.org/jira/browse/SPARK-21428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wenchen Fan reopened SPARK-21428:
---------------------------------

> CliSessionState never be recognized because of IsolatedClientLoader
> -------------------------------------------------------------------
>
>                 Key: SPARK-21428
>                 URL: https://issues.apache.org/jira/browse/SPARK-21428
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.5.2, 1.6.3, 2.0.2, 2.1.1, 2.2.0
>            Reporter: Kent Yao
>            Assignee: Kent Yao
>            Priority: Minor
>             Fix For: 2.3.0
>
>
> When using bin/spark-sql with the builtin Hive jars, we expect to reuse the existing instance of CliSessionState.
> {quote}
>         // In `SparkSQLCLIDriver`, we have already started a `CliSessionState`,
>         // which contains information like configurations from command line. Later
>         // we call `SparkSQLEnv.init()` there, which would run into this part again.
>         // so we should keep `conf` and reuse the existing instance of `CliSessionState`.
> {quote}
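> The code around that comment looks roughly like this (a simplified sketch with approximate names, not the verbatim source):
> {code:scala}
> import org.apache.hadoop.hive.conf.HiveConf
> import org.apache.hadoop.hive.ql.session.SessionState
>
> // If SparkSQLCLIDriver already started a CliSessionState, reuse it so its
> // command-line configuration is kept; otherwise start a fresh SessionState.
> val state: SessionState = Option(SessionState.get()).getOrElse {
>   val newState = new SessionState(new HiveConf(classOf[SessionState]))
>   SessionState.start(newState) // registers the new state in the thread-local
>   newState
> }
> {code}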
> In practice this never happens: SessionState.get() at https://github.com/apache/spark/blob/master/sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala#L138 always returns null, because IsolatedClientLoader loads the Hive classes in a separate classloader, so the CliSessionState registered through the CLI's classloader is not visible there.
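> The effect can be reproduced outside Spark: a static (or thread-local) field belongs to the class as defined by a particular classloader, so a value set through one loader's copy of a class is invisible through another loader's copy. A hypothetical standalone demo (the Holder class and jar path are made up for illustration):
> {code:scala}
> import java.net.{URL, URLClassLoader}
>
> object IsolationDemo {
>   def main(args: Array[String]): Unit = {
>     // A jar containing a Holder class with static set/get methods
>     val jar = new URL("file:/tmp/holder.jar")
>     // Two loaders with a null parent: each defines its own copy of Holder
>     val loaderA = new URLClassLoader(Array(jar), null)
>     val loaderB = new URLClassLoader(Array(jar), null)
>     val holderA = loaderA.loadClass("Holder")
>     val holderB = loaderB.loadClass("Holder")
>     holderA.getMethod("set", classOf[String]).invoke(null, "cli-state")
>     // Prints null: loaderB's copy of Holder has its own static state, just
>     // as SessionState.get() inside IsolatedClientLoader cannot see the
>     // CliSessionState registered through the application classloader.
>     println(holderB.getMethod("get").invoke(null))
>   }
> }
> {code}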
> As a result, SessionState.start is called multiple times, and each call creates a fresh `hive.exec.scratchdir` session directory, as the following session shows:
> {code:java}
> spark git:(master) bin/spark-sql --conf spark.sql.hive.metastore.jars=builtin
> Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
> 17/07/16 23:29:04 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
> 17/07/16 23:29:04 INFO HiveMetaStore: 0: Opening raw store with implemenation class:org.apache.hadoop.hive.metastore.ObjectStore
> 17/07/16 23:29:04 INFO ObjectStore: ObjectStore, initialize called
> 17/07/16 23:29:04 INFO Persistence: Property hive.metastore.integral.jdo.pushdown unknown - will be ignored
> 17/07/16 23:29:04 INFO Persistence: Property datanucleus.cache.level2 unknown - will be ignored
> 17/07/16 23:29:05 INFO ObjectStore: Setting MetaStore object pin classes with hive.metastore.cache.pinobjtypes="Table,StorageDescriptor,SerDeInfo,Partition,Database,Type,FieldSchema,Order"
> 17/07/16 23:29:06 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MFieldSchema" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:06 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MOrder" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:07 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MFieldSchema" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:07 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MOrder" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:07 INFO MetaStoreDirectSql: Using direct SQL, underlying DB is DERBY
> 17/07/16 23:29:07 INFO ObjectStore: Initialized ObjectStore
> 17/07/16 23:29:07 WARN ObjectStore: Version information not found in metastore. hive.metastore.schema.verification is not enabled so recording the schema version 1.2.0
> 17/07/16 23:29:07 WARN ObjectStore: Failed to get database default, returning NoSuchObjectException
> 17/07/16 23:29:08 INFO HiveMetaStore: Added admin role in metastore
> 17/07/16 23:29:08 INFO HiveMetaStore: Added public role in metastore
> 17/07/16 23:29:08 INFO HiveMetaStore: No user is added in admin role, since config is empty
> 17/07/16 23:29:08 INFO HiveMetaStore: 0: get_all_databases
> 17/07/16 23:29:08 INFO audit: ugi=Kent	ip=unknown-ip-addr	cmd=get_all_databases
> 17/07/16 23:29:08 INFO HiveMetaStore: 0: get_functions: db=default pat=*
> 17/07/16 23:29:08 INFO audit: ugi=Kent	ip=unknown-ip-addr	cmd=get_functions: db=default pat=*
> 17/07/16 23:29:08 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MResourceUri" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:08 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/a2c40e42-08e2-4023-8464-3432ed690184_resources
> 17/07/16 23:29:08 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/a2c40e42-08e2-4023-8464-3432ed690184
> 17/07/16 23:29:08 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/Kent/a2c40e42-08e2-4023-8464-3432ed690184
> 17/07/16 23:29:08 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/a2c40e42-08e2-4023-8464-3432ed690184/_tmp_space.db
> 17/07/16 23:29:08 INFO SparkContext: Running Spark version 2.3.0-SNAPSHOT
> 17/07/16 23:29:08 INFO SparkContext: Submitted application: SparkSQL::10.0.0.8
> 17/07/16 23:29:08 INFO SecurityManager: Changing view acls to: Kent
> 17/07/16 23:29:08 INFO SecurityManager: Changing modify acls to: Kent
> 17/07/16 23:29:08 INFO SecurityManager: Changing view acls groups to:
> 17/07/16 23:29:08 INFO SecurityManager: Changing modify acls groups to:
> 17/07/16 23:29:08 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users  with view permissions: Set(Kent); groups with view permissions: Set(); users  with modify permissions: Set(Kent); groups with modify permissions: Set()
> 17/07/16 23:29:09 INFO Utils: Successfully started service 'sparkDriver' on port 51369.
> 17/07/16 23:29:09 INFO SparkEnv: Registering MapOutputTracker
> 17/07/16 23:29:09 INFO SparkEnv: Registering BlockManagerMaster
> 17/07/16 23:29:09 INFO BlockManagerMasterEndpoint: Using org.apache.spark.storage.DefaultTopologyMapper for getting topology information
> 17/07/16 23:29:09 INFO BlockManagerMasterEndpoint: BlockManagerMasterEndpoint up
> 17/07/16 23:29:09 INFO DiskBlockManager: Created local directory at /private/var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/blockmgr-8bf36015-6c27-4b4b-b90b-126f7a59b5a0
> 17/07/16 23:29:09 INFO MemoryStore: MemoryStore started with capacity 366.3 MB
> 17/07/16 23:29:09 INFO SparkEnv: Registering OutputCommitCoordinator
> 17/07/16 23:29:09 INFO Utils: Successfully started service 'SparkUI' on port 4040.
> 17/07/16 23:29:09 INFO SparkUI: Bound SparkUI to 0.0.0.0, and started at http://10.0.0.8:4040
> 17/07/16 23:29:09 INFO Executor: Starting executor ID driver on host localhost
> 17/07/16 23:29:09 INFO Utils: Successfully started service 'org.apache.spark.network.netty.NettyBlockTransferService' on port 51371.
> 17/07/16 23:29:09 INFO NettyBlockTransferService: Server created on 10.0.0.8:51371
> 17/07/16 23:29:09 INFO BlockManager: Using org.apache.spark.storage.RandomBlockReplicationPolicy for block replication policy
> 17/07/16 23:29:09 INFO BlockManagerMaster: Registering BlockManager BlockManagerId(driver, 10.0.0.8, 51371, None)
> 17/07/16 23:29:09 INFO BlockManagerMasterEndpoint: Registering block manager 10.0.0.8:51371 with 366.3 MB RAM, BlockManagerId(driver, 10.0.0.8, 51371, None)
> 17/07/16 23:29:09 INFO BlockManagerMaster: Registered BlockManager BlockManagerId(driver, 10.0.0.8, 51371, None)
> 17/07/16 23:29:09 INFO BlockManager: Initialized BlockManager: BlockManagerId(driver, 10.0.0.8, 51371, None)
> 17/07/16 23:29:10 INFO SharedState: Setting hive.metastore.warehouse.dir ('null') to the value of spark.sql.warehouse.dir ('file:/Users/Kent/Documents/spark/spark-warehouse').
> 17/07/16 23:29:10 INFO SharedState: Warehouse path is 'file:/Users/Kent/Documents/spark/spark-warehouse'.
> 17/07/16 23:29:10 INFO HiveUtils: Initializing HiveMetastoreConnection version 1.2.1 using Spark classes.
> 17/07/16 23:29:10 INFO HiveMetaStore: 0: Opening raw store with implemenation class:org.apache.hadoop.hive.metastore.ObjectStore
> 17/07/16 23:29:10 INFO ObjectStore: ObjectStore, initialize called
> 17/07/16 23:29:11 INFO Persistence: Property hive.metastore.integral.jdo.pushdown unknown - will be ignored
> 17/07/16 23:29:11 INFO Persistence: Property datanucleus.cache.level2 unknown - will be ignored
> 17/07/16 23:29:12 INFO ObjectStore: Setting MetaStore object pin classes with hive.metastore.cache.pinobjtypes="Table,StorageDescriptor,SerDeInfo,Partition,Database,Type,FieldSchema,Order"
> 17/07/16 23:29:13 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MFieldSchema" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:13 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MOrder" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:13 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MFieldSchema" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:13 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MOrder" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:13 INFO Query: Reading in results for query "org.datanucleus.store.rdbms.query.SQLQuery@0" since the connection used is closing
> 17/07/16 23:29:13 INFO MetaStoreDirectSql: Using direct SQL, underlying DB is DERBY
> 17/07/16 23:29:13 INFO ObjectStore: Initialized ObjectStore
> 17/07/16 23:29:13 INFO HiveMetaStore: Added admin role in metastore
> 17/07/16 23:29:13 INFO HiveMetaStore: Added public role in metastore
> 17/07/16 23:29:13 INFO HiveMetaStore: No user is added in admin role, since config is empty
> 17/07/16 23:29:13 INFO HiveMetaStore: 0: get_all_databases
> 17/07/16 23:29:13 INFO audit: ugi=Kent	ip=unknown-ip-addr	cmd=get_all_databases
> 17/07/16 23:29:13 INFO HiveMetaStore: 0: get_functions: db=default pat=*
> 17/07/16 23:29:13 INFO audit: ugi=Kent	ip=unknown-ip-addr	cmd=get_functions: db=default pat=*
> 17/07/16 23:29:13 INFO Datastore: The class "org.apache.hadoop.hive.metastore.model.MResourceUri" is tagged as "embedded-only" so does not have its own datastore table.
> 17/07/16 23:29:13 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/27f3c8f8-38d1-4b64-9ade-b41a07937937_resources
> 17/07/16 23:29:13 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/27f3c8f8-38d1-4b64-9ade-b41a07937937
> 17/07/16 23:29:13 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/Kent/27f3c8f8-38d1-4b64-9ade-b41a07937937
> 17/07/16 23:29:13 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/27f3c8f8-38d1-4b64-9ade-b41a07937937/_tmp_space.db
> 17/07/16 23:29:13 INFO HiveClientImpl: Warehouse location for Hive client (version 1.2.2) is file:/Users/Kent/Documents/spark/spark-warehouse
> 17/07/16 23:29:13 INFO HiveMetaStore: 0: get_database: default
> 17/07/16 23:29:13 INFO audit: ugi=Kent	ip=unknown-ip-addr	cmd=get_database: default
> 17/07/16 23:29:14 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/4b589938-6df6-4a9f-b044-99da6b1c76b4_resources
> 17/07/16 23:29:14 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/4b589938-6df6-4a9f-b044-99da6b1c76b4
> 17/07/16 23:29:14 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/Kent/4b589938-6df6-4a9f-b044-99da6b1c76b4
> 17/07/16 23:29:14 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/4b589938-6df6-4a9f-b044-99da6b1c76b4/_tmp_space.db
> 17/07/16 23:29:14 INFO HiveClientImpl: Warehouse location for Hive client (version 1.2.2) is file:/Users/Kent/Documents/spark/spark-warehouse
> 17/07/16 23:29:14 INFO HiveMetaStore: 0: get_database: global_temp
> 17/07/16 23:29:14 INFO audit: ugi=Kent	ip=unknown-ip-addr	cmd=get_database: global_temp
> 17/07/16 23:29:14 WARN ObjectStore: Failed to get database global_temp, returning NoSuchObjectException
> 17/07/16 23:29:14 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/1317e8bf-5a10-4758-bd56-3196ad9ae95e_resources
> 17/07/16 23:29:14 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/1317e8bf-5a10-4758-bd56-3196ad9ae95e
> 17/07/16 23:29:14 INFO SessionState: Created local directory: /var/folders/k2/04p4k4ws73l6711h_mz2_tq00000gn/T/Kent/1317e8bf-5a10-4758-bd56-3196ad9ae95e
> 17/07/16 23:29:14 INFO SessionState: Created HDFS directory: /tmp/hive/Kent/1317e8bf-5a10-4758-bd56-3196ad9ae95e/_tmp_space.db
> 17/07/16 23:29:14 INFO HiveClientImpl: Warehouse location for Hive client (version 1.2.2) is file:/Users/Kent/Documents/spark/spark-warehouse
> 17/07/16 23:29:14 INFO StateStoreCoordinatorRef: Registered StateStoreCoordinator endpoint
> {code}
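> Note the distinct session UUID directories above (a2c40e42-..., 27f3c8f8-..., 4b589938-..., 1317e8bf-...): each comes from a separate SessionState.start call. A guard of roughly this shape would allow reuse instead (a sketch of the idea, not the actual patch):
> {code:scala}
> import org.apache.hadoop.hive.cli.CliSessionState
> import org.apache.hadoop.hive.conf.HiveConf
> import org.apache.hadoop.hive.ql.session.SessionState
>
> def getOrCreateState(): SessionState = SessionState.get() match {
>   case cli: CliSessionState => cli // started by SparkSQLCLIDriver: reuse it
>   case _ =>
>     val state = new SessionState(new HiveConf(classOf[SessionState]))
>     SessionState.start(state)      // each call creates new scratch dirs
>     state
> }
> {code}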



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org