You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Alexander Zarei (JIRA)" <ji...@apache.org> on 2015/05/07 02:49:59 UTC
[jira] [Commented] (DRILL-2767) Fragment error on TPCH Scale Factor
30 on a query that completed successfully previously
[ https://issues.apache.org/jira/browse/DRILL-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14531796#comment-14531796 ]
Alexander Zarei commented on DRILL-2767:
----------------------------------------
As suggested by [~vkorukanti] The following line was added to the hive storage config and it was resolved for Scale Factor 2. I will try it for Scale Factor 100 but I think it will work.
"fs.hdfs.impl.disable.cache": "true"
The plugin looks like this:
{
"type": "hive",
"enabled": true,
"configProps": {
"hive.metastore.uris": "hdfs://10.69.50.58:9083/",
"hive.metastore.local": "false",
"hive.metastore.warehouse.dir": "/user/hive/warehouse",
"fs.hdfs.impl.disable.cache": "true"
}
}
> Fragment error on TPCH Scale Factor 30 on a query that completed successfully previously
> ----------------------------------------------------------------------------------------
>
> Key: DRILL-2767
> URL: https://issues.apache.org/jira/browse/DRILL-2767
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Hive
> Affects Versions: 0.8.0
> Environment: AWS EMR cluster of three m1.xlarge nodes
> Reporter: Alexander Zarei
> Assignee: Venki Korukanti
> Fix For: 1.2.0
>
> Attachments: drillbitcore1.log, drillbitcore1.out, drillbitcore2.log, drillbitcore2.out, drillbitmaster.out, lineitem table schema .png, second-set-core-1-drillbit.log, second-set-core-2-drillbit.log
>
>
> The following sequence led to the error:
> Executed the query
> bq. SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`
> and it took about 43 minutes to execute successfully.
> After ward I ran the query
> bq. SELECT * FROM `realhive`.`tpch_text_2`.`lineitem`
> for 6 times to find an optimization value for the ODBC driver.
> Afterward, I submitted the first query again
> bq. SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`
>
> and the Drill Cluster returned a fragment error.
> bq. ***[HY000]: [MapR][Drill] (1040) Drill failed to execute the query: SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`[30024]Query execution error. Details:[RemoteRpcException: Failure while running fragment.[ fb97e7be-d09e-46fe-8728-9577fd0d8795 on ip-10-12-62-65
> Log files with debug level for the Drillbits on the master node as well as the core nodes of the cluster are attached.
> Also the connection through the ODBC driver on Linux 32 bit was "Direct" to the drillbit on the master node of the Hadoop cluster.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)