You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Alexander Zarei (JIRA)" <ji...@apache.org> on 2015/05/07 02:49:59 UTC

[jira] [Commented] (DRILL-2767) Fragment error on TPCH Scale Factor 30 on a query that completed successfully previously

    [ https://issues.apache.org/jira/browse/DRILL-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14531796#comment-14531796 ] 

Alexander Zarei commented on DRILL-2767:
----------------------------------------

As suggested by [~vkorukanti] The following line was added to the hive storage config and it was resolved for Scale Factor 2. I will try it for Scale Factor 100 but I think it will work.

   "fs.hdfs.impl.disable.cache": "true"

The plugin looks like this:

{
  "type": "hive",
  "enabled": true,
  "configProps": {
    "hive.metastore.uris": "hdfs://10.69.50.58:9083/",
    "hive.metastore.local": "false",
    "hive.metastore.warehouse.dir": "/user/hive/warehouse",
    "fs.hdfs.impl.disable.cache": "true"
  }
}

> Fragment error on TPCH Scale Factor 30 on a query that completed successfully previously
> ----------------------------------------------------------------------------------------
>
>                 Key: DRILL-2767
>                 URL: https://issues.apache.org/jira/browse/DRILL-2767
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Hive
>    Affects Versions: 0.8.0
>         Environment: AWS EMR cluster of three m1.xlarge nodes
>            Reporter: Alexander Zarei
>            Assignee: Venki Korukanti
>             Fix For: 1.2.0
>
>         Attachments: drillbitcore1.log, drillbitcore1.out, drillbitcore2.log, drillbitcore2.out, drillbitmaster.out, lineitem table schema .png, second-set-core-1-drillbit.log, second-set-core-2-drillbit.log
>
>
> The following sequence led to the error:
> Executed the query 
> bq. SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`
> and it took about 43 minutes to execute successfully. 
> After ward I ran the query 
> bq. SELECT * FROM `realhive`.`tpch_text_2`.`lineitem`
> for 6 times to find an optimization value for the ODBC driver. 
> Afterward, I submitted the first query again
> bq. SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`
>  
> and the Drill Cluster returned a fragment error.
> bq. ***[HY000]: [MapR][Drill] (1040) Drill failed to execute the query: SELECT * FROM `realhive`.`tpch_text_30`.`lineitem`[30024]Query execution error. Details:[RemoteRpcException: Failure while running fragment.[ fb97e7be-d09e-46fe-8728-9577fd0d8795 on ip-10-12-62-65
> Log files with debug level for the Drillbits on the master node as well as the core nodes of the cluster are attached.
> Also the connection through the ODBC driver on Linux 32 bit was "Direct" to the drillbit on the master node of the Hadoop cluster.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)