Posted to issues@drill.apache.org by "Pritesh Maker (JIRA)" <ji...@apache.org> on 2018/05/22 18:28:00 UTC

[jira] [Updated] (DRILL-6263) Improvements to DoY initial experience

     [ https://issues.apache.org/jira/browse/DRILL-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pritesh Maker updated DRILL-6263:
---------------------------------
    Fix Version/s:     (was: 1.14.0)

> Improvements to DoY initial experience
> --------------------------------------
>
>                 Key: DRILL-6263
>                 URL: https://issues.apache.org/jira/browse/DRILL-6263
>             Project: Apache Drill
>          Issue Type: Improvement
>    Affects Versions: 1.13.0
>            Reporter: Paul Rogers
>            Priority: Minor
>
> As part of the Drill 1.13 release process, I tested out DoY after a year of not having used it. That time gap pointed out some improvements for first-time users.
> * Copy the [USAGE.md|https://github.com/apache/drill/blob/master/drill-yarn/USAGE.md] file into the Drill home directory with the name "DRILL_YARN_USAGE.md".
> * Change the {{drill-on-yarn-example.conf}} file to be a valid file for the default Drill and YARN configurations.
> {noformat}
>     heap: "2G"
>     max-direct-memory: "2G"
>    memory-mb: 5125
> {noformat}
> * Change the {{drill-on-yarn-example.conf}} to disable SSL by default. Just comment out the following line:
> {noformat}
>     #ssl-enabled: true
> {noformat}
> * Change the {{drill-on-yarn-example.conf}} to disable authentication by default. That is, comment out the following line:
> {noformat}
>     #auth-type: "drill"
> {noformat}
> * Change the {{drill-on-yarn-example.conf}} to use no AM node labels by default. That is, comment out the following line:
> {noformat}
>     #node-label-expr: "drill-am"
> {noformat}
> Failure to comment out this line results in the following error:
> {noformat}
> Failed to start Drill application master
>   Caused by: Submit application failed
>   Caused by: Invalid resource request, node label not enabled but request contains label expression
> {noformat}
> Also, add this to the Troubleshooting section in {{USAGE.md}}.
> * Change {{DrillOnYarnConfig.findSuffix}} to allow the {{.tar}} suffix. This is what one ends up with if the Mac does its automatic extract. A tar file is larger than the compressed version, but there is no reason it should not be allowed (assuming YARN supports it).
> * Otherwise, change {{DrillOnYarnConfig.getRemoteDrillHome()}}, where we emit the error "does not name a valid archive", to differentiate between no suffix and an unsupported suffix. (I got the following error and had to look at the source to figure out what I'd done wrong.)
> {noformat}
> drill.yarn.drill-install.client-path does not name a valid archive: /Users/paulrogers/bin/apache-drill-1.13.0.tar
> {noformat}
> * Change the newly-added error reporting code in {{DrillOnYarn.displayError}} to omit the exception cause if it just repeats the main error message. Here is the full error message from above; the second line is redundant:
> {noformat}
> drill.yarn.drill-install.client-path does not name a valid archive: /Users/paulrogers/bin/apache-drill-1.13.0.tar
>   Caused by: drill.yarn.drill-install.client-path does not name a valid archive: /Users/paulrogers/bin/apache-drill-1.13.0.tar
> {noformat}
> * Add to [USAGE.md|https://github.com/apache/drill/blob/master/drill-yarn/USAGE.md] pointers for how to set up a basic HDFS, ZK and YARN configuration. Mostly just state what is to be done and point to the [relevant Hadoop docs|https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/SingleCluster.html], "Pseudo-Distributed Operation". In particular, we want to create an actual HDFS file system, not use the default of local file system.
> * Add to {{USAGE.md}} a description of the supported YARN (actually Hadoop) versions. Feature was developed with 2.7.1. Currently verifying with 2.9.0. Probably needs to be rechecked on the 3.x series.
> * Add to {{USAGE.md}} the fact that Drill is built with, and includes the jars for, Hadoop 2.7.1. It is not clear what version compatibility Hadoop has; are these jars compatible with the latest 2.x series Hadoop? With Hadoop 3.x?
> * Until DRILL-6268 is fixed, explain that the HDFS configuration *must* use port 8020. Also, add this to the Troubleshooting section in {{USAGE.md}}.
> * Add to {{USAGE.md}}, Troubleshooting: if configuration issues cause Drill to fail to start, then Drill-on-YARN will blacklist each node after several tries. Unfortunately, the YARN UI appears to not provide access to the logs for failed application containers. So, to track down the failure, look for the container logs in YARN. In the default single-node install, they are in {{$HADOOP_HOME/logs/userlogs/application_xxx/container_xx_00000y}} where y=1 is the AM, y>1 are the Drillbit containers.
> * Change {{USAGE.md}} to change the following line:
> {noformat}
> cp $DRILL_HOME/conf/drill-override-example.conf $DRILL_SITE/drill-override.conf
> {noformat}
> To the following:
> {noformat}
> cp $DRILL_HOME/conf/drill-override.conf $DRILL_SITE
> {noformat}
> Without this change, Drill will fail to start and you'll see the following in the {{drillbit.log}} file in the YARN container log directory:
> {noformat}
> 2018-03-17 16:11:25,293 [main] ERROR o.a.d.e.r.u.s.PamUserAuthenticator - Problem in finding the native library of JPAM (Pluggable Authenticator Module API). Make sure to set Drillbit JVM option 'java.library.path' to point to the directory where the native JPAM exists.
> java.lang.UnsatisfiedLinkError: no jpam in java.library.path
> {noformat}
> None of these are show stoppers; each is instead just a bit of sand in the gears that makes progress slower than it need be.
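
The two code-level suggestions above (accepting the .tar suffix and reporting "no suffix" separately from "unsupported suffix", plus dropping a cause that only repeats the main message) could look roughly like the sketch below. This is an illustration only, not Drill's actual implementation: the class name, the describeProblem and formatError helpers, and the list of accepted suffixes are all assumptions made for the example. Only the method name findSuffix and the ".tar" addition come from the issue text.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; names and the suffix list are assumptions, not Drill code.
class ArchiveSuffixSketch {

    // Assumed accepted suffixes. ".tar" is the addition proposed in the issue.
    static final List<String> SUPPORTED =
            Arrays.asList(".tar.gz", ".tgz", ".zip", ".tar");

    // Returns the supported suffix that the path ends with, or null if none matches.
    static String findSuffix(String path) {
        String lower = path.toLowerCase();
        for (String suffix : SUPPORTED) {
            if (lower.endsWith(suffix)) {
                return suffix;
            }
        }
        return null;
    }

    // Builds an error message that tells "no suffix" apart from "unsupported suffix".
    // An extension made only of digits (as in "apache-drill-1.13.0") is treated
    // as part of a version number, not as a file suffix.
    static String describeProblem(String key, String path) {
        String name = path.substring(path.lastIndexOf('/') + 1);
        int dot = name.lastIndexOf('.');
        String ext = dot < 0 ? "" : name.substring(dot + 1);
        boolean looksLikeSuffix = ext.chars().anyMatch(Character::isLetter);
        if (!looksLikeSuffix) {
            return key + " has no archive suffix: " + path;
        }
        return key + " has unsupported archive suffix ." + ext + ": " + path;
    }

    // Formats an exception chain, skipping any cause whose message merely
    // repeats the message printed just before it.
    static String formatError(Throwable e) {
        String previous = String.valueOf(e.getMessage());
        StringBuilder buf = new StringBuilder(previous);
        for (Throwable cause = e.getCause(); cause != null; cause = cause.getCause()) {
            String msg = cause.getMessage();
            if (msg != null && !msg.equals(previous)) {
                buf.append("\n  Caused by: ").append(msg);
                previous = msg;
            }
        }
        return buf.toString();
    }

    public static void main(String[] args) {
        String key = "drill.yarn.drill-install.client-path";
        String path = "/Users/paulrogers/bin/apache-drill-1.13.0.tar";
        System.out.println("suffix: " + findSuffix(path));
        System.out.println(describeProblem(key, "/Users/paulrogers/bin/apache-drill-1.13.0"));
        String message = key + " does not name a valid archive: " + path;
        System.out.println(formatError(new RuntimeException(message, new RuntimeException(message))));
    }
}
```

The suffix list is checked longest-first so that ".tar.gz" is reported in full rather than as ".gz". The cause filter compares each message only with the one printed just before it, so a distinct cause further down the chain is still shown.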



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)