You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@drill.apache.org by br...@apache.org on 2017/02/24 20:54:06 UTC

drill git commit: updates to Tableau 9 desktop doc

Repository: drill
Updated Branches:
  refs/heads/gh-pages 2d250f9db -> 2d27fe5ed


updates to Tableau 9 desktop doc


Project: http://git-wip-us.apache.org/repos/asf/drill/repo
Commit: http://git-wip-us.apache.org/repos/asf/drill/commit/2d27fe5e
Tree: http://git-wip-us.apache.org/repos/asf/drill/tree/2d27fe5e
Diff: http://git-wip-us.apache.org/repos/asf/drill/diff/2d27fe5e

Branch: refs/heads/gh-pages
Commit: 2d27fe5ed6f89d9b89b0ae5ad47d377b2784abf6
Parents: 2d250f9
Author: Bridget Bevens <bb...@maprtech.com>
Authored: Fri Feb 24 12:52:52 2017 -0800
Committer: Bridget Bevens <bb...@maprtech.com>
Committed: Fri Feb 24 12:52:52 2017 -0800

----------------------------------------------------------------------
 ...using-apache-drill-with-tableau-9-desktop.md | 99 ++++++++++++++++++++
 ...using-apache-drill-with-tableau-9-desktop.md | 87 -----------------
 .../020-installing-the-tdc-file-on-windows.md   | 18 ----
 3 files changed, 99 insertions(+), 105 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/drill/blob/2d27fe5e/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/060-using-apache-drill-with-tableau-9-desktop.md
----------------------------------------------------------------------
diff --git a/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/060-using-apache-drill-with-tableau-9-desktop.md b/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/060-using-apache-drill-with-tableau-9-desktop.md
new file mode 100644
index 0000000..5751768
--- /dev/null
+++ b/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/060-using-apache-drill-with-tableau-9-desktop.md
@@ -0,0 +1,99 @@
+---
+title: "Using Apache Drill with Tableau 9 Desktop"
+date:  
+parent: "Using Drill with BI Tools"
+---
+This document describes how to connect Tableau 9 Desktop to Apache Drill and explore multiple data formats instantly on Hadoop. Use the combined power of these tools to get direct access to semi-structured data, without having to rely on IT teams for schema creation.
+
+To use Apache Drill with Tableau 9 Desktop, complete the following steps: 
+
+1. Install the Drill ODBC driver from MapR.
+2. Install the Tableau Data-connection Customization (TDC) file.
+3. Connect Tableau to Drill Using ODBC.
+4. Query and analyze various data formats with Tableau and Drill.
+
+----------
+
+### Step 1: Install and Configure the MapR Drill ODBC Driver 
+
+Drill uses standard ODBC connectivity to provide easy data-exploration capabilities on complex, schema-less data sets. For the best experience use the latest release of Apache Drill. For Tableau 9.0 Desktop, Drill Version 0.9 or higher is recommended.
+
+Complete the following steps to install and configure the driver:
+
+1. Download the 64-bit MapR Drill ODBC Driver for Windows from the following location:<br> [http://package.mapr.com/tools/MapR-ODBC/MapR_Drill/](http://package.mapr.com/tools/MapR-ODBC/MapR_Drill/)     
+**Note:** Tableau 9.0 Desktop 64 bit can use either the 32-bit driver or the 64-bit driver.
+2. [Install the ODBC driver on Windows]({{site.baseurl}}/docs/installing-the-driver-on-windows/).
+3. [Configure the driver]({{site.baseurl}}/docs/configuring-odbc-on-windows/).
+4. If Drill authentication is enabled, select **Basic Authentication** as the authentication type. Enter a valid user and password. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-odbc-setup.png)
+
+Note: If you select **ZooKeeper Quorum** as the ODBC connection type, the client system must be able to resolve the hostnames of the ZooKeeper nodes. The simplest way is to add the hostnames and IP addresses for the ZooKeeper nodes to the `%WINDIR%\system32\drivers\etc\hosts` file. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-odbc-setup-2.png)
+
+Also make sure to test the ODBC connection to Drill before using it with Tableau.
+
+
+----------
+
+### Step 2: Install the Tableau Data-connection Customization (TDC) File
+
+The MapR Drill ODBC Driver includes a file named `MapRDrillODBC.TDC`. The TDC file includes customizations that improve ODBC configuration and performance when using Tableau. The MapR Drill ODBC driver installer automatically installs the TDC file if the installer can find the Tableau installation. If you installed the MapR Drill ODBC driver first and then installed Tableau, the TDC file is not installed automatically. You must install the TDC file manually. 
+
+**To install the MapRDrillODBC.TDC file manually:**
+
+1. Click **Start**, and locate the Install Tableau TDC File app that the MapR Drill ODBC Driver installer installed.   
+   For example, on Windows 8.1 in Apps, the Install Tableau TDC File appears under MaprDrill ODBC Driver:
+   ![]({{ site.baseurl }}/docs/img/odbc-mapr-drill-apps.png)
+
+2. Click **Install Tableau TDC File**. 
+3. When the installation completes, press any key to continue.   
+For example, you can press the SPACEBAR key.
+
+If the installation of the TDC file fails, this is likely due to your Tableau repository being in location other than the default one.  In this case, manually copy the My Tableau Repository to C:\Users\<user>\Documents\My Tableau Repository. Repeat the procedure to install the MapRDrillODBC.TDC file manually.
+
+----------
+
+
+### Step 3: Connect Tableau to Drill via ODBC
+Complete the following steps to configure an ODBC data connection: 
+
+To connect Tableau to Drill use the following steps:
+
+1.	In a Tableau Workbook click **Data > New Data Source**.![drill query flow]({{ site.baseurl }}/docs/img/new-data-source.png)
+2.	In the **Connect** list, select **Other Databases (ODBC)**. ![drill query flow]({{ site.baseurl }}/docs/img/connect-list.png)
+3.	On the Server Connection window, select the DSN configured in Step 1 from the drop-down list of ODBC data sources. Then click **Connect**. Note: You will be prompted to enter a username and password; these entries will be passed to the Server Connection window. 
+![drill query flow]({{ site.baseurl }}/docs/img/other-dbs.png) ![drill query flow]({{ site.baseurl }}/docs/img/other-dbs-2.png)
+Tableau is now connected to Drill, and you can select various tables and views. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-schemas.png)
+4.	Click the **Schema** drop-down list and choose a schema by clicking the search icon: all available Drill schemas will be displayed. When you have selected the schema, click the search icon in the Table dialog box: all available tables or views will be displayed. 
+
+You can select tables and views to build a Tableau Visualization. You can also use custom SQL by clicking the **New Custom SQL** option. 
+
+Tableau can natively work with Hive tables and Drill views. For Drill data sources, including directly accessible  file systems or HBase/MapR-DB tables, you can either use the Custom SQL option, or you can create a view in Drill to easily represent the complex data to Tableau. For more information, see the following links:
+http://drill.apache.org/docs/step-3-connect-to-drill-data-sources-from-a-bi-tool/
+http://drill.apache.org/docs/tableau-examples/
+
+Note: If Drill authentication and impersonation is enabled, only the views that the user has access to will be displayed in the Table dialog box. Also, if custom SQL is being used to try and access data sources that the user does not have access to, an error message will be displayed. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-error.png)
+
+----------
+
+### Step 4: Query and Analyze the Data 
+
+Tableau Desktop can now use Drill to query various data sources and visualize the information.
+
+Assume that a retailer has weblog data stored as JSON files in a nested directory structure and product information stored in a Hive table. Using Drill and Tableau, users would like to understand the volume of product sold by state and product category.
+
+1.	Using the New Custom SQL function in Tableau, write a Drill query to read the JSON files without the need for ETL. Casting data types is recommended when you are working directly with files. 
+For example: ![drill query flow]({{ site.baseurl }}/docs/img/edit-custom-sql.png)
+
+2.	Next select the Hive products table: ![drill query flow]({{ site.baseurl }}/docs/img/custom-sql-query.png)
+
+3.	Verify that Tableau is joining the two data sources (JSON files and Hive table) on the prod_id key:![drill query flow]({{ site.baseurl }}/docs/img/tableau-join-key.png)
+The data sources are now configured and ready to be used in the visualization.
+4.	Drag State from the Custom SQL (JSON files) to **Columns**, and drag Category from the Hive products table to **Rows**. 
+5.	Create a calculated field called `Total Number of Products` and enter `count(prod_id)` for the `prod_id` field from the weblog files (Custom SQL), then drag it to **Rows** next to Category. The visualization will now show the total products by category and state. 
+6.	To filter out weblog data where products were not bought, drag the `purch_flag` field from the weblog files to **Filters**. Select only data where the `purch_flag` was true.
+7.	Finally, order the data from the state with the most products sold to the one with the least.
+8.	Add a grand total row by clicking **Analysis > Totals > Show Column Grand Totals**. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-desktop-query.png)
+
+----------
+
+In this quick tutorial, you saw how you can configure Tableau Desktop 9.0 to work with Apache Drill. 
+

http://git-wip-us.apache.org/repos/asf/drill/blob/2d27fe5e/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/010-using-apache-drill-with-tableau-9-desktop.md
----------------------------------------------------------------------
diff --git a/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/010-using-apache-drill-with-tableau-9-desktop.md b/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/010-using-apache-drill-with-tableau-9-desktop.md
deleted file mode 100644
index ce26d1d..0000000
--- a/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/010-using-apache-drill-with-tableau-9-desktop.md
+++ /dev/null
@@ -1,87 +0,0 @@
----
-title: "Using Apache Drill with Tableau 9 Desktop"
-date:  
-parent: "Using Drill with BI Tools"
----
-This document describes how to connect Tableau 9 Desktop to Apache Drill and explore multiple data formats instantly on Hadoop. Use the combined power of these tools to get direct access to semi-structured data, without having to rely on IT teams for schema creation.
-
-To use Apache Drill with Tableau 9 Desktop, complete the following steps: 
-
-1. Install the Drill ODBC driver from MapR.
-2. Install the Tableau Data-connection Customization (TDC) file.
-3. Connect Tableau to Drill Using ODBC.
-4. Query and analyze various data formats with Tableau and Drill.
-
-----------
-
-### Step 1: Install and Configure the MapR Drill ODBC Driver 
-
-Drill uses standard ODBC connectivity to provide easy data-exploration capabilities on complex, schema-less data sets. For the best experience use the latest release of Apache Drill. For Tableau 9.0 Desktop, Drill Version 0.9 or higher is recommended.
-
-Complete the following steps to install and configure the driver:
-
-1. Download the 64-bit MapR Drill ODBC Driver for Windows from the following location:<br> [http://package.mapr.com/tools/MapR-ODBC/MapR_Drill/](http://package.mapr.com/tools/MapR-ODBC/MapR_Drill/)     
-**Note:** Tableau 9.0 Desktop 64 bit can use either the 32-bit driver or the 64-bit driver.
-2. [Install the ODBC driver on Windows]({{site.baseurl}}/docs/installing-the-driver-on-windows/).
-3. [Configure the driver]({{site.baseurl}}/docs/configuring-odbc-on-windows/).
-4. If Drill authentication is enabled, select **Basic Authentication** as the authentication type. Enter a valid user and password. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-odbc-setup.png)
-
-Note: If you select **ZooKeeper Quorum** as the ODBC connection type, the client system must be able to resolve the hostnames of the ZooKeeper nodes. The simplest way is to add the hostnames and IP addresses for the ZooKeeper nodes to the `%WINDIR%\system32\drivers\etc\hosts` file. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-odbc-setup-2.png)
-
-Also make sure to test the ODBC connection to Drill before using it with Tableau.
-
-
-----------
-
-### Step 2: Install the Tableau Data-connection Customization (TDC) File
-
-The MapR Drill ODBC Driver includes a file named `MapRDrillODBC.TDC`. The TDC file includes customizations that improve ODBC configuration and performance when using Tableau. The MapR Drill ODBC Driver installer automatically installs the TDC file if the installer can find the Tableau installation. If you installed the MapR Drill ODBC Driver first and then installed Tableau, the TDC file is not installed automatically, and you need to [install the TDC file manually]({{site.baseurl}}/docs/installing-the-tdc-file-on-windows/). 
-
-----------
-
-
-### Step 3: Connect Tableau to Drill via ODBC
-Complete the following steps to configure an ODBC data connection: 
-
-To connect Tableau to Drill use the following steps:
-
-1.	In a Tableau Workbook click **Data > New Data Source**.![drill query flow]({{ site.baseurl }}/docs/img/new-data-source.png)
-2.	In the **Connect** list, select **Other Databases (ODBC)**. ![drill query flow]({{ site.baseurl }}/docs/img/connect-list.png)
-3.	On the Server Connection window, select the DSN configured in Step 1 from the drop-down list of ODBC data sources. Then click **Connect**. Note: You will be prompted to enter a username and password; these entries will be passed to the Server Connection window. 
-![drill query flow]({{ site.baseurl }}/docs/img/other-dbs.png) ![drill query flow]({{ site.baseurl }}/docs/img/other-dbs-2.png)
-Tableau is now connected to Drill, and you can select various tables and views. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-schemas.png)
-4.	Click the **Schema** drop-down list and choose a schema by clicking the search icon: all available Drill schemas will be displayed. When you have selected the schema, click the search icon in the Table dialog box: all available tables or views will be displayed. 
-
-You can select tables and views to build a Tableau Visualization. You can also use custom SQL by clicking the **New Custom SQL** option. 
-
-Tableau can natively work with Hive tables and Drill views. For Drill data sources, including directly accessible  file systems or HBase/MapR-DB tables, you can either use the Custom SQL option, or you can create a view in Drill to easily represent the complex data to Tableau. For more information, see the following links:
-http://drill.apache.org/docs/step-3-connect-to-drill-data-sources-from-a-bi-tool/
-http://drill.apache.org/docs/tableau-examples/
-
-Note: If Drill authentication and impersonation is enabled, only the views that the user has access to will be displayed in the Table dialog box. Also, if custom SQL is being used to try and access data sources that the user does not have access to, an error message will be displayed. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-error.png)
-
-----------
-
-### Step 4: Query and Analyze the Data 
-
-Tableau Desktop can now use Drill to query various data sources and visualize the information.
-
-Assume that a retailer has weblog data stored as JSON files in a nested directory structure and product information stored in a Hive table. Using Drill and Tableau, users would like to understand the volume of product sold by state and product category.
-
-1.	Using the New Custom SQL function in Tableau, write a Drill query to read the JSON files without the need for ETL. Casting data types is recommended when you are working directly with files. 
-For example: ![drill query flow]({{ site.baseurl }}/docs/img/edit-custom-sql.png)
-
-2.	Next select the Hive products table: ![drill query flow]({{ site.baseurl }}/docs/img/custom-sql-query.png)
-
-3.	Verify that Tableau is joining the two data sources (JSON files and Hive table) on the prod_id key:![drill query flow]({{ site.baseurl }}/docs/img/tableau-join-key.png)
-The data sources are now configured and ready to be used in the visualization.
-4.	Drag State from the Custom SQL (JSON files) to **Columns**, and drag Category from the Hive products table to **Rows**. 
-5.	Create a calculated field called `Total Number of Products` and enter `count(prod_id)` for the `prod_id` field from the weblog files (Custom SQL), then drag it to **Rows** next to Category. The visualization will now show the total products by category and state. 
-6.	To filter out weblog data where products were not bought, drag the `purch_flag` field from the weblog files to **Filters**. Select only data where the `purch_flag` was true.
-7.	Finally, order the data from the state with the most products sold to the one with the least.
-8.	Add a grand total row by clicking **Analysis > Totals > Show Column Grand Totals**. ![drill query flow]({{ site.baseurl }}/docs/img/tableau-desktop-query.png)
-
-----------
-
-In this quick tutorial, you saw how you can configure Tableau Desktop 9.0 to work with Apache Drill. 
-

http://git-wip-us.apache.org/repos/asf/drill/blob/2d27fe5e/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/020-installing-the-tdc-file-on-windows.md
----------------------------------------------------------------------
diff --git a/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/020-installing-the-tdc-file-on-windows.md b/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/020-installing-the-tdc-file-on-windows.md
deleted file mode 100644
index ef48f67..0000000
--- a/_docs/odbc-jdbc-interfaces/using-drill-with-bi-tools/using-apache-drill-with-tableau-9-desktop/020-installing-the-tdc-file-on-windows.md
+++ /dev/null
@@ -1,18 +0,0 @@
----
-title: "Installing the TDC File on Windows"
-date:  
-parent: "Installing the ODBC Driver"
----
-The MapR Drill ODBC driver installer automatically installs the TDC file if the installer can find the Tableau installation. If you installed the MapR Drill ODBC driver first and then installed Tableau, the TDC file is not installed automatically, and you need to install the TDC file manually. 
-
-**To install the MapRDrillODBC.TDC file manually:**
-
-1. Click **Start**, and locate the Install Tableau TDC File app that the MapR Drill ODBC Driver installer installed.   
-   For example, on Windows 8.1 in Apps, the Install Tableau TDC File appears under MaprDrill ODBC Driver:
-   ![]({{ site.baseurl }}/docs/img/odbc-mapr-drill-apps.png)
-
-2. Click **Install Tableau TDC File**. 
-3. When the installation completes, press any key to continue.   
-For example, you can press the SPACEBAR key.
-
-If the installation of the TDC file fails, this is likely due to your Tableau repository being in location other than the default one.  In this case, manually copy the My Tableau Repository to C:\Users\<user>\Documents\My Tableau Repository. Repeat the procedure to install the MapRDrillODBC.TDC file manually.