You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Michael Brown (JIRA)" <ji...@apache.org> on 2017/05/02 15:02:04 UTC

[jira] [Resolved] (IMPALA-5162) support kerberized+ssl TPC-H nested data loading

     [ https://issues.apache.org/jira/browse/IMPALA-5162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael Brown resolved IMPALA-5162.
-----------------------------------
       Resolution: Fixed
    Fix Version/s: Impala 2.9.0

{noformat}
commit 8b459dffec9e093e87da9ab6e8b2e5a9de50a7bd
Author: Michael Brown <mi...@cloudera.com>
Date:   Fri Mar 31 10:39:54 2017 -0700

    IMPALA-5162,IMPALA-5163: stress test support on secure clusters

    This patch adds support for running the stress test
    (concurrent_select.py) and loading nested data (load_nested.py) into a
    Kerberized, SSL-enabled Impala cluster. It assumes the calling user
    already has a valid Kerberos ticket. One way to do that is:

    1. Get access to a keytab and krb5.config
    2. Set KRB5_CONFIG and KRB5CCNAME appropriately
    3. Run kinit(1)
    4. Run load_nested.py and/or concurrent_select.py within this
       environment.

    Because our Python clients already support Kerberos and SSL, we simply
    need to make sure to use the correct options when calling the entry
    points and initializing the clients:

    Impala: Impyla
    Hive: Impyla
    HDFS: hdfs.ext.kerberos.KerberosClient

    With this patch, I was able to manually do a short concurrent_select.py
    run against a secure cluster without connection or auth errors, and I
    was able to do the same with load_nested.py for a cluster that already
    had TPC-H loaded.

    Follow-ons for future cleanup work:

    IMPALA-5263: support CA bundles when running stress test against SSL'd
                 Impala

    IMPALA-5264: fix InsecurePlatformWarning under stress test with SSL

    Change-Id: I0daad57bb8ceeb5071b75125f11c1997ed7e0179
    Reviewed-on: http://gerrit.cloudera.org:8080/6763
    Reviewed-by: Matthew Mulder <mm...@cloudera.com>
    Reviewed-by: Alex Behm <al...@cloudera.com>
    Tested-by: Impala Public Jenkins
{noformat}

> support kerberized+ssl TPC-H nested data loading
> ------------------------------------------------
>
>                 Key: IMPALA-5162
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5162
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Infrastructure
>    Affects Versions: Impala 2.9.0
>            Reporter: Michael Brown
>            Assignee: Michael Brown
>             Fix For: Impala 2.9.0
>
>
> {{load_nested.py}} should be enhanced to work against a kerberized+SSL Impala. {{load_nested.py}} uses the "Cluster" abstraction hierarchy in {{tests.comparison.cluster}} and the connection hierarchy in {{tests.comparison.db_connection}}. The patch will involve supporting SSL and bitrotted Kerberos test code in these abstractions.
> Testing the patch should involve both loading of nested data and regression testing data load and the following regression tests, which are needed because they also use the "Cluster" abstraction hierarchy:
>   * full data load
>   * data load with snapshot
>   * loading of random data into Impala
>   * copying data from Impala to PostgreSQL
>   * stress test run (smoke)
>   * query generator run (smoke)



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)