Posted to commits@airflow.apache.org by "Shreyas Joshi (JIRA)" <ji...@apache.org> on 2016/09/08 19:02:21 UTC
[jira] [Created] (AIRFLOW-496) HiveServer2Hook invokes incorrect
Auth mechanism when user not specified
Shreyas Joshi created AIRFLOW-496:
-------------------------------------
Summary: HiveServer2Hook invokes incorrect Auth mechanism when user not specified
Key: AIRFLOW-496
URL: https://issues.apache.org/jira/browse/AIRFLOW-496
Project: Apache Airflow
Issue Type: Bug
Components: hive_hooks
Reporter: Shreyas Joshi
h3. Summary
{{HiveServer2Hook}} seems to ignore {{auth_mechanism}} when the user is not specified. I am not entirely sure whether the fix belongs in impyla or in Airflow.
h3. Reproducing the problem
With this connection string for Hive: {{AIRFLOW_CONN_GH_HIVE=hive2://@localhost:10000/}} (no user name and no password),
I get the following error from {{HiveServer2Hook}}:
{code}
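from airflow.hooks import HiveServer2Hook
hive_hook = HiveServer2Hook(hiveserver2_conn_id='GH_HIVE')
# The traceback below is raised from get_conn(), so the connection is opened here.
hive_hook.get_conn()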
{code}
{noformat}
[2016-09-08 14:30:52,420] {base_hook.py:53} INFO - Using connection to: localhost
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/shreyasjoshis/python-envs/default-env/lib/python3.5/site-packages/airflow/hooks/hive_hooks.py", line 464, in get_conn
    database=db.schema or 'default')
  File "/Users/shreyasjoshis/python-envs/default-env/lib/python3.5/site-packages/impala/dbapi.py", line 147, in connect
    auth_mechanism=auth_mechanism)
  File "/Users/shreyasjoshis/python-envs/default-env/lib/python3.5/site-packages/impala/hiveserver2.py", line 658, in connect
    transport.open()
  File "/Users/shreyasjoshis/python-envs/default-env/lib/python3.5/site-packages/thrift_sasl/__init__.py", line 72, in open
    message=("Could not start SASL: %s" % self.sasl.getError()))
thriftpy.transport.TTransportException: TTransportException(type=1, message="Could not start SASL: b'Error in sasl_client_start (-4) SASL(-4): no mechanism available: No worthy mechs found'")
{noformat}
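A possible explanation for the empty user name, sketched with the standard library only: the {{@}} in the connection string leaves an empty userinfo part, which {{urllib.parse.urlparse}} reports as an empty string rather than {{None}}. That Airflow derives the login from the URI in this way is an assumption here, not something verified against its source.

```python
from urllib.parse import urlparse

# Connection string from the report: an "@" with nothing in front of it.
with_at = urlparse("hive2://@localhost:10000/")
# Same host and port, but without the "@" (a variant added for comparison).
without_at = urlparse("hive2://localhost:10000/")

print(repr(with_at.username))     # empty userinfo is reported as ''
print(repr(without_at.username))  # missing userinfo is reported as None
print(with_at.hostname, with_at.port)
```

If the assumption holds, dropping the {{@}} from the connection string might be another way to avoid the empty user name.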
h3. More detail
[Here|https://github.com/apache/incubator-airflow/blob/master/airflow/hooks/hive_hooks.py#L591], {{db.login}} ends up being an empty string rather than {{None}}. This seems to cause impyla to try SASL. Changing {{db.login}} from an empty string to {{None}} seems to fix the issue.
So the following does not work:
{code}
from impala.dbapi import connect
connect(host='localhost', port=10000, user='', auth_mechanism='PLAIN', database='default')
{code}
The error is:
{noformat}
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/shreyasjoshis/python-envs/default-env/lib/python3.5/site-packages/impala/dbapi.py", line 147, in connect
    auth_mechanism=auth_mechanism)
  File "/Users/shreyasjoshis/python-envs/default-env/lib/python3.5/site-packages/impala/hiveserver2.py", line 658, in connect
    transport.open()
  File "/Users/shreyasjoshis/python-envs/default-env/lib/python3.5/site-packages/thrift_sasl/__init__.py", line 72, in open
    message=("Could not start SASL: %s" % self.sasl.getError()))
thriftpy.transport.TTransportException: TTransportException(type=1, message="Could not start SASL: b'Error in sasl_client_start (-4) SASL(-4): no mechanism available: No worthy mechs found'")
{noformat}
But the following does:
{code}
from impala.dbapi import connect
connect(host='localhost', port=10000, user=None, auth_mechanism='PLAIN', database='default')
{code}
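A minimal sketch of what a workaround on the Airflow side could look like, assuming the fix is to turn an empty login into {{None}} before it reaches {{connect}}. The helpers {{normalize_login}} and {{build_connect_kwargs}} are hypothetical names used for illustration; they are not part of Airflow or impyla. The keyword arguments mirror the {{connect}} calls above.

```python
def normalize_login(login):
    """Return None for a missing or empty login, else the login unchanged."""
    return login or None


def build_connect_kwargs(host, port, login, schema, auth_mechanism="PLAIN"):
    """Assemble keyword arguments for impala.dbapi.connect (hypothetical helper)."""
    return {
        "host": host,
        "port": port,
        "user": normalize_login(login),  # '' becomes None
        "auth_mechanism": auth_mechanism,
        "database": schema or "default",
    }


kwargs = build_connect_kwargs("localhost", 10000, "", None)
print(kwargs["user"])  # None

# With impyla installed and HiveServer2 listening, this could be used as:
# from impala.dbapi import connect
# conn = connect(**kwargs)
```

Whether this normalization belongs in the hook or in impyla itself is the open question of this issue.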