Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/01/11 17:30:01 UTC

[jira] [Work logged] (HIVE-24348) Beeline: Isolating dependencies and execution with java

     [ https://issues.apache.org/jira/browse/HIVE-24348?focusedWorklogId=534359&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-534359 ]

ASF GitHub Bot logged work on HIVE-24348:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 11/Jan/21 17:29
            Start Date: 11/Jan/21 17:29
    Worklog Time Spent: 10m 
      Work Description: nrg4878 opened a new pull request #1852:
URL: https://github.com/apache/hive/pull/1852


   ### What changes were proposed in this pull request?
               1) Use java to execute beeline if HADOOP_HOME is not set
               2) POM changes to build an uber beeline-with-dependencies jar
               3) Remove some of the unneeded jars from the beeline classpath
   
   
   ### Why are the changes needed?
   Currently beeline depends on a number of other jars, some required and some unnecessary. The beeline script also uses HADOOP_HOME to execute the beeline class, which adds further jars to the classpath and makes it hard to run beeline on an edge node. This fix is a first pass at building a standalone jar with all needed dependencies and at making beeline work using "java" when HADOOP_HOME is not set.
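
   The fallback described above could look roughly like the sketch below. It is not the script from the pull request: the function name `beeline_command`, the `BEELINE_JAR` variable and its default path are hypothetical, and the function only prints the command it would run so the choice is easy to inspect. `org.apache.hive.beeline.BeeLine` is beeline's main class.

```shell
#!/bin/sh
# Sketch only: prints the launch command instead of executing it.
# BEELINE_JAR and its default path are hypothetical, not taken from the Hive scripts.

beeline_command() {
  jar="${BEELINE_JAR:-lib/beeline-with-dependencies.jar}"
  if [ -n "${HADOOP_HOME:-}" ]; then
    # HADOOP_HOME is set: delegate to the "hadoop" executable.
    printf '%s\n' "$HADOOP_HOME/bin/hadoop jar $jar org.apache.hive.beeline.BeeLine"
  else
    # HADOOP_HOME is not set: use a plain JVM and rely on the
    # self-contained jar for the whole classpath.
    printf '%s\n' "${JAVA_HOME:+$JAVA_HOME/bin/}java -cp $jar org.apache.hive.beeline.BeeLine"
  fi
}
```

   A real launcher would run the chosen command (for example with `exec`) and pass the user's arguments through; printing it keeps the sketch side-effect free.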
   
   ### Does this PR introduce _any_ user-facing change?
   NO
   
   ### How was this patch tested?
   Manually.




Issue Time Tracking
-------------------

            Worklog Id:     (was: 534359)
    Remaining Estimate: 0h
            Time Spent: 10m

> Beeline: Isolating dependencies and execution with java
> -------------------------------------------------------
>
>                 Key: HIVE-24348
>                 URL: https://issues.apache.org/jira/browse/HIVE-24348
>             Project: Hive
>          Issue Type: Improvement
>          Components: Beeline
>    Affects Versions: 3.1.0
>            Reporter: Naveen Gangam
>            Assignee: Naveen Gangam
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Currently, beeline code, binaries and executables are somewhat tightly coupled with the hive product. Executing beeline from a node with just a JRE installed and some jars in the classpath is impossible.
> * The beeline.sh/hive scripts rely on HADOOP_HOME being set and are designed to use the "hadoop" executable to run beeline.
> * Ideally, just the hive-beeline.jar and hive-jdbc-standalone jars should be enough, but sadly they aren't. The latter jar adds more problems than it solves: because all the class files are shaded, some dependencies cannot be resolved.
> * Beeline has many other dependencies like hive-exec, hive-common, hadoop-common, supercsv, jline, commons-cli, commons-io, commons-logging etc. While it may not be possible to eliminate some of these, we should at least have a self-contained jar that contains all of them to make it work.
> * The underlying script used to run beeline should use JAVA as an alternate means of execution if HADOOP_HOME is not set.


