Posted to dev@hive.apache.org by "Raghotham Murthy (JIRA)" <ji...@apache.org> on 2009/06/03 20:14:07 UTC

[jira] Created: (HIVE-538) make hive_jdbc.jar self-containing

make hive_jdbc.jar self-containing
----------------------------------

                 Key: HIVE-538
                 URL: https://issues.apache.org/jira/browse/HIVE-538
             Project: Hadoop Hive
          Issue Type: Improvement
          Components: Clients
    Affects Versions: 0.2.0, 0.3.0, 0.3.1, 0.4.0
            Reporter: Raghotham Murthy


Currently, the following additional jars are required.

# from hive/build/dist/lib
#     commons-logging-*.jar
#     hive_jdbc.jar
#     hive_metastore.jar
#     hive_serde.jar
#     hive_service.jar
#     libfb303.jar
#     libthrift.jar
#
# from hadoop/build
#     hadoop-*-core.jar

Some of the dependencies, such as commons-logging and hadoop-*-core.jar, can be removed. Others can be rolled into a single jar.
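
One way to see which of these jars is still missing from an application's classpath is to probe for one representative class per jar. The sketch below is only an illustration and not part of Hive: the helper class ClasspathProbe is hypothetical, and the class names paired with each jar are assumptions that may differ between Hive, Thrift, and Hadoop versions.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration; not part of Hive.
class ClasspathProbe {

    // Returns the labels (jar names) whose representative class cannot be loaded.
    static List<String> findMissing(Map<String, String> jarToClass) {
        List<String> missing = new ArrayList<>();
        ClassLoader loader = ClasspathProbe.class.getClassLoader();
        for (Map.Entry<String, String> probe : jarToClass.entrySet()) {
            try {
                // initialize=false: only locate the class, do not run its static initializers.
                Class.forName(probe.getValue(), false, loader);
            } catch (ClassNotFoundException | LinkageError ex) {
                missing.add(probe.getKey());
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        Map<String, String> probes = new LinkedHashMap<>();
        // ASSUMPTION: these class names are guesses at one class per jar
        // and may not match the jars shipped with a given release.
        probes.put("commons-logging-*.jar", "org.apache.commons.logging.LogFactory");
        probes.put("hive_jdbc.jar", "org.apache.hadoop.hive.jdbc.HiveDriver");
        probes.put("libfb303.jar", "com.facebook.fb303.FacebookService");
        probes.put("libthrift.jar", "org.apache.thrift.TException");
        probes.put("hadoop-*-core.jar", "org.apache.hadoop.conf.Configuration");
        System.out.println("Missing from classpath: " + findMissing(probes));
    }
}
```

Running it with the same classpath as the JDBC application prints the jars that still need to be added.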

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-538) make hive_jdbc.jar self-containing

Posted by "Raghotham Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Raghotham Murthy updated HIVE-538:
----------------------------------

    Description: 
Currently, most jars in hive/build/dist/lib and the hadoop-*-core.jar are required in the classpath to run JDBC applications on Hive. We need to do at least the following to get rid of most of the unnecessary dependencies:
1. Get rid of the dynamic serde and use a standard serialization format, such as tab-separated text, JSON, or Avro.
2. Don't use Hadoop configuration parameters.
3. Repackage the thrift and fb303 classes into hive_jdbc.jar.

  was:
Currently, the following additional jars are required.

* from hive/build/dist/lib
-     commons-logging-*.jar
-     hive_jdbc.jar
-     hive_metastore.jar
-     hive_serde.jar
-     hive_service.jar
-     libfb303.jar
-     libthrift.jar
* from hadoop/build
-     hadoop-*-core.jar

Some of the dependencies, such as commons-logging and hadoop-*-core.jar, can be removed. Others can be rolled into a single jar.





[jira] Commented: (HIVE-538) make hive_jdbc.jar self-containing

Posted by "Bill Graham (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12896746#action_12896746 ] 

Bill Graham commented on HIVE-538:
----------------------------------

From a purely empirical approach, it appears that the following jars are currently required to use the Hive JDBC driver (version 0.7.0):

{noformat}
dist/lib/hive-exec-0.7.0.jar
dist/lib/hive-jdbc-0.7.0.jar
dist/lib/hive-metastore-0.7.0.jar
dist/lib/hive-service-0.7.0.jar
dist/lib/libfb303.jar
dist/lib/libthrift.jar

hadoop-core-{version}.jar
{noformat}

I propose modifying the build process to combine the classes from the first set of jars into a single jar. That way, users only need to add the hadoop-core jar and the hive-jdbc-combined jar to their classpath. As other dependencies are removed or refactored away, we could thin out what goes into the combined jar.
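
The combining step proposed above can be sketched with the JDK's own java.util.jar classes. This is a minimal illustration under stated assumptions, not the Hive build: the class JarCombiner and its combine method are hypothetical, the first occurrence of a duplicate entry name wins, and everything under META-INF/ is skipped (which also drops any service registration files). It needs Java 9 or later for InputStream.transferTo.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.jar.JarEntry;
import java.util.jar.JarInputStream;
import java.util.jar.JarOutputStream;

// Hypothetical helper for illustration; not the Hive build.
class JarCombiner {

    // Copies the entries of all input jars into one output jar and
    // returns the entry names that were written, in write order.
    static Set<String> combine(List<Path> inputs, Path output) throws IOException {
        Set<String> written = new LinkedHashSet<>();
        try (JarOutputStream out = new JarOutputStream(Files.newOutputStream(output))) {
            for (Path input : inputs) {
                try (JarInputStream in = new JarInputStream(Files.newInputStream(input))) {
                    JarEntry entry;
                    while ((entry = in.getNextJarEntry()) != null) {
                        String name = entry.getName();
                        // Skip per-jar metadata and any name already copied from an earlier jar.
                        if (name.startsWith("META-INF/") || !written.add(name)) {
                            continue;
                        }
                        out.putNextEntry(new JarEntry(name));
                        in.transferTo(out); // reads to the end of the current entry only
                        out.closeEntry();
                    }
                }
            }
        }
        return written;
    }

    // Usage: java JarCombiner.java <output.jar> <input1.jar> [<input2.jar> ...]
    public static void main(String[] args) throws IOException {
        List<Path> inputs = new ArrayList<>();
        for (int i = 1; i < args.length; i++) {
            inputs.add(Paths.get(args[i]));
        }
        Set<String> names = combine(inputs, Paths.get(args[0]));
        System.out.println("Wrote " + names.size() + " entries to " + args[0]);
    }
}
```

In practice a change like this would more likely be expressed in the project's build scripts than in standalone Java; the sketch only shows what "combining the classes into one jar" amounts to.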

I can take on this JIRA if others agree with the approach.




> make hive_jdbc.jar self-containing
> ----------------------------------
>
>                 Key: HIVE-538
>                 URL: https://issues.apache.org/jira/browse/HIVE-538
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Clients
>    Affects Versions: 0.3.0, 0.3.1, 0.4.0, 0.6.0
>            Reporter: Raghotham Murthy
>
> Currently, most jars in hive/build/dist/lib and the hadoop-*-core.jar are required in the classpath to run JDBC applications on Hive. We need to do at least the following to get rid of most of the unnecessary dependencies:
> 1. Get rid of the dynamic serde and use a standard serialization format, such as tab-separated text, JSON, or Avro.
> 2. Don't use Hadoop configuration parameters.
> 3. Repackage the thrift and fb303 classes into hive_jdbc.jar.



[jira] Updated: (HIVE-538) make hive_jdbc.jar self-containing

Posted by "Raghotham Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Raghotham Murthy updated HIVE-538:
----------------------------------

    Description: 
Currently, the following additional jars are required.

* from hive/build/dist/lib
-     commons-logging-*.jar
-     hive_jdbc.jar
-     hive_metastore.jar
-     hive_serde.jar
-     hive_service.jar
-     libfb303.jar
-     libthrift.jar
* from hadoop/build
-     hadoop-*-core.jar

Some of the dependencies, such as commons-logging and hadoop-*-core.jar, can be removed. Others can be rolled into a single jar.

  was:
Currently, the following additional jars are required.

# from hive/build/dist/lib
#     commons-logging-*.jar
#     hive_jdbc.jar
#     hive_metastore.jar
#     hive_serde.jar
#     hive_service.jar
#     libfb303.jar
#     libthrift.jar
#
# from hadoop/build
#     hadoop-*-core.jar

Some of the dependencies, such as commons-logging and hadoop-*-core.jar, can be removed. Others can be rolled into a single jar.


