You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Venki Korukanti (JIRA)" <ji...@apache.org> on 2015/06/10 20:16:01 UTC

[jira] [Commented] (DRILL-3266) Drill's hive storage plugin cannot find RegexSerDe

    [ https://issues.apache.org/jira/browse/DRILL-3266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14580902#comment-14580902 ] 

Venki Korukanti commented on DRILL-3266:
----------------------------------------

This particular SerDe is part of the hive-contrib.jar which we don't package with Drill. Fix would be to fetch this jar as part of the hive storage plugin, so that SerDes shipped with Hive are by default available in Drill.

> Drill's hive storage plugin cannot find RegexSerDe
> --------------------------------------------------
>
>                 Key: DRILL-3266
>                 URL: https://issues.apache.org/jira/browse/DRILL-3266
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Hive
>            Reporter: Rahul Challapalli
>            Assignee: Venki Korukanti
>         Attachments: apache_log.txt, error.log
>
>
> git.commit.id.abbrev=5f26b8b
> HIve DDL :
> {code}
> CREATE EXTERNAL TABLE IF NOT EXISTS hive_storage.raw_access_log(
> remote_host STRING,
> remote_logname STRING,
> remote_user STRING,
> request_time STRING,
> first_line STRING,
> http_status STRING,
> bytes STRING
> )
> ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
> WITH SERDEPROPERTIES (
> "input.regex" = "([^ ]*) ([^ ]*) ([^ ]*) (-|\\[[^\\]]*\\]) ([^ \"]*|\"[^\"]*\") (-|[0-9]*) (-|[0-9]*)",
> "output.format.string" = "%1$s %2$s %3$s %4$s %5$s %6$s %7$s"
> )
> LOCATION '/drill/testdata/hive_storage/raw_access_log';
> {code}
> When I try to query this table from sqlline, I get the below error
> {code}
> 0: jdbc:drill:schema=dfs_eea> select * from raw_access_log;
> Error: PARSE ERROR: MetaException(message:java.lang.ClassNotFoundException Class org.apache.hadoop.hive.contrib.serde2.RegexSerDe not found)
> [Error Id: 5961c4c4-1cf5-495c-9c8a-0616b58d2488 on qa-node190.qa.lab:31010] (state=,code=0)
> {code}
> The query works from hive without any issues. I attached the data set and the error log



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)