You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Rahul Challapalli (JIRA)" <ji...@apache.org> on 2015/06/09 20:58:00 UTC

[jira] [Created] (DRILL-3266) Drill's hive storage plugin cannot find RegexSerDe

Rahul Challapalli created DRILL-3266:
----------------------------------------

             Summary: Drill's hive storage plugin cannot find RegexSerDe
                 Key: DRILL-3266
                 URL: https://issues.apache.org/jira/browse/DRILL-3266
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Hive
            Reporter: Rahul Challapalli
            Assignee: Venki Korukanti


git.commit.id.abbrev=5f26b8b

HIve DDL :
{code}
CREATE EXTERNAL TABLE IF NOT EXISTS hive_storage.raw_access_log(
remote_host STRING,
remote_logname STRING,
remote_user STRING,
request_time STRING,
first_line STRING,
http_status STRING,
bytes STRING
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"input.regex" = "([^ ]*) ([^ ]*) ([^ ]*) (-|\\[[^\\]]*\\]) ([^ \"]*|\"[^\"]*\") (-|[0-9]*) (-|[0-9]*)",
"output.format.string" = "%1$s %2$s %3$s %4$s %5$s %6$s %7$s"
)
LOCATION '/drill/testdata/hive_storage/raw_access_log';
{code}

When I try to query this table from sqlline, I get the below error
{code}
0: jdbc:drill:schema=dfs_eea> select * from raw_access_log;
Error: PARSE ERROR: MetaException(message:java.lang.ClassNotFoundException Class org.apache.hadoop.hive.contrib.serde2.RegexSerDe not found)


[Error Id: 5961c4c4-1cf5-495c-9c8a-0616b58d2488 on qa-node190.qa.lab:31010] (state=,code=0)
{code}

The query works from hive without any issues. I attached the data set and the error log




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)