Posted to dev@hive.apache.org by "Edward Capriolo (JIRA)" <ji...@apache.org> on 2012/07/06 17:49:34 UTC

[jira] [Commented] (HIVE-3227) Implement data loading from user provided string directly for test

    [ https://issues.apache.org/jira/browse/HIVE-3227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13408086#comment-13408086 ] 

Edward Capriolo commented on HIVE-3227:
---------------------------------------

@Navis This is a good idea, but we have to be very careful about the features we add to the language. We would also have to cover cases such as overwritten files.

Your idea though prompted me to write:
https://issues.apache.org/jira/browse/HIVE-3238

I think a user-space solution is a better answer for this problem. We can still consider adding this feature, but I think 3238 is a little safer.

You should hang out on the Hive IRC channel so we can discuss this more. 3238 is a bit more verbose and will not speed up unit testing the way you mentioned, but I like the approach better.
                
> Implement data loading from user provided string directly for test
> ------------------------------------------------------------------
>
>                 Key: HIVE-3227
>                 URL: https://issues.apache.org/jira/browse/HIVE-3227
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor, Testing Infrastructure
>    Affects Versions: 0.10.0
>            Reporter: Navis
>            Assignee: Navis
>            Priority: Trivial
>
> {code}
> load data instream 'key value\nkey2 value2' into table test;
> {code}
> This will make testing easier and can also reduce test time. For example,
> {code}
> -- ppr_pushdown.q
> create table ppr_test (key string) partitioned by (ds string);
> alter table ppr_test add partition (ds = '1234');
> insert overwrite table ppr_test partition(ds = '1234') select * from (select '1234' from src limit 1 union all select 'abcd' from src limit 1) s;
> {code}
> The last query is a 4-MR job, but it can be replaced by
> {code}
> create table ppr_test (key string) partitioned by (ds string) ROW FORMAT delimited fields terminated by ' ';
> alter table ppr_test add partition (ds = '1234');
> load data local instream '1234\nabcd' overwrite into table ppr_test partition(ds = '1234');
> {code}
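
For comparison, the same rows can be loaded with the existing LOAD DATA LOCAL INPATH statement if the test harness first stages them in a temporary file. The following is a minimal sketch in Python under that assumption. The helper name stage_inline_rows and its parameters are hypothetical; it is not part of Hive and is not the approach taken in HIVE-3227 or HIVE-3238.

```python
import os
import tempfile
from typing import Optional, Tuple


def stage_inline_rows(data: str,
                      table: str,
                      partition: Optional[str] = None,
                      overwrite: bool = False) -> Tuple[str, str]:
    """Write inline rows to a temp file and build a LOAD DATA statement.

    Hypothetical helper for illustration only. Returns (path, statement);
    the caller is responsible for running the statement and deleting the file.
    """
    fd, path = tempfile.mkstemp(prefix="hive_inline_", suffix=".txt")
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        f.write(data)
        # Terminate the last row so it is read as a complete line.
        if not data.endswith("\n"):
            f.write("\n")

    # Assemble the statement using Hive's existing LOAD DATA syntax.
    parts = ["LOAD DATA LOCAL INPATH '{}'".format(path)]
    if overwrite:
        parts.append("OVERWRITE")
    parts.append("INTO TABLE {}".format(table))
    if partition:
        parts.append("PARTITION ({})".format(partition))
    return path, " ".join(parts) + ";"


if __name__ == "__main__":
    # Same rows and partition as the ppr_test example in the description.
    tmp_path, statement = stage_inline_rows(
        "1234\nabcd", "ppr_test", partition="ds = '1234'", overwrite=True)
    print(statement)
    os.remove(tmp_path)
```

This keeps the language unchanged at the cost of an extra file per test, which is the overhead the proposed instream form is meant to avoid.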
