Posted to dev@hive.apache.org by "Ning Zhang (JIRA)" <ji...@apache.org> on 2010/08/27 21:41:54 UTC

[jira] Updated: (HIVE-1603) support CSV text file format

     [ https://issues.apache.org/jira/browse/HIVE-1603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ning Zhang updated HIVE-1603:
-----------------------------

    Description: 
The Comma Separated Values (CSV) text format is commonly used for exchanging relational data between heterogeneous systems. Currently Hive uses the TextFile format when displaying query results. This can cause confusion when column values contain newlines or tabs. A CSVTextFile format could get around this problem. This will require a new CSVTextInputFormat, CSVTextOutputFormat, and CSVSerDe.
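
To illustrate the ambiguity described above, here is a minimal sketch in Python (not Hive code, and not the proposed implementation). It assumes a simplified stand-in for plain text output, with fields joined by tabs and rows ended by newlines, and compares it with quoted CSV written by Python's standard csv module. The sample rows are made up for the example.

```python
import csv
import io

# Made-up sample rows: the second column of row 2 contains a tab and a newline.
rows = [
    ["1", "plain value"],
    ["2", "has\ttab and\nnewline"],
]

# Simplified stand-in for plain text output (an assumption for this sketch):
# fields joined by tabs, rows terminated by newlines, no quoting or escaping.
tsv_text = "".join("\t".join(row) + "\n" for row in rows)
tsv_parsed = [line.split("\t") for line in tsv_text.splitlines()]

# CSV output: the writer quotes any field that contains the delimiter,
# the quote character, or a line break, so the field boundaries survive.
buf = io.StringIO()
csv.writer(buf, lineterminator="\n").writerows(rows)
csv_text = buf.getvalue()
csv_parsed = list(csv.reader(io.StringIO(csv_text, newline="")))

print("rows after tab/newline round trip:", len(tsv_parsed))
print("rows after CSV round trip:", len(csv_parsed))
print("CSV round trip preserved the data:", csv_parsed == rows)
```

In the tab-delimited case the embedded tab and newline are indistinguishable from real separators, so a reader sees extra columns and an extra row. With CSV quoting, a reader that understands quotes recovers the original two rows.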

A proposed use case looks like this:

{code}
-- exporting a table to CSV files in a directory
hive> set hive.io.output.fileformat=CSVTextFile;
hive> insert overwrite local directory '/tmp/CSVrepos/' select * from S where ... ;

-- query result in CSV
hive -e 'set hive.io.output.fileformat=CSVTextFile; select * from T;' | sql_loader_to_other_systems

-- query CSV files directory from Hive
hive> create table T (...) stored as CSVTextFile;
hive> load data local inpath '/my/CSVfiles' into table T;
hive> select * from T where ...;
{code}

  was:
The Comma Separated Values (CSV) text format is commonly used for exchanging relational data between heterogeneous systems. Currently Hive uses the TextFile format when displaying query results. This can cause confusion when column values contain newlines or tabs. A CSVTextFile format could get around this problem. This will require a new CSVTextInputFormat, CSVTextOutputFormat, and CSVSerDe.

A proposed use case looks like this:

{code}
-- exporting a table to CSV files in a directory
hive> set hive.io.output.fileformat=CSVTextFile;
hive> insert overwrite local directory '/tmp/CSVrepos/' select * from S where ... ;

-- query result in CSV
hive -e 'set hive.io.output.fileformat=CSVTextFile; select * from T;' | sql_loader_to_other_systems
{code}


> support CSV text file format
> ----------------------------
>
>                 Key: HIVE-1603
>                 URL: https://issues.apache.org/jira/browse/HIVE-1603
>             Project: Hadoop Hive
>          Issue Type: New Feature
>    Affects Versions: 0.7.0
>            Reporter: Ning Zhang
