You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Edward Capriolo (JIRA)" <ji...@apache.org> on 2009/09/09 21:12:57 UTC

[jira] Commented: (HIVE-820) Describe Extended Line Breaks When Delimiter is \n

    [ https://issues.apache.org/jira/browse/HIVE-820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753218#action_12753218 ] 

Edward Capriolo commented on HIVE-820:
--------------------------------------

Can I drop a late comment in....

-            outStream.writeBytes(tbl.getTTable().toString());
+            outStream.writeBytes(tbl.getTTable().toString().replaceAll("\n", "<LF>").replaceAll("\t", "<TAB>"));

We should do this in a uniform format. There are lots of non printable characters we use US UnitSeparator for example

http://web.cs.mun.ca/~michael/c/ascii-table.html
Why not output in the same format the create table would specify?

{noformat}
 FIELDS TERMINATED BY '\054' " +
        " LINES TERMINATED BY '\012' " );
{noformat}

> Describe Extended Line Breaks When Delimiter is \n
> --------------------------------------------------
>
>                 Key: HIVE-820
>                 URL: https://issues.apache.org/jira/browse/HIVE-820
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0, 0.3.1, 0.3.2, 0.4.0, 0.5.0
>            Reporter: Matt Pestritto
>            Assignee: Matt Pestritto
>            Priority: Minor
>             Fix For: 0.5.0
>
>         Attachments: hive_820.patch
>
>
> Tables defined delimited with \t and breaks using \n has output of describe extended that is not contiguous.
> Line.delim outputs an actual \n which breaks the display output so using the hiveservice you have to do another FetchOne to get the rest of the line.
> For example.
> Original Output:
> Detailed Table Information    Table(tableName:cobra_merchandise, dbName:default, owner:hive, createTime:1248726291, lastAccessTime:0, retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:merchandise_tid, type:string, comment:null), FieldSchema(name:client_merch_type_tid, type:string, comment:null), FieldSchema(name:description, type:string, comment:null), FieldSchema(name:client_description, type:string, comment:null), FieldSchema(name:price, type:string, comment:null), FieldSchema(name:cost, type:string, comment:null), FieldSchema(name:start_date, type:string, comment:null), FieldSchema(name:end_date, type:string, comment:null)], location:hdfs://mustique:9000/user/hive/warehouse/m, inputFormat:org.apache.hadoop.mapred.TextInputFormat, outputFormat:org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat, compressed:false, numBuckets:-1, serdeInfo:SerDeInfo(name:null, serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, parameters:{serialization.format=9,line.delim=
> ,field.delim=    }), bucketCols:[], sortCols:[], parameters:{}), partitionKeys:[FieldSchema(name:client_tid, type:int, comment:null)], parameters:{})   
> Proposed Output:
> Detailed Table Information    Table(tableName:cobra_merchandise, dbName:default, owner:hive, createTime:1248726291, lastAccessTime:0, retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:merchandise_tid, type:string, comment:null), FieldSchema(name:client_merch_type_tid, type:string, comment:null), FieldSchema(name:description, type:string, comment:null), FieldSchema(name:client_description, type:string, comment:null), FieldSchema(name:price, type:string, comment:null), FieldSchema(name:cost, type:string, comment:null), FieldSchema(name:start_date, type:string, comment:null), FieldSchema(name:end_date, type:string, comment:null)], location:hdfs://mustique:9000/user/hive/warehouse/m, inputFormat:org.apache.hadoop.mapred.TextInputFormat, outputFormat:org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat, compressed:false, numBuckets:-1, serdeInfo:SerDeInfo(name:null, serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, parameters:{serialization.format=9,line.delim=<LF>,field.delim=<TAB>}), bucketCols:[], sortCols:[], parameters:{}), partitionKeys:[FieldSchema(name:client_tid, type:int, comment:null)], parameters:{})   

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.