You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Andrew Ahmad (JIRA)" <ji...@apache.org> on 2013/10/03 16:21:42 UTC

[jira] [Commented] (HIVE-3065) New lines in columns can cause problems even when using sequence files

    [ https://issues.apache.org/jira/browse/HIVE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13785229#comment-13785229 ] 

Andrew Ahmad commented on HIVE-3065:
------------------------------------

This problem still exists in 0.10.0. I'm using RCFile and ran into this issue today. Not sure about 0.11.0 as I'm limited to the packages available in the CDH distribution.

> New lines in columns can cause problems even when using sequence files
> ----------------------------------------------------------------------
>
>                 Key: HIVE-3065
>                 URL: https://issues.apache.org/jira/browse/HIVE-3065
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.7.1, 0.8.1
>            Reporter: Joey Echeverria
>
> When using sequence files as the container format, I'd expect to be able to embed new lines in a column. However, this causes problems when the data is output if the newlines aren't manually stripped or escaped. This tends to show up as each row of output generating two (or more) rows with nulls after the column with a new line and nulls for the "empty" columns on the second row.



--
This message was sent by Atlassian JIRA
(v6.1#6144)