You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by tabraiz anwer <ta...@yahoo.com> on 2012/06/04 18:08:12 UTC

Problem: LINES TERMINATED BY only supports newline '\n' right now.

Hi,
    i had tried to create the table by "LINES terminated by '\001' " 
and it is giving me the error

Error in semantic analysis: 3:66 LINES TERMINATED BY only supports newline '\n' right now. Error encountered near token ''\001''


CREATE TABLE xmlgw4 (      transactionid string,    typeid string,       
       sentxml string,receivedxml string     )
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\001'
STORED AS TEXTFILE;


instead of '\n' i am using '\001' because i have an xml value , which i want to store in hive that invludes \n values..
using hive version : hive-0.8.1

any suggestion?
Regards.

Re: Problem: LINES TERMINATED BY only supports newline '\n' right now.

Posted by Mark Grover <mg...@oanda.com>.
Hi Tabraiz,
The 10 in the source code is what '\n' is in ASCII (base 10). That's why you see it. It still represents a linefeed.

Mark

----- Original Message -----
From: "tabraiz anwer" <ta...@yahoo.com>
To: "Mark Grover" <mg...@oanda.com>, "hive group" <us...@hive.apache.org>
Sent: Monday, June 4, 2012 4:42:02 PM
Subject: Re: Problem: LINES TERMINATED BY only supports newline '\n' right now.



hello Mark, 
instead of '\n\ we can also termincate records by '10' i have see the exmaple in hive wiki where they are creating tables and their records are terminating by '\001\' 
i have checked src of hive syntax analyzer . there are only two option of LINES termination one is '\n' and other is '10' now the question arises is how i can add another line termination values of '\001\ 
Regards. 





From: Mark Grover <mg...@oanda.com> 
To: user@hive.apache.org; tabraiz anwer <ta...@yahoo.com> 
Sent: Monday, 4 June 2012 9:26 PM 
Subject: Re: Problem: LINES TERMINATED BY only supports newline '\n' right now. 

Hi Tabriz, 
As far as I know, newlines are the only supported way to separate records right now. As a corollary if a single logical records exists across multiple lines, you will have to get rid of the extra newlines for all of it to be in the same record. 

So, to get around it, you can do one of two things: 
1) Pre-process your files to break records apart on newlines. 
2) As Ed Capriolo suggested in a previous email thread, you could try to use streaming, parse out your XML there and emit out multiple records. 

Mark 

----- Original Message ----- 
From: "tabraiz anwer" < tabraizanwer@yahoo.com > 
To: "hive group" < user@hive.apache.org > 
Sent: Monday, June 4, 2012 12:08:12 PM 
Subject: Problem: LINES TERMINATED BY only supports newline '\n' right now. 



Hi, 
i had tried to create the table by "LINES terminated by '\001' " 
and it is giving me the error 


Error in semantic analysis: 3:66 LINES TERMINATED BY only supports newline '\n' right now. Error encountered near token ''\001'' 


CREATE TABLE xmlgw4 ( transactionid string, typeid string, 
sentxml string,receivedxml string ) 
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\001' 
STORED AS TEXTFILE; 



instead of '\n' i am using '\001' because i have an xml value , which i want to store in hive that invludes \n values.. 
using hive version : hive-0.8.1 

any suggestion? 
Regards. 



Re: Problem: LINES TERMINATED BY only supports newline '\n' right now.

Posted by tabraiz anwer <ta...@yahoo.com>.
hello Mark,
instead of '\n\ we can also termincate records by '10' i have see the  exmaple in hive wiki where they are creating tables and their records are terminating by '\001\'
i have checked src of hive syntax analyzer . there are only two option of LINES termination one is '\n' and other is '10' now the question arises is how i can add another line termination values of '\001\
Regards.



________________________________
 From: Mark Grover <mg...@oanda.com>
To: user@hive.apache.org; tabraiz anwer <ta...@yahoo.com> 
Sent: Monday, 4 June 2012 9:26 PM
Subject: Re: Problem: LINES TERMINATED BY only supports newline '\n' right now.
 
Hi Tabriz,
As far as I know, newlines are the only supported way to separate records right now. As a corollary if a single logical records exists across multiple lines, you will have to get rid of the extra newlines for all of it to be in the same record.

So, to get around it, you can do one of two things:
1) Pre-process your files to break records apart on newlines.
2) As Ed Capriolo suggested in a previous email thread, you could try to use streaming, parse out your XML there and emit out multiple records.

Mark

----- Original Message -----
From: "tabraiz anwer" <ta...@yahoo.com>
To: "hive group" <us...@hive.apache.org>
Sent: Monday, June 4, 2012 12:08:12 PM
Subject: Problem: LINES TERMINATED BY only supports newline '\n' right now.



Hi, 
i had tried to create the table by "LINES terminated by '\001' " 
and it is giving me the error 


Error in semantic analysis: 3:66 LINES TERMINATED BY only supports newline '\n' right now. Error encountered near token ''\001'' 


CREATE TABLE xmlgw4 ( transactionid string, typeid string, 
sentxml string,receivedxml string ) 
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\001' 
STORED AS TEXTFILE; 



instead of '\n' i am using '\001' because i have an xml value , which i want to store in hive that invludes \n values.. 
using hive version : hive-0.8.1 

any suggestion? 
Regards. 

Re: Problem: LINES TERMINATED BY only supports newline '\n' right now.

Posted by Mark Grover <mg...@oanda.com>.
Hi Tabriz,
As far as I know, newlines are the only supported way to separate records right now. As a corollary if a single logical records exists across multiple lines, you will have to get rid of the extra newlines for all of it to be in the same record.

So, to get around it, you can do one of two things:
1) Pre-process your files to break records apart on newlines.
2) As Ed Capriolo suggested in a previous email thread, you could try to use streaming, parse out your XML there and emit out multiple records.

Mark

----- Original Message -----
From: "tabraiz anwer" <ta...@yahoo.com>
To: "hive group" <us...@hive.apache.org>
Sent: Monday, June 4, 2012 12:08:12 PM
Subject: Problem: LINES TERMINATED BY only supports newline '\n' right now.



Hi, 
i had tried to create the table by "LINES terminated by '\001' " 
and it is giving me the error 


Error in semantic analysis: 3:66 LINES TERMINATED BY only supports newline '\n' right now. Error encountered near token ''\001'' 


CREATE TABLE xmlgw4 ( transactionid string, typeid string, 
sentxml string,receivedxml string ) 
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\001' 
STORED AS TEXTFILE; 



instead of '\n' i am using '\001' because i have an xml value , which i want to store in hive that invludes \n values.. 
using hive version : hive-0.8.1 

any suggestion? 
Regards.