You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@sqoop.apache.org by "Henrique Andrade (JIRA)" <ji...@apache.org> on 2015/04/16 20:50:59 UTC

[jira] [Created] (SQOOP-2312) Problem when exporting files that has \n as part as the content columns

Henrique Andrade created SQOOP-2312:
---------------------------------------

             Summary: Problem when exporting files that has \n as part as the content columns
                 Key: SQOOP-2312
                 URL: https://issues.apache.org/jira/browse/SQOOP-2312
             Project: Sqoop
          Issue Type: Bug
          Components: connectors/generic
         Environment: Sqoop 1.4.6-rc1
            Reporter: Henrique Andrade
            Priority: Critical


I have exported from my SQL Server some data related to our customers.

One of the columns has some comments from customers and this is the data that is there:

"Pecém\n" +
                        "                                \n" +
                        "								(São Gonçalo do Amarante)

The problem is that Sqoop is breaking the Record at this point and the rest of the process is failing.

I tried to use some different options such as lines-terminated by with different character (ˆ) but looks like hadoop library is not accepting that and is taking all the 29.000 records as a single record.
   "--fields-terminated-by", "|",
                "--lines-terminated-by", "ˆ",
                "--enclosed-by","'",
                "--escaped-by","\\"};
I have read in some threads that looks like the only lines-terminated-by character that was accepted was \n. Is this changed on this 1.4.6 version?

Is there a way for avoiding the content of the columns to break the import?
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)