You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Andreas Paepcke (JIRA)" <ji...@apache.org> on 2011/03/21 00:06:06 UTC

[jira] [Updated] (PIG-1924) CSV Loader/Store that handles newlines in fields, and other Excel CSV features.

     [ https://issues.apache.org/jira/browse/PIG-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Paepcke updated PIG-1924:
---------------------------------

    Attachment: TestCSVExcelStorage.java
                CSVExcelStorage.java

> CSV Loader/Store that handles newlines in fields, and other Excel CSV features.
> -------------------------------------------------------------------------------
>
>                 Key: PIG-1924
>                 URL: https://issues.apache.org/jira/browse/PIG-1924
>             Project: Pig
>          Issue Type: New Feature
>          Components: tools
>    Affects Versions: 0.8.0
>            Reporter: Andreas Paepcke
>         Attachments: CSVExcelStorage.java, TestCSVExcelStorage.java
>
>   Original Estimate: 0h
>  Remaining Estimate: 0h
>
> CSVExcelStorage() combines load and store of CSV encoded data. Handles newlines within fields, escaped double quotes, and double quoting of fields with embedded field delimiters. Newline handling is optional, and controlled by a parameter. The module also offers an option to output with Windows style newlines (CRLF, instead of the Unix LF). All CSV related syntax decisions were made to match Excel 2007.
> The module comes with a test file, and javadoc produces proper documentation files.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira