Posted to dev@pig.apache.org by "Jonathan Packer (JIRA)" <ji...@apache.org> on 2013/01/28 21:39:12 UTC
[jira] [Created] (PIG-3141) Giving CSVExcelStorage an option to handle header rows
Jonathan Packer created PIG-3141:
------------------------------------
Summary: Giving CSVExcelStorage an option to handle header rows
Key: PIG-3141
URL: https://issues.apache.org/jira/browse/PIG-3141
Project: Pig
Issue Type: Improvement
Components: piggybank
Affects Versions: 0.11
Reporter: Jonathan Packer
Fix For: 0.11
Attachments: csv.patch
Adds an argument to CSVExcelStorage that skips the header row when loading. Header skipping works correctly in both split scenarios: several small files, each with its own header, combined into a single split; and one large file with a single header divided across multiple splits.
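The two split scenarios can be illustrated with a small sketch. This is not the code from csv.patch; it is a plain Python model that assumes a reader can tell whether a chunk begins at byte offset 0 of its file. The Chunk tuple and the read_without_headers function are hypothetical names introduced only for this illustration.

```python
from typing import Iterable, Iterator, List, Tuple

# Hypothetical model of a chunk of input:
# (file name, byte offset where the chunk starts, lines in the chunk).
Chunk = Tuple[str, int, List[str]]


def read_without_headers(chunks: Iterable[Chunk]) -> Iterator[str]:
    """Yield data lines, dropping the first line of every chunk that starts a file."""
    for _name, start_offset, lines in chunks:
        # Only a chunk that begins at offset 0 can contain the file's header.
        body = lines[1:] if start_offset == 0 else lines
        yield from body


# Case 1: two small files, each with its own header, combined into one split.
combined = [
    ("a.csv", 0, ["id,name", "1,ann"]),
    ("b.csv", 0, ["id,name", "2,bob"]),
]

# Case 2: one large file with a single header, divided into two splits.
divided = [
    ("big.csv", 0, ["id,name", "1,ann", "2,bob"]),
    ("big.csv", 4096, ["3,cy", "4,dee"]),
]

combined_rows = list(read_without_headers(combined))
divided_rows = list(read_without_headers(divided))
```

In this model both headers are dropped in the first case, while in the second case only the chunk at offset 0 loses its first line, so no data row is discarded from the later split.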
Also fixes a few bugs in CSVExcelStorage, including PIG-2470 and a bug in which quoted fields at the end of a line were not escaped properly.
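The issue text does not spell out the failing input, so the following is only an illustration of the Excel-style quoting convention that a last-field value is expected to follow. It uses Python's standard csv module rather than CSVExcelStorage, and the sample row is made up for the example.

```python
import csv
import io

# The last field contains both a comma and double quotes, so it must be
# wrapped in quotes and its embedded quotes must be doubled.
row = ["1", "plain", 'she said "hi", then left']

buffer = io.StringIO()
csv.writer(buffer, lineterminator="\n").writerow(row)
line = buffer.getvalue()

# Reading the written line back should recover the original three fields.
parsed = next(csv.reader(io.StringIO(line)))
```

A storer and loader that agree on this convention round-trip the row unchanged, including when the quoted field is the last one on the line.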
Removes the choice of delimiter, since a CSV file ought to use only a comma as its delimiter, as the name implies.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira