You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@phoenix.apache.org by "Prashant Vithani (JIRA)" <ji...@apache.org> on 2019/04/30 15:12:00 UTC

[jira] [Commented] (PHOENIX-5258) Add support to parse header from the input CSV file as input columns for CsvBulkLoadTool

    [ https://issues.apache.org/jira/browse/PHOENIX-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16830381#comment-16830381 ] 

Prashant Vithani commented on PHOENIX-5258:
-------------------------------------------

Cool [~elserj]. I've updated the title as well as a description. I'm currently working with the proposed solution specified in the description.

> Add support to parse header from the input CSV file as input columns for CsvBulkLoadTool
> ----------------------------------------------------------------------------------------
>
>                 Key: PHOENIX-5258
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-5258
>             Project: Phoenix
>          Issue Type: Improvement
>            Reporter: Prashant Vithani
>            Priority: Minor
>
> Currently, CsvBulkLoadTool does not support reading header from the input csv and expects the content of the csv to match with the table schema. The support for the header can be added to dynamically map the schema with the header.
> The proposed solution is to introduce another option for the tool `–header`. If this option is passed, the input columns list is constructed by reading the first line of the input CSV file.
>  * If there is only one file, read the header from the first line and generate the `ColumnInfo` list.
>  * If there are multiple files, read the header from all the files, and throw an error if the headers across files do not match.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)