You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Guowei Ma (Jira)" <ji...@apache.org> on 2020/08/10 08:35:00 UTC

[jira] [Commented] (FLINK-11991) Set headers to use for CSV output

    [ https://issues.apache.org/jira/browse/FLINK-11991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17174181#comment-17174181 ] 

Guowei Ma commented on FLINK-11991:
-----------------------------------

Hi [~jnioche]

Thanks you for opening this jira.

However the DataSet API would be deprecated in the near future[1].  DataStream would use the StreamingFileSink for writing data to the file. So I think maybe you could look at the StreamingFileSink first.

My gut feeling is that maybe we could introduce some new api such as `WriteHeader` & `WriteFooter`.The csv format developer could implement the `WriteHeader` the interface.

What do you think?

[1] [https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=158866741]

> Set headers to use for CSV output
> ---------------------------------
>
>                 Key: FLINK-11991
>                 URL: https://issues.apache.org/jira/browse/FLINK-11991
>             Project: Flink
>          Issue Type: Improvement
>          Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
>            Reporter: Julien Nioche
>            Priority: Minor
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> As discussed in [https://stackoverflow.com/questions/54530755/flink-write-tuples-with-csv-header-into-file/54536586?noredirect=1#comment97248717_54536586], it would be nice to be able to specify headers to print out at the beginning of a CSV output.
> I've written a patch for this and will add submit it as a PR.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)