Posted to issues@flink.apache.org by "Guowei Ma (Jira)" <ji...@apache.org> on 2020/08/10 08:35:00 UTC
[jira] [Commented] (FLINK-11991) Set headers to use for CSV output
[ https://issues.apache.org/jira/browse/FLINK-11991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17174181#comment-17174181 ]
Guowei Ma commented on FLINK-11991:
-----------------------------------
Hi [~jnioche]
Thank you for opening this Jira issue.
However, the DataSet API will be deprecated in the near future [1]. The DataStream API uses the StreamingFileSink for writing data to files, so I think you could look at the StreamingFileSink first.
My gut feeling is that we could introduce some new APIs such as `WriteHeader` and `WriteFooter`. The CSV format developer could then implement the `WriteHeader` interface.
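To make the idea a bit more concrete, here is a rough sketch of what such interfaces might look like. Everything in it is an assumption for illustration only: `WriteHeader` and `WriteFooter` do not exist in Flink, the method signatures are guesses, and `CsvLineEncoder` and `writeFile` are hypothetical stand-ins for a CSV format and for whatever the file sink would do when it opens a new part file. The CSV handling is deliberately naive (no quoting or escaping).

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;

final class HeaderSketch {

    // Hypothetical interface (not part of Flink): called once per file,
    // before the first record is written.
    interface WriteHeader {
        void writeHeader(OutputStream out) throws IOException;
    }

    // Hypothetical interface (not part of Flink): called once per file,
    // after the last record and before the file is closed.
    interface WriteFooter {
        void writeFooter(OutputStream out) throws IOException;
    }

    // Illustrative CSV format that opts in to header support.
    // Fields are joined with commas as-is; no quoting or escaping.
    static final class CsvLineEncoder implements WriteHeader {
        private final List<String> columns;

        CsvLineEncoder(List<String> columns) {
            this.columns = columns;
        }

        @Override
        public void writeHeader(OutputStream out) throws IOException {
            writeLine(columns, out);
        }

        void encode(List<String> fields, OutputStream out) throws IOException {
            writeLine(fields, out);
        }

        private static void writeLine(List<String> values, OutputStream out) throws IOException {
            out.write((String.join(",", values) + "\n").getBytes(StandardCharsets.UTF_8));
        }
    }

    // Stand-in for the sink: write the header once, then every record.
    static String writeFile(CsvLineEncoder encoder, List<List<String>> records) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        encoder.writeHeader(out);
        for (List<String> record : records) {
            encoder.encode(record, out);
        }
        return out.toString("UTF-8");
    }

    public static void main(String[] args) throws IOException {
        CsvLineEncoder encoder = new CsvLineEncoder(Arrays.asList("id", "name"));
        List<List<String>> records = Arrays.asList(
                Arrays.asList("1", "alice"),
                Arrays.asList("2", "bob"));
        System.out.print(writeFile(encoder, records));
    }
}
```

Keeping the header as a separate opt-in interface would mean that formats without a header concept need no changes; the sink would only check whether the format implements `WriteHeader` when it opens a new file.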
What do you think?
[1] [https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=158866741]
> Set headers to use for CSV output
> ---------------------------------
>
> Key: FLINK-11991
> URL: https://issues.apache.org/jira/browse/FLINK-11991
> Project: Flink
> Issue Type: Improvement
> Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
> Reporter: Julien Nioche
> Priority: Minor
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> As discussed in [https://stackoverflow.com/questions/54530755/flink-write-tuples-with-csv-header-into-file/54536586?noredirect=1#comment97248717_54536586], it would be nice to be able to specify headers to print out at the beginning of a CSV output.
> I've written a patch for this and will submit it as a PR.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)