You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Zijie Lu (Jira)" <ji...@apache.org> on 2019/09/17 04:01:00 UTC
[jira] [Commented] (FLINK-992) Create CollectionDataSets by reading
(client) local files.
[ https://issues.apache.org/jira/browse/FLINK-992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16931068#comment-16931068 ]
Zijie Lu commented on FLINK-992:
--------------------------------
[~fhueske], I am new here. What's the status of this issue now? If no one work on at the moment, can you assign it to me?
> Create CollectionDataSets by reading (client) local files.
> ----------------------------------------------------------
>
> Key: FLINK-992
> URL: https://issues.apache.org/jira/browse/FLINK-992
> Project: Flink
> Issue Type: New Feature
> Components: API / DataSet, API / Python
> Reporter: Fabian Hueske
> Priority: Minor
> Labels: starter
>
> {{CollectionDataSets}} are a nice way to feed data into programs.
> We could add support to read a client-local file at program construction time using a FileInputFormat, put its data into a CollectionDataSet, and ship its data together with the program.
> This would remove the need to upload small files into DFS which are used together with some large input (stored in DFS).
--
This message was sent by Atlassian Jira
(v8.3.2#803003)