Posted to issues@flink.apache.org by "Neelesh Srinivas Salian (JIRA)" <ji...@apache.org> on 2016/07/25 03:13:20 UTC
[jira] [Commented] (FLINK-992) Create CollectionDataSets by reading (client) local files.
[ https://issues.apache.org/jira/browse/FLINK-992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15391284#comment-15391284 ]
Neelesh Srinivas Salian commented on FLINK-992:
-----------------------------------------------
Shall I begin to work on this if no one else is?
> Create CollectionDataSets by reading (client) local files.
> ----------------------------------------------------------
>
> Key: FLINK-992
> URL: https://issues.apache.org/jira/browse/FLINK-992
> Project: Flink
> Issue Type: New Feature
> Components: DataSet API, Python API
> Reporter: Fabian Hueske
> Assignee: niraj rai
> Priority: Minor
> Labels: starter
>
> {{CollectionDataSets}} are a nice way to feed data into programs.
> We could add support to read a client-local file at program construction time using a FileInputFormat, put its data into a CollectionDataSet, and ship its data together with the program.
> This would remove the need to upload small files into DFS which are used together with some large input (stored in DFS).
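
A minimal sketch of the idea described in the issue, not the eventual implementation: read a small client-local file into an in-memory list at program construction time, so that the list can be handed to a collection-based source and shipped with the program. The class name LocalFileCollection and the helper readLocalLines are hypothetical. The sketch uses plain java.nio rather than the FileInputFormat the issue proposes, so that it runs without Flink on the classpath; the Flink call appears only in a comment.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper class, for illustration only.
class LocalFileCollection {

    // Reads the whole client-local file into memory, one element per line.
    // Only sensible for small files, since the data travels with the program.
    static List<String> readLocalLines(Path localFile) {
        try {
            return new ArrayList<>(Files.readAllLines(localFile, StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        if (args.length != 1) {
            System.err.println("usage: LocalFileCollection <local-file>");
            return;
        }
        List<String> lines = readLocalLines(Paths.get(args[0]));
        System.out.println("read " + lines.size() + " lines");

        // Assumption: with Flink on the classpath, the list could then be
        // turned into a collection-backed DataSet, for example:
        //   ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
        //   DataSet<String> small = env.fromCollection(lines);
    }
}
```

Because the file is read on the client, the small input would not need to be uploaded to DFS before it can be joined with a large DFS-resident input.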
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)