You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Neelesh Srinivas Salian (JIRA)" <ji...@apache.org> on 2016/07/25 03:13:20 UTC

[jira] [Commented] (FLINK-992) Create CollectionDataSets by reading (client) local files.

    [ https://issues.apache.org/jira/browse/FLINK-992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15391284#comment-15391284 ] 

Neelesh Srinivas Salian commented on FLINK-992:
-----------------------------------------------

Shall I begin to work on this if no one else is?


> Create CollectionDataSets by reading (client) local files.
> ----------------------------------------------------------
>
>                 Key: FLINK-992
>                 URL: https://issues.apache.org/jira/browse/FLINK-992
>             Project: Flink
>          Issue Type: New Feature
>          Components: DataSet API, Python API
>            Reporter: Fabian Hueske
>            Assignee: niraj rai
>            Priority: Minor
>              Labels: starter
>
> {{CollectionDataSets}} are a nice way to feed data into programs.
> We could add support to read a client-local file at program construction time using a FileInputFormat, put its data into a CollectionDataSet, and ship its data together with the program.
> This would remove the need to upload small files into DFS which are used together with some large input (stored in DFS).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)