You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@reef.apache.org by "Markus Weimer (JIRA)" <ji...@apache.org> on 2016/01/19 20:54:39 UTC

[jira] [Commented] (REEF-873) Fix DataSet id issue

    [ https://issues.apache.org/jira/browse/REEF-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15107309#comment-15107309 ] 

Markus Weimer commented on REEF-873:
------------------------------------

[~juliaw], is this still an issue? I believe we now use guids, right?

> Fix DataSet id issue
> --------------------
>
>                 Key: REEF-873
>                 URL: https://issues.apache.org/jira/browse/REEF-873
>             Project: REEF
>          Issue Type: Bug
>            Reporter: Julia
>            Assignee: Julia
>
> Currently the id of the dataset is formed from extract some string from the input file name. In reality, the input file name itself can be a generated file with some random numbers that makes the file name has no meaning at all. And since the file name can be very long, that can also break the current way to form the id. 
> We should remove this id generation dependency on the input file names, instead, comes out a id with a prefix like "FileSystemDataSet-" plus a guid for example, as long as it is unique. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)