You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Marty Kube (JIRA)" <ji...@apache.org> on 2013/02/27 22:35:12 UTC
[jira] [Created] (MAHOUT-1150) ARFF Integeration does not support
quoted identifiers
Marty Kube created MAHOUT-1150:
----------------------------------
Summary: ARFF Integeration does not support quoted identifiers
Key: MAHOUT-1150
URL: https://issues.apache.org/jira/browse/MAHOUT-1150
Project: Mahout
Issue Type: Bug
Components: Integration
Affects Versions: 0.7
Environment: All
Reporter: Marty Kube
I ran NSL-KDD data set (http://nsl.cs.unb.ca/NSL-KDD/) through the ARFF integration. The process failed to parse the arff formatted file. The file has quoted identifiers:
@relation 'KDDTrain-20Percent'
@attribute 'duration' real
@attribute 'protocol_type' {'tcp','udp', 'icmp'}
The quotes caused the problem. The "official" arff DNF shows that quotes should be supported:
https://list.scms.waikato.ac.nz/mailman/htdig/wekalist/2008-January/012153.html
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira