You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@samoa.apache.org by "Simon Dugas (JIRA)" <ji...@apache.org> on 2015/04/24 10:42:38 UTC

[jira] [Created] (SAMOA-26) VHT throws NumberFormatException on class attribute

Simon Dugas created SAMOA-26:
--------------------------------

             Summary: VHT throws NumberFormatException on class attribute
                 Key: SAMOA-26
                 URL: https://issues.apache.org/jira/browse/SAMOA-26
             Project: SAMOA
          Issue Type: Bug
          Components: SAMOA-Local
         Environment: MAC OSX 10.10.3
java version "1.7.0_71"
Java(TM) SE Runtime Environment (build 1.7.0_71-b14)
Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode)
            Reporter: Simon Dugas


I'm trying to debug the following error, PrequentialEvaluation with VHT (classification) throws a NumberFormatException for the class attribute. Why is it trying to parse the class attribute as an integer? I can't find a format error in my ARFF file. It was created with weka.core.converters.CSVLoader. Other datasets (nominal only) work fine with my install of SAMOA. This configuration runs fine in MOA.

Command Line Argument

bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar "PrequentialEvaluation -l com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s (ArffFileStream -f test.arff) -f 1"

ARFF File

@relation test.txt

@attribute Dur numeric
@attribute Proto {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt}
@attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'}
@attribute State {CON,PA_PA,PA_FRA, ...}
@attribute sTos numeric
@attribute dTos numeric
@attribute TotPkts numeric
@attribute TotBytes numeric
@attribute SrcBytes numeric
@attribute class {Background,Normal,Botnet}

@data

1065.731934,udp,' <->',CON,0,0,2,252,145,Background
1471.787109,udp,' <->',CON,0,0,2,252,145,Background
...

Error Output

Command line string = PrequentialEvaluation -l com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s (ArffFileStream -f test.arff) -f 1
2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask (LocalDoTask.java:80) - Successfully instantiating com.yahoo.labs.samoa.tasks.PrequentialEvaluation
Exception in thread "main" java.lang.NumberFormatException: For input string: "Background"
at sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250)
at java.lang.Double.valueOf(Double.java:504)
at com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105)
at com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77)
at com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182)
at com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183)
at com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145)
at com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118)
at com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46)
at com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70)
at com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197)
at com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170)
at com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161)
at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Re: [jira] [Created] (SAMOA-26) VHT throws NumberFormatException on class attribute

Posted by Albert Bifet <ab...@waikato.ac.nz>.
Yes, the problem is in the parsing of the Dir attribute that
propagates down to the other attributes.

Cheers, Albert

On Mon, Apr 27, 2015 at 3:31 PM, Gianmarco De Francisci Morales
<gd...@apache.org> wrote:
> It's weird, because the error seems to be on the class attribute.
> Maybe some error propagates down the line while parsing?
>
> --
> Gianmarco
>
> On 26 April 2015 at 06:21, Albert Bifet <ab...@waikato.ac.nz> wrote:
>
>> It seems that the problem is with the Dir attribute. Without this
>> attribute it's working fine. It contains non standard characters. We
>> should check how this attribute is parsed.
>>
>> On Fri, Apr 24, 2015 at 4:42 PM, Simon Dugas (JIRA) <ji...@apache.org>
>> wrote:
>> > Simon Dugas created SAMOA-26:
>> > --------------------------------
>> >
>> >              Summary: VHT throws NumberFormatException on class attribute
>> >                  Key: SAMOA-26
>> >                  URL: https://issues.apache.org/jira/browse/SAMOA-26
>> >              Project: SAMOA
>> >           Issue Type: Bug
>> >           Components: SAMOA-Local
>> >          Environment: MAC OSX 10.10.3
>> > java version "1.7.0_71"
>> > Java(TM) SE Runtime Environment (build 1.7.0_71-b14)
>> > Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode)
>> >             Reporter: Simon Dugas
>> >
>> >
>> > I'm trying to debug the following error, PrequentialEvaluation with VHT
>> (classification) throws a NumberFormatException for the class attribute.
>> Why is it trying to parse the class attribute as an integer? I can't find a
>> format error in my ARFF file. It was created with
>> weka.core.converters.CSVLoader. Other datasets (nominal only) work fine
>> with my install of SAMOA. This configuration runs fine in MOA.
>> >
>> > Command Line Argument
>> >
>> > bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar
>> "PrequentialEvaluation -l
>> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
>> (ArffFileStream -f test.arff) -f 1"
>> >
>> > ARFF File
>> >
>> > @relation test.txt
>> >
>> > @attribute Dur numeric
>> > @attribute Proto
>> {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt}
>> > @attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'}
>> > @attribute State {CON,PA_PA,PA_FRA, ...}
>> > @attribute sTos numeric
>> > @attribute dTos numeric
>> > @attribute TotPkts numeric
>> > @attribute TotBytes numeric
>> > @attribute SrcBytes numeric
>> > @attribute class {Background,Normal,Botnet}
>> >
>> > @data
>> >
>> > 1065.731934,udp,' <->',CON,0,0,2,252,145,Background
>> > 1471.787109,udp,' <->',CON,0,0,2,252,145,Background
>> > ...
>> >
>> > Error Output
>> >
>> > Command line string = PrequentialEvaluation -l
>> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
>> (ArffFileStream -f test.arff) -f 1
>> > 2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask
>> (LocalDoTask.java:80) - Successfully instantiating
>> com.yahoo.labs.samoa.tasks.PrequentialEvaluation
>> > Exception in thread "main" java.lang.NumberFormatException: For input
>> string: "Background"
>> > at
>> sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250)
>> > at java.lang.Double.valueOf(Double.java:504)
>> > at
>> com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105)
>> > at
>> com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77)
>> > at
>> com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46)
>> > at
>> com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70)
>> > at
>> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197)
>> > at
>> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170)
>> > at
>> com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161)
>> > at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87)
>> >
>> >
>> >
>> > --
>> > This message was sent by Atlassian JIRA
>> > (v6.3.4#6332)
>>

Re: [jira] [Created] (SAMOA-26) VHT throws NumberFormatException on class attribute

Posted by Gianmarco De Francisci Morales <gd...@apache.org>.
It's weird, because the error seems to be on the class attribute.
Maybe some error propagates down the line while parsing?

--
Gianmarco

On 26 April 2015 at 06:21, Albert Bifet <ab...@waikato.ac.nz> wrote:

> It seems that the problem is with the Dir attribute. Without this
> attribute it's working fine. It contains non standard characters. We
> should check how this attribute is parsed.
>
> On Fri, Apr 24, 2015 at 4:42 PM, Simon Dugas (JIRA) <ji...@apache.org>
> wrote:
> > Simon Dugas created SAMOA-26:
> > --------------------------------
> >
> >              Summary: VHT throws NumberFormatException on class attribute
> >                  Key: SAMOA-26
> >                  URL: https://issues.apache.org/jira/browse/SAMOA-26
> >              Project: SAMOA
> >           Issue Type: Bug
> >           Components: SAMOA-Local
> >          Environment: MAC OSX 10.10.3
> > java version "1.7.0_71"
> > Java(TM) SE Runtime Environment (build 1.7.0_71-b14)
> > Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode)
> >             Reporter: Simon Dugas
> >
> >
> > I'm trying to debug the following error, PrequentialEvaluation with VHT
> (classification) throws a NumberFormatException for the class attribute.
> Why is it trying to parse the class attribute as an integer? I can't find a
> format error in my ARFF file. It was created with
> weka.core.converters.CSVLoader. Other datasets (nominal only) work fine
> with my install of SAMOA. This configuration runs fine in MOA.
> >
> > Command Line Argument
> >
> > bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar
> "PrequentialEvaluation -l
> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
> (ArffFileStream -f test.arff) -f 1"
> >
> > ARFF File
> >
> > @relation test.txt
> >
> > @attribute Dur numeric
> > @attribute Proto
> {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt}
> > @attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'}
> > @attribute State {CON,PA_PA,PA_FRA, ...}
> > @attribute sTos numeric
> > @attribute dTos numeric
> > @attribute TotPkts numeric
> > @attribute TotBytes numeric
> > @attribute SrcBytes numeric
> > @attribute class {Background,Normal,Botnet}
> >
> > @data
> >
> > 1065.731934,udp,' <->',CON,0,0,2,252,145,Background
> > 1471.787109,udp,' <->',CON,0,0,2,252,145,Background
> > ...
> >
> > Error Output
> >
> > Command line string = PrequentialEvaluation -l
> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
> (ArffFileStream -f test.arff) -f 1
> > 2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask
> (LocalDoTask.java:80) - Successfully instantiating
> com.yahoo.labs.samoa.tasks.PrequentialEvaluation
> > Exception in thread "main" java.lang.NumberFormatException: For input
> string: "Background"
> > at
> sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250)
> > at java.lang.Double.valueOf(Double.java:504)
> > at
> com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105)
> > at
> com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77)
> > at
> com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46)
> > at
> com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70)
> > at
> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197)
> > at
> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170)
> > at
> com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161)
> > at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87)
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
>

Re: [jira] [Created] (SAMOA-26) VHT throws NumberFormatException on class attribute

Posted by Albert Bifet <ab...@waikato.ac.nz>.
It seems that the problem is with the Dir attribute. Without this
attribute it's working fine. It contains non standard characters. We
should check how this attribute is parsed.

On Fri, Apr 24, 2015 at 4:42 PM, Simon Dugas (JIRA) <ji...@apache.org> wrote:
> Simon Dugas created SAMOA-26:
> --------------------------------
>
>              Summary: VHT throws NumberFormatException on class attribute
>                  Key: SAMOA-26
>                  URL: https://issues.apache.org/jira/browse/SAMOA-26
>              Project: SAMOA
>           Issue Type: Bug
>           Components: SAMOA-Local
>          Environment: MAC OSX 10.10.3
> java version "1.7.0_71"
> Java(TM) SE Runtime Environment (build 1.7.0_71-b14)
> Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode)
>             Reporter: Simon Dugas
>
>
> I'm trying to debug the following error, PrequentialEvaluation with VHT (classification) throws a NumberFormatException for the class attribute. Why is it trying to parse the class attribute as an integer? I can't find a format error in my ARFF file. It was created with weka.core.converters.CSVLoader. Other datasets (nominal only) work fine with my install of SAMOA. This configuration runs fine in MOA.
>
> Command Line Argument
>
> bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar "PrequentialEvaluation -l com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s (ArffFileStream -f test.arff) -f 1"
>
> ARFF File
>
> @relation test.txt
>
> @attribute Dur numeric
> @attribute Proto {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt}
> @attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'}
> @attribute State {CON,PA_PA,PA_FRA, ...}
> @attribute sTos numeric
> @attribute dTos numeric
> @attribute TotPkts numeric
> @attribute TotBytes numeric
> @attribute SrcBytes numeric
> @attribute class {Background,Normal,Botnet}
>
> @data
>
> 1065.731934,udp,' <->',CON,0,0,2,252,145,Background
> 1471.787109,udp,' <->',CON,0,0,2,252,145,Background
> ...
>
> Error Output
>
> Command line string = PrequentialEvaluation -l com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s (ArffFileStream -f test.arff) -f 1
> 2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask (LocalDoTask.java:80) - Successfully instantiating com.yahoo.labs.samoa.tasks.PrequentialEvaluation
> Exception in thread "main" java.lang.NumberFormatException: For input string: "Background"
> at sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250)
> at java.lang.Double.valueOf(Double.java:504)
> at com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105)
> at com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77)
> at com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182)
> at com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183)
> at com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145)
> at com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118)
> at com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46)
> at com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70)
> at com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197)
> at com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170)
> at com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161)
> at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87)
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)