Posted to dev@apex.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/11/29 05:47:00 UTC

[jira] [Commented] (APEXMALHAR-2550) NycTaxiDataReader throws exception when encountering lines with unrecognized format in the NYC taxi example

    [ https://issues.apache.org/jira/browse/APEXMALHAR-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16270189#comment-16270189 ] 

ASF GitHub Bot commented on APEXMALHAR-2550:
--------------------------------------------

davidyan74 opened a new pull request #680: APEXMALHAR-2550 Made NycTaxiDataReader and NycTaxiCsvParser more resilient to data with bad format
URL: https://github.com/apache/apex-malhar/pull/680
   
   @tweise Please take a look.



> NycTaxiDataReader throws exception when encountering lines with unrecognized format in the NYC taxi example
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: APEXMALHAR-2550
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2550
>             Project: Apache Apex Malhar
>          Issue Type: Bug
>            Reporter: David Yan
>            Assignee: David Yan
>
> 17/11/28 16:21:42 ERROR engine.StreamingContainer: Operator set [OperatorDeployInfo[id=1,name=NycTaxiDataReader,type=INPUT,checkpoint={ffffffffffffffff, 0, 0},inputs=[],outputs=[OperatorDeployInfo.OutputDeployInfo[portName=output,streamId=input_to_parser,bufferServer=localhost]]]] stopped running due to an exception.
> java.lang.ArrayIndexOutOfBoundsException: 1
> 	at org.apache.apex.examples.nyctaxi.NycTaxiDataReader.readEntity(NycTaxiDataReader.java:58)
> 	at org.apache.apex.examples.nyctaxi.NycTaxiDataReader.readEntity(NycTaxiDataReader.java:34)
> 	at com.datatorrent.lib.io.fs.AbstractFileInputOperator.emitTuples(AbstractFileInputOperator.java:684)
> 	at com.datatorrent.stram.engine.InputNode.run(InputNode.java:124)
> and
> ask=0,partitionKeys=<null>]],outputs=[OperatorDeployInfo.OutputDeployInfo[portName=output,streamId=parser_to_extractor,bufferServer=localhost]]]] stopped running due to an exception.
> java.lang.ArrayIndexOutOfBoundsException: 18
> 	at org.apache.apex.examples.nyctaxi.NycTaxiCsvParser$1.process(NycTaxiCsvParser.java:49)
> 	at org.apache.apex.examples.nyctaxi.NycTaxiCsvParser$1.process(NycTaxiCsvParser.java:39)
> 	at com.datatorrent.api.DefaultInputPort.put(DefaultInputPort.java:79)
> 	at com.datatorrent.stram.stream.BufferServerSubscriber$BufferReservoir.sweep(BufferServerSubscriber.java:288)
> 	at com.datatorrent.stram.engine.GenericNode.run(GenericNode.java:269)
> 	at com.datatorrent.stram.engine.StreamingContainer$2.run(StreamingContainer.java:1428)
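
Both traces above end in ArrayIndexOutOfBoundsException, which is what Java throws when code reads an array index that a short or malformed line never produced. The following is a minimal sketch of one way to guard against that, assuming comma-separated input. The class LenientCsvLine and the method splitChecked are hypothetical names used for illustration only; this is not the change made in pull request #680, and the minimum field count is whatever the caller needs.

```java
import java.util.Optional;

public class LenientCsvLine {

    /**
     * Hypothetical helper: splits a comma-separated line and returns the
     * fields only if there are at least minFields of them. A null or short
     * line yields Optional.empty() instead of a later
     * ArrayIndexOutOfBoundsException at the point of use.
     */
    public static Optional<String[]> splitChecked(String line, int minFields) {
        if (line == null) {
            return Optional.empty();
        }
        // A limit of -1 keeps trailing empty fields, so the field count
        // is not understated for lines that end in a comma.
        String[] fields = line.split(",", -1);
        if (fields.length < minFields) {
            return Optional.empty();
        }
        return Optional.of(fields);
    }

    public static void main(String[] args) {
        String[] lines = {"a,b,c", "garbage", "x,y,z,"};
        int skipped = 0;
        for (String line : lines) {
            Optional<String[]> fields = splitChecked(line, 3);
            if (fields.isPresent()) {
                // Index 2 is safe here because the length was checked.
                System.out.println("third field: " + fields.get()[2]);
            } else {
                // Skip (and count) the malformed line instead of failing.
                skipped++;
            }
        }
        System.out.println("skipped " + skipped + " malformed line(s)");
    }
}
```

In an operator, the empty case could be logged or counted and the line dropped, so that one malformed record does not stop the container.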


