You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Demeter Sztanko (JIRA)" <ji...@apache.org> on 2013/10/18 12:12:42 UTC

[jira] [Updated] (PIG-3413) JsonLoader fails the pig job in case of malformed json input

     [ https://issues.apache.org/jira/browse/PIG-3413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Demeter Sztanko updated PIG-3413:
---------------------------------

    Labels: json loader pig  (was: )
    Status: Patch Available  (was: Open)

> JsonLoader fails the pig job in case of malformed json input
> ------------------------------------------------------------
>
>                 Key: PIG-3413
>                 URL: https://issues.apache.org/jira/browse/PIG-3413
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.11.1
>            Reporter: Demeter Sztanko
>            Priority: Minor
>              Labels: json, pig, loader
>             Fix For: 0.11.2
>
>   Original Estimate: 2h
>  Remaining Estimate: 2h
>
> The following pig script: 
> b = load 'bad.input' using JsonLoader('a0: chararray');
> dump b;
> runs well for the input:
> {"a": "good"}
> and fails the whole job for the following input (mallformed json)
> {"a", bad}
> I was expecting that it will just skip the line and go further.
> Getting this error:
> org.codehaus.jackson.JsonParseException: Unexpected character ('g' (code 103)): was expecting comma to separate OBJECT entries
>  at [Source: java.io.ByteArrayInputStream@4610c772; line: 1, column: 4100]
> 	at org.codehaus.jackson.JsonParser._constructError(JsonParser.java:1433)
> 	at org.codehaus.jackson.impl.JsonParserMinimalBase._reportError(JsonParserMinimalBase.java:521)
> 	at org.codehaus.jackson.impl.JsonParserMinimalBase._reportUnexpectedChar(JsonParserMinimalBase.java:442)
> 	at org.codehaus.jackson.impl.Utf8StreamParser.nextToken(Utf8StreamParser.java:482)
> 	at org.apache.pig.builtin.JsonLoader.readField(JsonLoader.java:173)
> 	at org.apache.pig.builtin.JsonLoader.getNext(JsonLoader.java:157)
> 	at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader.nextKeyValue(PigRecordReader.java:211)
> 	at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:540)
> 	at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
> 	at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:143)
> 	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:771)
> 	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:375)
> 	at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at javax.security.auth.Subject.doAs(Subject.java:415)
> 	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1132)
> 	at org.apache.hadoop.mapred.Child.main(Child.java:249)



--
This message was sent by Atlassian JIRA
(v6.1#6144)