You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@avro.apache.org by "Donatello (Jira)" <ji...@apache.org> on 2021/11/30 14:36:00 UTC

[jira] [Commented] (AVRO-2890) java JSON decoder does not respect default values for fields

    [ https://issues.apache.org/jira/browse/AVRO-2890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17451166#comment-17451166 ] 

Donatello commented on AVRO-2890:
---------------------------------

it seems to also affect version 1.9.2 and 1.11.0, at least, but probably all
so there is no workaround today.

> java JSON decoder does not respect default values for fields
> ------------------------------------------------------------
>
>                 Key: AVRO-2890
>                 URL: https://issues.apache.org/jira/browse/AVRO-2890
>             Project: Apache Avro
>          Issue Type: Bug
>          Components: java
>    Affects Versions: 1.10.0
>            Reporter: Sharath Avadoot Gururaj
>            Priority: Major
>
> Consider the following schema:
> {code:java}
> {"namespace": "example.avro",
>  "type": "record",
>  "name": "Nic",
>  "fields": [
>    {"name" : "ip", "type" : "string", "default" : ""}
>  ]
> }
> and the following empty json{code}
> {code:java}
> {}{code}
> I expect that parsing is successful with this code
> {code:java}
>         public void jsonToAvro() throws Exception {
>             JsonParser parser;
>             Schema schema = new Schema.Parser().parse(readClasspathFile(s.schema));
>             Decoder decoder;
>             JsonFactory factory = new JsonFactory();
>             if(s.linesep) {
>                 parser = factory.createParser(Files.newInputStream(Paths.get(s.input)));
>                 decoder = DecoderFactory.get().jsonDecoder(schema, Files.newInputStream(Paths.get(s.input)));
>             } else {
>                 parser = factory.createParser(Files.readAllBytes(Paths.get(s.input)));
>                 decoder = DecoderFactory.get().jsonDecoder(schema, new String(Files.readAllBytes(Paths.get(s.input))));
>             }
>             parser.configure(JsonParser.Feature.INCLUDE_SOURCE_IN_LOCATION, true);
> //        Decoder decoder = new ExtendedJsonDecoder(schema, parser, true );
>             DataFileWriter<GenericRecord> writer;
>             CountingOutputStream output = new CountingOutputStream(Files.newOutputStream(Paths.get(s.output)));
>             DatumReader<GenericRecord> reader = new GenericDatumReader<>(schema);
>             writer = new DataFileWriter<>(new GenericDatumWriter<>());
>             writer.create(schema, output);
> //        Decoder decoder = new ExtendedJsonDecoder(schema, parser, true );
>             GenericRecord datum = null;
>             while (true) {
>                 try {
>                     datum = reader.read(datum, decoder);
>                 } catch (EOFException eofe) {
>                     break;
>                 }
>                 writer.append(datum);
>             }
>             writer.flush();
>     }
> {code}
> But I get the following error
>  
> {noformat}
> org.apache.avro.AvroTypeException: Expected field name not found: ip
> 	at org.apache.avro.io.JsonDecoder.doAction(JsonDecoder.java:473) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.io.parsing.Parser.advance(Parser.java:86) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.io.JsonDecoder.advance(JsonDecoder.java:132) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.io.JsonDecoder.readString(JsonDecoder.java:212) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.io.JsonDecoder.readString(JsonDecoder.java:207) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.io.ResolvingDecoder.readString(ResolvingDecoder.java:208) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.readString(GenericDatumReader.java:469) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.readString(GenericDatumReader.java:459) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.readWithoutConversion(GenericDatumReader.java:191) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:160) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.readField(GenericDatumReader.java:259) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.readRecord(GenericDatumReader.java:247) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.readWithoutConversion(GenericDatumReader.java:179) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:160) ~[avro-1.10.0.jar:1.10.0]
> 	at org.apache.avro.generic.GenericDatumReader.read(GenericDatumReader.java:153) ~[avro-1.10.0.jar:1.10.0]
> 	at sha.Deser.jsonToAvro(Deser.java:101) ~[classes/:?]
> 	at sha.Deser.go(Deser.java:70) ~[classes/:?]
> 	at sha.Deser.main(Deser.java:43) [classes/:?]
> {noformat}
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)