Posted to dev@avro.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2022/01/05 11:38:00 UTC

[jira] [Commented] (AVRO-3216) Rust: failure reading multiple use of named schemas in file

    [ https://issues.apache.org/jira/browse/AVRO-3216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17469231#comment-17469231 ] 

ASF subversion and git services commented on AVRO-3216:
-------------------------------------------------------

Commit b76a437b970889255703ff48f7ee5981dfbcc17a in avro's branch refs/heads/master from Martin Grigorov
[ https://gitbox.apache.org/repos/asf?p=avro.git;h=b76a437 ]

AVRO-3216 Reuse records' schema by name (#1345)

* AVRO-3197 Fallback to the 'type' when the logical type does not support the type

Signed-off-by: Martin Tzvetanov Grigorov <mg...@apache.org>

* AVRO-3197 Allow only when the "type" is "string"

* AVRO-3197 Handle problematic complex type for date/time logical types

Read the complex type recursively. It seems Avro Java may produce {"type": {"type": "string", "avro.java.string": "String"}, "logicalType": "timestamp-millis"}, i.e. logicalType is on the same level as the outer "type"

Signed-off-by: Martin Tzvetanov Grigorov <mg...@apache.org>

* AVRO-3216 Allow to reuse record's schema by name

* AVRO-3216 Extend the test case to do more assertions

* AVRO-3216 Print err with Debug

> Rust: failure reading multiple use of named schemas in file
> -----------------------------------------------------------
>
>                 Key: AVRO-3216
>                 URL: https://issues.apache.org/jira/browse/AVRO-3216
>             Project: Apache Avro
>          Issue Type: Bug
>            Reporter: Guðjón
>            Assignee: Martin Tzvetanov Grigorov
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.12.0
>
>         Attachments: weather-complex.avro
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> I have an Avro file that contains records of a specific schema. The file was written using the Java implementation, with the generated classes.
> The schema uses named schemas in such a way that two or more fields share the same named schema. A good example is this weather schema:
> {code:json}
> {
> 	"doc": "A weather reading.",
> 	"name": "Weather",
> 	"namespace": "test",
> 	"type": "record",
> 	"fields": [
> 		{
> 			"name": "station",
> 			"type": "string"
> 		},
> 		{
> 			"name": "max_temp",
> 			"type": "Temp"
> 		},
> 		{
> 			"name": "min_temp",
> 			"type": "Temp"
> 		}
> 	]
> }
> {code}
> with the named schema Temp being 
> {code:json}
> {
> 	"doc": "A temperature reading.",
> 	"name": "Temp",
> 	"namespace": "test",
> 	"type": "record",
> 	"fields": [
> 		{
> 			"name": "temp",
> 			"type": "long"
> 		}
> 	]
> }
> {code}
> The schema written to the file inlines the Temp schema for the first field (max_temp) but references it by name for the second field (min_temp). Both the Java and Python implementations read the file correctly, but the avro-rs crate fails on the second field with the message:
> {noformat}
> panicked at 'Failure reading file: Unknown primitive type: Temp'
> {noformat}
> The attached file weather-complex.avro was written using the Java library with generated classes and contains one record.
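
The behaviour described in the report (a named schema defined inline on first use and referenced by bare name afterwards) can be illustrated with a minimal, standard-library-only sketch. This is not the avro-rs implementation or its API: the `Schema` enum, the `resolve` function and the `weather_schema` helper are hypothetical names invented for illustration, and namespaces (such as "test") are ignored for brevity. The idea shown is simply that a parser remembers each record definition by name and substitutes it when the name appears again, instead of treating the name as an unknown primitive type.

```rust
use std::collections::HashMap;

/// Toy schema model (hypothetical; not the avro-rs `Schema` type).
#[derive(Debug, Clone, PartialEq)]
enum Schema {
    Long,
    Str,
    Record { name: String, fields: Vec<(String, Schema)> },
    /// A bare type name such as "Temp" that should refer to an earlier definition.
    Ref(String),
}

/// Replace every `Ref` with the record previously registered under that name.
/// Records are registered only after their fields are resolved, so this sketch
/// does not handle recursive (self-referencing) records.
fn resolve(schema: &Schema, names: &mut HashMap<String, Schema>) -> Result<Schema, String> {
    match schema {
        Schema::Long | Schema::Str => Ok(schema.clone()),
        Schema::Record { name, fields } => {
            let mut resolved = Vec::with_capacity(fields.len());
            for (field_name, field_schema) in fields {
                resolved.push((field_name.clone(), resolve(field_schema, names)?));
            }
            let record = Schema::Record { name: name.clone(), fields: resolved };
            // Remember the definition so later fields can refer to it by name.
            names.insert(name.clone(), record.clone());
            Ok(record)
        }
        Schema::Ref(name) => names
            .get(name)
            .cloned()
            .ok_or_else(|| format!("Unknown type: {}", name)),
    }
}

/// The Weather schema as it appears in the file: `Temp` is inlined for
/// `max_temp` and referenced by name for `min_temp`.
fn weather_schema() -> Schema {
    let temp = Schema::Record {
        name: "Temp".to_string(),
        fields: vec![("temp".to_string(), Schema::Long)],
    };
    Schema::Record {
        name: "Weather".to_string(),
        fields: vec![
            ("station".to_string(), Schema::Str),
            ("max_temp".to_string(), temp),
            ("min_temp".to_string(), Schema::Ref("Temp".to_string())),
        ],
    }
}
```

Calling `resolve(&weather_schema(), &mut HashMap::new())` in this sketch yields a record whose `min_temp` field carries the same `Temp` record as `max_temp`, while resolving a `Ref` with an empty name table returns an error comparable to the one in the report.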



--
This message was sent by Atlassian Jira
(v8.20.1#820001)