Posted to dev@avro.apache.org by "David McIntosh (JIRA)" <ji...@apache.org> on 2013/10/24 22:36:03 UTC

[jira] [Updated] (AVRO-1332) Improve C# DatumReader performance

     [ https://issues.apache.org/jira/browse/AVRO-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David McIntosh updated AVRO-1332:
---------------------------------

    Attachment: AVRO-1332-missing-files-plus-defaults-patch.patch
                AVRO-1332-defaults-patch-only.patch

I've noticed two problems with this patch. First, only the modified files from the patch were committed to SVN; the newly added files were missed. Second, the patch introduced a bug in default value handling. I adjusted the unit tests to trigger it and have made a fix.

The "defaults-patch-only" patch includes only the new fix. The "missing-files-plus-defaults-patch" patch includes the fix plus the files from the "-4" patch that didn't make it into SVN.

If needed, I can submit the defaults fix as a separate issue.

> Improve C# DatumReader performance
> ----------------------------------
>
>                 Key: AVRO-1332
>                 URL: https://issues.apache.org/jira/browse/AVRO-1332
>             Project: Avro
>          Issue Type: Improvement
>          Components: csharp
>    Affects Versions: 1.7.5
>            Reporter: David McIntosh
>            Assignee: David McIntosh
>            Priority: Minor
>              Labels: performance
>             Fix For: 1.7.6
>
>         Attachments: AVRO-1332-2.patch, AVRO-1332-3.patch, AVRO-1332-4.patch, AVRO-1332-defaults-patch-only.patch, AVRO-1332-missing-files-plus-defaults-patch.patch, AVRO-1332.patch
>
>
> The current implementations of the C# datum readers resolve the reader and writer schemas on every call to Read. In my tests this made them perform poorly when reading a large number of records (slower than parsing the same data from delimited text files). It would be more efficient if the reader resolved the schemas only once.
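
For illustration only, the idea in the description (resolve once, then reuse the result for every record) can be sketched as below. This is a toy model in Python, not the Avro C# code or the attached patches: the schema representation (a list of field names for the writer, a list of (name, default) pairs for the reader) and the names resolve, NaiveReader, and CachingReader are all hypothetical. It also shows where a reader-side default is applied when the writer lacks a field.

```python
from typing import Any, Dict, List, Optional, Sequence, Tuple

# Sentinel meaning "the reader schema declares no default for this field".
NO_DEFAULT = object()

# One plan step: (reader field name, index into the written row or None, default).
Step = Tuple[str, Optional[int], Any]


def resolve(writer_fields: Sequence[str],
            reader_fields: Sequence[Tuple[str, Any]]) -> List[Step]:
    """Match each reader field to a writer position, or fall back to its default."""
    positions = {name: i for i, name in enumerate(writer_fields)}
    plan: List[Step] = []
    for name, default in reader_fields:
        if name in positions:
            plan.append((name, positions[name], None))
        elif default is not NO_DEFAULT:
            plan.append((name, None, default))
        else:
            raise ValueError(f"field {name!r} is missing and has no default")
    return plan


def apply_plan(plan: Sequence[Step], row: Sequence[Any]) -> Dict[str, Any]:
    """Build a reader-side record from one written row using a resolved plan."""
    return {name: (row[idx] if idx is not None else default)
            for name, idx, default in plan}


class NaiveReader:
    """Resolves the two schemas again on every read (the slow pattern)."""

    def __init__(self, writer_fields, reader_fields):
        self._writer, self._reader = writer_fields, reader_fields
        self.resolutions = 0

    def read(self, row):
        self.resolutions += 1
        return apply_plan(resolve(self._writer, self._reader), row)


class CachingReader:
    """Resolves the two schemas once, in the constructor, and reuses the plan."""

    def __init__(self, writer_fields, reader_fields):
        self._plan = resolve(writer_fields, reader_fields)
        self.resolutions = 1

    def read(self, row):
        return apply_plan(self._plan, row)
```

Both readers produce the same records; the difference is that the caching variant does the name matching and default lookup once, so the per-record cost is only the copy described by the plan.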



--
This message was sent by Atlassian JIRA
(v6.1#6144)