Posted to issues@flink.apache.org by GitBox <gi...@apache.org> on 2019/01/17 12:44:26 UTC

[GitHub] StephanEwen commented on issue #7506: [FLINK-11347] Optimize the ParquetAvroWriters factory

URL: https://github.com/apache/flink/pull/7506#issuecomment-455159341
 
 
   The schema must be serializable, which is why we convert it to a string and parse it back.
   The schema is part of the factory's closure, and the factory is part of the user code that is shipped for distributed execution, so everything it captures has to be serializable.
   
   The parsing also happens just once when the writer is created, so my assumption is that the cost is acceptable.
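
To illustrate the point about the closure, here is a minimal sketch of the pattern, not Flink's actual code. `FakeSchema`, `WriterFactory`, `createWriterSchema` and `roundTrip` are hypothetical names invented for this example. `FakeSchema` stands in for a schema type that is assumed not to implement `java.io.Serializable`. The factory captures only the string form and parses it once, when the writer is created.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;

class SchemaClosureSketch {

    // Hypothetical stand-in for a schema class that is NOT Serializable.
    static final class FakeSchema {
        private final String json;

        private FakeSchema(String json) {
            this.json = json;
        }

        // Stand-in for parsing; a real parser would validate the input.
        static FakeSchema parse(String json) {
            return new FakeSchema(json);
        }

        @Override
        public String toString() {
            return json;
        }
    }

    // Serializable factory: it holds only the String form of the schema,
    // so it can be shipped with the user code.
    static final class WriterFactory implements Serializable {
        private static final long serialVersionUID = 1L;

        private final String schemaString;

        WriterFactory(FakeSchema schema) {
            this.schemaString = schema.toString();
        }

        // Called when a writer is created; the parse happens here, once per writer.
        FakeSchema createWriterSchema() {
            return FakeSchema.parse(schemaString);
        }
    }

    // Simulates shipping the factory: Java-serialize it and read it back.
    static WriterFactory roundTrip(WriterFactory factory)
            throws IOException, ClassNotFoundException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(factory);
        }
        try (ObjectInputStream in =
                new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
            return (WriterFactory) in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        FakeSchema schema = FakeSchema.parse("{\"type\":\"string\"}");
        WriterFactory shipped = roundTrip(new WriterFactory(schema));
        // The schema is rebuilt from its string form on the receiving side.
        System.out.println(shipped.createWriterSchema());
    }
}
```

If `WriterFactory` held the `FakeSchema` object itself as a field, `writeObject` would fail with a `NotSerializableException`, which is the situation the string conversion avoids.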
   
   I would close this PR, because the approach it proposes does not work given that requirement.
   Please reopen the issue if you disagree and would like to pursue this further.
