You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Amol Umbarkar (Jira)" <ji...@apache.org> on 2020/05/18 09:21:00 UTC
[jira] [Created] (ARROW-8845) Selective compression on the wire
Amol Umbarkar created ARROW-8845:
------------------------------------
Summary: Selective compression on the wire
Key: ARROW-8845
URL: https://issues.apache.org/jira/browse/ARROW-8845
Project: Apache Arrow
Issue Type: Improvement
Components: FlightRPC
Reporter: Amol Umbarkar
Dask seems to be selectively do compression if it is found to be useful. They sort of pick 10kb of sample upfront to calculate compression and if the results are good then the whole batch is compressed. This seems to save de-compression effort on receiver side.
Please take a look at [https://blog.dask.org/2016/04/14/dask-distributed-optimizing-protocol#problem-3-unwanted-compression]
Thought this could be relevant to arrow batch transfers as well.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)