Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/04/03 08:07:54 UTC

[GitHub] [arrow-datafusion] Dandandan commented on issue #2109: Almost 100x slowdown on 0.7.0 with CSV file due to parsing entire file to infer schema

Dandandan commented on issue #2109:
URL: https://github.com/apache/arrow-datafusion/issues/2109#issuecomment-1086799631


   Great find!👍
   
   Another thing that might be useful in the future is to optimize inferring the types.
   
   It makes sense that it is slower than parsing the CSV, given that we don't know the types, but it sounds like it shouldn't be ~100x as slow.
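   
   To illustrate why inference costs more per cell than typed parsing, and why capping the number of scanned records bounds that cost, here is a minimal sketch in plain Python. It is not DataFusion's implementation; `infer_cell_type`, `infer_schema` and `max_records` are hypothetical names used only for this example, and the int/float/str type lattice is an assumption.

```python
import csv
import io

# Widening order assumed for this sketch: int -> float -> str.
_RANK = {"int": 0, "float": 1, "str": 2}


def infer_cell_type(value):
    """Try candidate types from narrowest to widest for one cell.

    Each cell may be parsed more than once, which is why inference
    does more work than parsing with a known schema.
    """
    for name, parse in (("int", int), ("float", float)):
        try:
            parse(value)
            return name
        except ValueError:
            pass
    return "str"


def infer_schema(text, max_records=None):
    """Infer a column -> type mapping (hypothetical helper).

    With max_records=None the whole input is scanned; otherwise the
    scan stops after that many data rows.
    """
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    types = ["int"] * len(header)
    for i, row in enumerate(reader):
        if max_records is not None and i >= max_records:
            break
        # Ignore extra trailing cells that have no header.
        for col, value in enumerate(row[: len(header)]):
            seen = infer_cell_type(value)
            if _RANK[seen] > _RANK[types[col]]:
                types[col] = seen  # widen, never narrow
    return dict(zip(header, types))


data = "a,b,c\n1,2.5,x\n2,3,y\n3.5,4,z\n"
full = infer_schema(data)
sampled = infer_schema(data, max_records=2)
```

   Scanning only the first two rows misses the `3.5` in column `a`, so the sampled schema reports it as `int` while the full scan widens it to `float`. That is the trade-off of a record limit: bounded inference cost in exchange for a schema that may be too narrow.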


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org