You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@arrow.apache.org by "Tim Lantz (Jira)" <ji...@apache.org> on 2020/01/22 17:13:00 UTC

[jira] [Created] (ARROW-7656) [Python] csv.ConvertOptions Documentation Is Unclear Around Disabling Type Inference

Tim Lantz created ARROW-7656:
--------------------------------

             Summary: [Python] csv.ConvertOptions Documentation Is Unclear Around Disabling Type Inference
                 Key: ARROW-7656
                 URL: https://issues.apache.org/jira/browse/ARROW-7656
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.15.1
         Environment: Documentation, N/A.
            Reporter: Tim Lantz


High level description:
 * The documentation [here|[https://arrow.apache.org/docs/python/generated/pyarrow.csv.ConvertOptions.html#pyarrow.csv.ConvertOptions]] says that setting column_types disables type inference.
 * Under the hood I can see why it is clear you need to also set ReadOptions.column_names to support all current use cases however it is unclear to new users of the library when you read the docs. Especially since you can supply a Schema object to column_types in the Python bindings.
 * Suggested change: update the csv.ConvertOptions to note that you also must set csv.ReadOptions.column_names in order to disable type inference.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)