You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Sai Krishna Chaitanya Chaganti (Jira)" <ji...@apache.org> on 2021/09/10 11:56:00 UTC

[jira] [Created] (ARROW-13972) [python] read csv with different number of columns per row

Sai Krishna Chaitanya Chaganti created ARROW-13972:
------------------------------------------------------

             Summary: [python] read csv with different number of columns per row
                 Key: ARROW-13972
                 URL: https://issues.apache.org/jira/browse/ARROW-13972
             Project: Apache Arrow
          Issue Type: New Feature
          Components: Python
    Affects Versions: 5.0.0
            Reporter: Sai Krishna Chaitanya Chaganti


When tried to read CSV data with multiple columns per row, arrows fails with an error message like below. When tried to read the CSV using other libs such as spark and pandas, they are filling up the remaining columns with null values. Is it possible to introduce such feature in pyarrow, CSV may or may not contain headers.
{noformat}
Expected 952 columns, got 620:{noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)