Posted to jira@arrow.apache.org by "Sai Krishna Chaitanya Chaganti (Jira)" <ji...@apache.org> on 2021/09/10 11:56:00 UTC
[jira] [Created] (ARROW-13972) [python] read csv with different number of columns per row
Sai Krishna Chaitanya Chaganti created ARROW-13972:
------------------------------------------------------
Summary: [python] read csv with different number of columns per row
Key: ARROW-13972
URL: https://issues.apache.org/jira/browse/ARROW-13972
Project: Apache Arrow
Issue Type: New Feature
Components: Python
Affects Versions: 5.0.0
Reporter: Sai Krishna Chaitanya Chaganti
When I try to read CSV data in which rows have different numbers of columns, Arrow fails with an error message like the one below. Other libraries such as Spark and pandas can read the same CSV and fill the missing columns with null values. Would it be possible to introduce such a feature in pyarrow? The CSV may or may not contain a header row.
{noformat}
Expected 952 columns, got 620:
{noformat}
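Until pyarrow offers something like this, one possible workaround is to pad the short rows before the file reaches the Arrow reader. The following is only a minimal sketch using the Python standard library; the helper name `pad_csv_rows` and the sample data are hypothetical and are not part of pyarrow. It assumes the file fits in memory and that short rows are missing their trailing columns.

```python
import csv
import io


def pad_csv_rows(text, fill=""):
    """Return CSV text in which every row has as many fields as the widest row.

    Hypothetical helper (not a pyarrow API): short rows are padded on the
    right with `fill`, so a strict CSV reader sees a rectangular table.
    """
    rows = list(csv.reader(io.StringIO(text)))
    # Width of the widest row; 0 if the input has no rows at all.
    width = max((len(row) for row in rows), default=0)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for row in rows:
        writer.writerow(row + [fill] * (width - len(row)))
    return out.getvalue()


# Made-up sample: a header with three columns and two short rows.
ragged = "a,b,c\n1,2\n3\n"
padded = pad_csv_rows(ragged)
print(padded)
```

The padded text can then be encoded and wrapped in an `io.BytesIO` object and handed to `pyarrow.csv.read_csv`. Empty fields in non-string columns should be read as nulls; for string columns, passing `ConvertOptions(strings_can_be_null=True)` is probably needed to get nulls instead of empty strings.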
--
This message was sent by Atlassian Jira
(v8.3.4#803005)