Posted to issues@arrow.apache.org by "Jung Kim (Jira)" <ji...@apache.org> on 2022/05/31 18:42:00 UTC
[jira] [Created] (ARROW-16698) Read First N rows of feather file through Python
Jung Kim created ARROW-16698:
--------------------------------
Summary: Read First N rows of feather file through Python
Key: ARROW-16698
URL: https://issues.apache.org/jira/browse/ARROW-16698
Project: Apache Arrow
Issue Type: New Feature
Components: Python
Reporter: Jung Kim
It would be helpful to be able to read only the first N rows of a Feather file.
For example, [read_feather|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_feather.html] or [read_table|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_table.html] could take an "nrows" argument that behaves like the "nrows" argument of [pd.read_csv|https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html].
My particular use case is developing and testing my data jobs. Say I have saved one or more large data files in Feather format. During development, I don't want to read all rows, just the first few, to make sure my code runs.
I suppose there are many other use-cases.
Reference: https://github.com/wesm/feather/issues/158
--
This message was sent by Atlassian Jira
(v8.20.7#820007)