You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@arrow.apache.org by "Jung Kim (Jira)" <ji...@apache.org> on 2022/05/31 18:42:00 UTC

[jira] [Created] (ARROW-16698) Read First N rows of feather file through Python

Jung Kim created ARROW-16698:
--------------------------------

             Summary: Read First N rows of feather file through Python 
                 Key: ARROW-16698
                 URL: https://issues.apache.org/jira/browse/ARROW-16698
             Project: Apache Arrow
          Issue Type: New Feature
          Components: Python
            Reporter: Jung Kim


It would be helpfully to be able to read first N rows of feather file.
e.g.  [read_feather|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_feather.html] or [read_table|https://arrow.apache.org/docs/python/generated/pyarrow.feather.read_table.html] could have "nrows" argument that behaves like "nrows" argument in [pd.read_csv|https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html] 

my particular use case is to development and test my data jobs. Say I saved a large data file(s) in feather format. For development, I don't want to read all rows but just first few rows of data to make sure my code runs.

I suppose there are many other use-cases.

Reference: https://github.com/wesm/feather/issues/158



--
This message was sent by Atlassian Jira
(v8.20.7#820007)